How do we easily access documents with similar content?

Clustering: group the documents so that those with similar content fall into the same cluster.
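
As a rough illustration of the idea, the sketch below groups a few short texts by word overlap. It is only one simple way to cluster, not a method the answer above prescribes: it assumes a bag-of-words representation, cosine similarity, and a single-pass "leader" scheme in which a document joins the first cluster whose first member is similar enough. The function names, the sample documents, and the 0.3 threshold are all hypothetical choices made for this example.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase the text and count its word tokens."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def cluster_documents(docs, threshold=0.3):
    """Single-pass clustering (threshold is an assumed, tunable value).

    A document joins the first cluster whose leader (its first member)
    has cosine similarity >= threshold; otherwise it starts a new cluster.
    Returns a list of clusters, each a list of indices into docs.
    """
    vectors = [bag_of_words(d) for d in docs]
    clusters = []
    for i, vec in enumerate(vectors):
        for cluster in clusters:
            if cosine(vec, vectors[cluster[0]]) >= threshold:
                cluster.append(i)
                break
        else:
            # No existing cluster was close enough: start a new one.
            clusters.append([i])
    return clusters


# Hypothetical sample documents, made up for this sketch.
docs = [
    "the cat sat on the mat",
    "stock prices fell sharply today",
    "a cat lay on the mat",
    "stock markets and prices fell",
]
clusters = cluster_documents(docs)
print(clusters)  # prints [[0, 2], [1, 3]]
```

Once the clusters exist, finding documents similar to a given one is a matter of looking up its cluster. In practice a library implementation with TF-IDF weighting and an algorithm such as k-means would likely replace this toy version.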