Text Similarity

This function is capable of computing similarity between individual documents or groups of documents provided in the input file. The employed technique involves a bag-of-words approach with subsequent TFIDF transformation and L2 regularization to account for potential differences in text lengths.

Parameters

Output

The function will produce the following files: