/ir-lab

Primary LanguagePython

document indexing

to run queries on some folder:

python -m document_index test_folder/

Right now, there is stemming support for english so please make files in english

String distance

to run calculation of corelation for string distances on concatenated documents in folder:

python -m string_distance test_folder/

PageRank

to run calculations for page rank:

python -m pagerank 20

Argument to script is number of nodes in strongly connected component