/Wikipedia-Search-Engine

Given a query, searches the Wikipedia Corpus (46 GB) and give the titles of top ten retrieved documents, in ranked order for phrase queries or field based queries using multi-level indexes, tf-idf scoring, cosine similarity, threading, index compression.

Primary LanguagePython

No issues in this repository yet.