/Wiki_Search_Engine

Search Engine for Wikipedia Dump

Primary LanguagePython

Wikipedia Search Engine

To run the search engine, please run the following commands:

index.sh is run as follows ./index.sh <path_to_dump> <path_to_index_folder>. search.sh is run as follows ./search.sh <path_to_index_folder> <path_to_input_query_file> <path_to_output_file>.

Please note that the data dump has not been pushed.

This project relies on the concepts of inverted index creation, hash mapping, creating tries and page ranking algorithms based on TF-IDF.