The Search Engine for TREC Standard Format Document Collections

Primary language: Java

Search-Engine

Usage

Download the data

wget http://crystal.exp.sis.pitt.edu:8080/iris/data.rar
rar x data.rar
wget http://crystal.exp.sis.pitt.edu:8080/iris/result_data.rar
rar x result_data.rar

Section1: Read documents from collection files.

Section2: Build an index and retrieve posting lists of tokens from an index.

Section3: Automatically translate topic statements to queries and implement a statistical language model.

Section4: Implement relevance feedback for the language model.
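
To make the indexing and language-model sections more concrete, here is a minimal sketch of an in-memory inverted index with query-likelihood scoring. It is not the code in this repository: the class name `TinyIndex`, the methods `addDocument`, `postingList`, and `score`, and the choice of Dirichlet smoothing with parameter `mu` are all assumptions made for illustration. Relevance feedback is not covered.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only; names and smoothing method are assumptions.
public class TinyIndex {
    // term -> (docId -> term frequency in that document)
    private final Map<String, Map<Integer, Integer>> postings = new HashMap<>();
    // docId -> number of tokens in that document
    private final Map<Integer, Integer> docLengths = new HashMap<>();
    // total number of tokens in the whole collection
    private long collectionLength = 0;

    // Lowercase the text and split on runs of non-alphanumeric characters.
    private static String[] tokenize(String text) {
        return text.toLowerCase().split("[^a-z0-9]+");
    }

    // Add one document's tokens to the posting lists.
    public void addDocument(int docId, String text) {
        int length = 0;
        for (String token : tokenize(text)) {
            if (token.isEmpty()) continue; // split can yield a leading empty token
            postings.computeIfAbsent(token, t -> new TreeMap<>())
                    .merge(docId, 1, Integer::sum);
            length++;
        }
        docLengths.merge(docId, length, Integer::sum);
        collectionLength += length;
    }

    // Return the posting list (docId -> term frequency) for a term.
    public Map<Integer, Integer> postingList(String term) {
        return postings.getOrDefault(term.toLowerCase(),
                Collections.<Integer, Integer>emptyMap());
    }

    // Log query likelihood of the query under the document's language model,
    // smoothed with the collection model (Dirichlet prior, parameter mu).
    public double score(String query, int docId, double mu) {
        int docLength = docLengths.getOrDefault(docId, 0);
        double logLikelihood = 0.0;
        for (String token : tokenize(query)) {
            if (token.isEmpty()) continue;
            Map<Integer, Integer> list = postingList(token);
            long collectionFreq = 0;
            for (int count : list.values()) collectionFreq += count;
            if (collectionFreq == 0) continue; // ignore terms unseen in the collection
            double pCollection = (double) collectionFreq / collectionLength;
            int termFreq = list.getOrDefault(docId, 0);
            logLikelihood += Math.log((termFreq + mu * pCollection) / (docLength + mu));
        }
        return logLikelihood;
    }
}
```

A document that contains the query terms receives a higher (less negative) score than one that does not, because the smoothed estimate adds the document's own term counts on top of the collection-wide probability. A commonly used starting value for `mu` is in the low thousands, but it should be tuned on the collection at hand.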