/IBD1

The goal of this project to implement a naive search engine with MapReduce on Apache Hadoop.

Primary LanguageJava

Assignment №1. MapReduce. Simple Text Indexer

User guide

To run Indexer Engine:

$hadoop jar <jar_name>.jar Indexer <path to input directory in HDFS> <path to output directory in HDFS> 
Example: $hadoop jar project3.jar Indexer /EnWikiSmall /indexer_output

To run Query Engine:

$hadoop jar <jar_name>.jar Query <path to output directory of indexer in HDFS> <query text> <number of most relevant docs> 
Example: $hadoop jar project3.jar Query /indexer_output "Big data technologies" 5