- Run the Hadoop server on localhost.
- Unzip the folder threaded_crawler.zip
- Obtain the jar file from this link: https://app.box.com/s/k37anoksz0bjwg5mnfiqjpyabnagft47
- Use the command 'java threaded_crawler.GUI' to run the program.
- The program will wait for you to enter the path that needs to be indexed. All the files in that path will be indexed and ranked.
- Use the GUI.
Note: The Hadoop server configuration is kept at its defaults, i.e. the server runs on localhost at port 9000.
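
Since the program expects the server on localhost at port 9000, it can help to confirm that something is listening there before launching the GUI. The following is a minimal sketch, not part of threaded_crawler: the class name `HadoopPortCheck`, the method `isPortOpen`, and the 2-second timeout are assumptions made for illustration. It only tests that a TCP connection succeeds; it does not verify that the listener is actually Hadoop.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;

// Hypothetical helper, not shipped with threaded_crawler.
public class HadoopPortCheck {

    // Returns true if a TCP connection to host:port succeeds within timeoutMs.
    static boolean isPortOpen(String host, int port, int timeoutMs) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(host, port), timeoutMs);
            return true;
        } catch (IOException e) {
            // Connection refused, timed out, or host unreachable.
            return false;
        }
    }

    public static void main(String[] args) {
        // Host and port taken from the note above; the timeout is an assumed value.
        boolean up = isPortOpen("localhost", 9000, 2000);
        System.out.println(up
                ? "A server is listening on localhost:9000."
                : "Nothing is listening on localhost:9000; start the Hadoop server first.");
    }
}
```

Compile it with 'javac HadoopPortCheck.java' and run it with 'java HadoopPortCheck' before starting the crawler.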