/IR_search_engine

QMUL cw group w

Primary LanguagePython

IR_search_engine QMUL group w

Download Dataset: https://www.kaggle.com/bittlingmayer/amazonreviews

image

Need to pay attention to the file path. Modify the file path if needed.

Put the file and dataset folder under the same folder and run it

python preprocess.py

Create index and import data to elastic search engine

I found that when I import data many times, elastic would not delete the repeated data. So just run this file one time

After start elastic search engine, run createIndex.py Creat index and import data to elastic search engine.

python creatIndex.py

Empty data

python emptyData.py

Train model

python prediction.py

Begin to search

python search.py

image image

Runing flask

python searchWebsite.py

Open localhost:80 with browser

image