- Click on following link
- Enter text to search. Eg : "Time Travel"
https://drive.google.com/file/d/1P6hGsJ0Lwrk1OmJexMmp0VXBKdpC-GhF/view?usp=sharing
To Execute Locally :
- Clone the Repo
- Go to directory actor_search_engine
- Execute buildSearchIndex.py or buildSearchIndex.ipynb to create inverted index - idx folder
- python movieApp.py
Output :
-
Serving Flask app "movieApp" (lazy loading)
-
Environment: production
-
Running on http://0.0.0.0:8080/ (Press CTRL+C to quit)
sudo apt-get update
- sudo apt-get install libgl1-mesa-glx libegl1-mesa libxrandr2 libxrandr2 libxss1 libxcursor1 libxcomposite1 libasound2 libxi6 libxtst6
- wget https://repo.anaconda.com/archive/Anaconda3-5.0.1-Linux-x86_64.sh
- sudo reboot
-
sudo apt-get install python3-bs4
-
conda install selenium
-
sudo apt-get install chromium-chromedriver
-
pip install --upgrade pandas
-
pip install metapy
-
pip install pytoml
-
pip install requests
-
pip install flask
- clone the repo
- cd CS410_Project/actor_search_engine/
- python buildSearchIndex.py
nohup python movieApp.py &
Steps
Go to directory actor_search_engine
Link - https://datasets.imdbws.com/
Files :
- name.basics.tsv.gz
- title.akas.tsv.gz
- title.basics.tsv.gz
- title.principals.tsv.gz
- title.ratings.tsv.gz
There are 2 options. Execute either one of them
- prepareMovieActorsFile
- ScrapeMoviesActors.ipynb
- ScrapeMoviesTags.ipynb
- buildCorpus.ipynb
OR
executeAllScripts.sh
Execute buildSearchIndex
Execute "python movieApp.py"