Project developed during the Information Retrival Course. Objective of the project was to scrape 3 different websites for football clothing and then display the scraped data on our own website, implementing search functions / indexing on said data.
Create a python virtual env, here I am using pyenv:
pyenv virtualenv ir_football_clothing
Make sure it is activated, it should return something similar to this:
pip3 -V
pip 23.2.1 from /home/martin/.pyenv/versions/ir_football_clothing/lib/python3.10/site-packages/pip (python 3.10)
Install python dependencies:
pip3 install -r requirements.txt
Run the scraper:
sh run_scraper.sh
The resulting .json data will be stored in /data/[site_name].json
Run the site:
sh run_ui.sh
For code quality / formatting run:
black .
isort --profile black .