- Download chromedriver
- Install Anaconda or Miniconda (recommended).
- Create and activate a Python virtual environment (Python version above 3.6).
- Install the dependencies:
$ pip install -r requirements.txt
First, add the tags and their related file names. Then run the following commands.
$ cd scrapping/scripts/v.Apr.2022/
$ python main.py
$ cd dataset_building
$ python all_posts_integration.py
$ cd dataset_building
$ python remove_duplicate_items_json.py
$ cd analysis/scripts
$ python post_analysis.py
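The four steps above could also be chained from a single script. The following is only a minimal sketch, not part of the repository: it assumes every directory in the commands is relative to the repository root, and the names `STEPS`, `build_commands`, and `run_pipeline` are hypothetical helpers introduced for illustration.

```python
import subprocess
import sys
from pathlib import Path

# (working directory, script) pairs in the order listed in this README.
# Assumption: each directory is relative to the repository root.
STEPS = [
    ("scrapping/scripts/v.Apr.2022", "main.py"),
    ("dataset_building", "all_posts_integration.py"),
    ("dataset_building", "remove_duplicate_items_json.py"),
    ("analysis/scripts", "post_analysis.py"),
]


def build_commands(repo_root, python=sys.executable):
    """Return a list of (working directory, command) pairs for every step."""
    root = Path(repo_root)
    return [(root / directory, [python, script]) for directory, script in STEPS]


def run_pipeline(repo_root="."):
    """Run each step in order and stop at the first failing script."""
    for cwd, command in build_commands(repo_root):
        print(f"Running {' '.join(command)} in {cwd}")
        # check=True raises CalledProcessError if a script exits with an error.
        subprocess.run(command, cwd=str(cwd), check=True)
```

Calling `run_pipeline()` from the repository root, inside the activated environment, would run the scraping, dataset building, and analysis scripts one after another.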
- Web Scrapping: Finding Necessary Contents from a Medium Dot Com Blog Post
- Web Scrapping: Clicking the ‘Show More’ Button Multiple times in Medium.com Blog via Selenium
- Do not forget to download the chromedriver version that matches the version of your Chrome browser.
- Miniconda is recommended because it is very lightweight.
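One way to check the chromedriver note before scraping is to compare major version numbers. The sketch below is an illustration under assumptions: the helper names are hypothetical, and it assumes both executables are on `PATH` and print their version when called with `--version` (true for `chromedriver` and for `google-chrome` on Linux; the browser command may differ on other platforms).

```python
import re
import subprocess


def major_version(version_output):
    """Return the major number of the first dotted version found, or None."""
    match = re.search(r"(\d+)\.\d+", version_output)
    return int(match.group(1)) if match else None


def versions_match(chrome_output, driver_output):
    """True when both outputs contain a version with the same major number."""
    chrome = major_version(chrome_output)
    driver = major_version(driver_output)
    return chrome is not None and chrome == driver


def read_version(executable):
    """Run `<executable> --version` and return its printed output."""
    # Assumption: the executable is on PATH and supports --version.
    result = subprocess.run(
        [executable, "--version"],
        stdout=subprocess.PIPE,
        universal_newlines=True,
        check=True,
    )
    return result.stdout.strip()
```

For example, `versions_match(read_version("google-chrome"), read_version("chromedriver"))` would return `False` when the driver was built for a different major Chrome release, which is a signal to download another chromedriver.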