This project consists of 3 different web scraping techniques.
The project has been implemented and run on Windows 10 machine with Python 3.8.8. The requirements can be installed by:
pip install -r requirements.txt
WebScraping-BeautifulSoup
contains web scraping using BeautifulSoup. Articles on Algorithms present on GeeksForGeeks have been scraped.
Run the program using:
python main.py
WebScraping-Scrapy
contains web scraping using Scrapy. There are 2 different scraping projects in this.
Run the program using:
spider crawl flipkart_games
Run the program using:
spider crawl amazon_watches
For scraping through Amazon, User-Agents and Proxy methods have also been used
WebScraping-Selenium
contains web scraping using Selenium. Videos present on a Gaming Channel (Insym) have been scraped.
Run the program using:
python main.py