Introduction to Static Site Web Scraping Workshop For MadHacks 2023
-
Get Prices From Home Page (Demo)
-
Print Product Title & Prices
-
Get All Colors Available For Each Product
-
Get Every Product's Material
- This is visible when you click into a product's page
Hint 1
You'll need to make an additional HTML request for each product.
- This is visible when you click into a product's page
-
Filter all the products from highest reviewed to lowest reviewed
-
Product Availability
-
Not all products are available, look at
Gerald the Giraffe
Bonus
Add some logic to check products every X minutes, so you can be notified when products come back in stock Note: This website won't have items come back into stock, but if it was a real website you could have a Discord bot or something notify you. - Example: Old commisioned project I made (https://github.com/davidteather/Hotukdeals-Discord-Notifier)
-
-
Scrape Reviews For Each Product
Bonus
Try and do sentiment analysis on product reviews and sort by ones with the best average sentiment. You might find https://realpython.com/python-nltk-sentiment-analysis/#using-nltks-pre-trained-sentiment-analyzer helpful
This workshop is also almost 1:1 with this Everything Web Scraping Video with solutions so if you get stuck this is a great resource!
FEEDBACK Please fill out the feedback form, so I can make better content in the future