WebScraping

Create a database from scratch by extracting html elements from a webpage

Primary Language: Jupyter Notebook

Modules used: urllib.request, BeautifulSoup, regular expressions (regex), and pandas.

Step-by-step walk-through:

Step 1: pulling HTML out of a webpage.
Step 2: targeting elements of interest inside the HTML.
Step 3: fine-tuning targeted elements with Regex (Regular Expressions), string concatenation and slicing.
Step 4: storing the data inside a DataFrame.
Step 5: exporting DataFrame into a CSV file.
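
The five steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the sample HTML, the `product`, `name` and `price` class names, and the helper names `fetch_html` and `extract_rows` are all hypothetical, and the sketch assumes `beautifulsoup4` and `pandas` are installed. An inline HTML string stands in for a downloaded page so the example runs without a network connection.

```python
import re
from urllib.request import urlopen

import pandas as pd
from bs4 import BeautifulSoup

# Hypothetical markup standing in for a real downloaded page.
SAMPLE_HTML = """
<html><body>
  <div class="product">
    <h2 class="name"> Blue Mug </h2>
    <span class="price">Price: $12.50</span>
  </div>
  <div class="product">
    <h2 class="name">Red Teapot</h2>
    <span class="price">Price: $30.00</span>
  </div>
</body></html>
"""


def fetch_html(url):
    """Step 1: download a page and decode it to text (assumes UTF-8)."""
    with urlopen(url) as response:
        return response.read().decode("utf-8")


def extract_rows(html):
    """Steps 2 and 3: target elements of interest, then clean them up."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    # Step 2: select every element carrying the (hypothetical) "product" class.
    for product in soup.find_all("div", class_="product"):
        name = product.find("h2", class_="name").get_text(strip=True)
        price_text = product.find("span", class_="price").get_text()
        # Step 3: keep only the numeric part of a string like "Price: $12.50".
        match = re.search(r"\d+(?:\.\d+)?", price_text)
        price = float(match.group()) if match else None
        rows.append({"name": name, "price": price})
    return rows


# Step 4: store the cleaned records in a DataFrame.
df = pd.DataFrame(extract_rows(SAMPLE_HTML), columns=["name", "price"])

# Step 5: export to CSV. With no path given, to_csv returns the CSV as a string.
csv_text = df.to_csv(index=False)
print(csv_text)
```

To scrape a live page, pass the result of `fetch_html(url)` to `extract_rows` instead of `SAMPLE_HTML`, and adjust the tag and class names to match that page's markup. Calling `df.to_csv("products.csv", index=False)` (the file name is only an example) writes the result to disk rather than returning a string.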

Also available as a video explanation: https://youtu.be/ySNSY7iiBDY
Author: Mariya Sha
Email: mariyasha888@gmail.com
LinkedIn: www.linkedin.com/in/mariyasha888/