/WEB-SCRAPING

Primary LanguageJupyter Notebook

Simple Python Web Scraping

In this project, I extract related information about vocabulary:

  • text (.json and .csv): word, meaning, type, type, translation
  • image (.png): illustration image
  • audio (.mp3): audio spelling

Tools

  • BeautifulSoup : a Python library for pulling data out of HTML and XML files. It works with your favorite parser to provide idiomatic ways of navigating, searching, and modifying the parse tree. It commonly saves programmers hours or days of work.

  • Requests : a HTTP library for the Python programming language. The goal of the project is to make HTTP requests simpler and more human-friendly.

Setup

Prerequisite

  • Install Python3
  • Install PIP

Setup

pip install bs4 
pip install requests

Now you can run the project in vocabulary.ipynb