Web scraping in Python - Step by step guide

  • Become familiar with a web environment in which you want to extract certain information.
  • Develop a code to obtain its content and extract it from the specific table.
  • Use a database to store the data and process it.

🌱 How to start this project.

Follow the instructions below:

  1. Create a new repository by forking the Git project or by clicking here.
  2. Open the newly created repository in Codespace using the Codespace button extension.
  3. Once the Codespace VSCode has finished opening, start your project by following the instructions below.

🚛 How to deliver this project

Once you have finished solving the exercises, be sure to commit your changes, push to your repository and go to 4Geeks.com to upload the repository link.

📝 Instructions

Read the contents of the ./INSTRUCTIONS.md file containing the steps to complete the project, follow them and develop the solution first in ./src/explore.ipynb for testing and then move the clean code to ./src/app.py. Be sure to delete the notebook once you no longer need it to keep the project clean and tidy.

NOTE: Solution: https://github.com/4geeksacademy/web-scraping-project-tutorial/blob/main/src/solution.py