/scrape-urls-from-multiple-websites

This repository will help you to download all the press releases from multiple websites. A simple way to build a custom database of press releases without any manual efforts.

Primary LanguagePython

Scrape urls from multiple websites

This repository will help you to download all the press releases from multiple websites. A simple way to build a custom database of press releases without any manual efforts.

Instructions

  • pip install -r requirements
  • Run url_extract.py

Reference

I devised the solution from the following pages of the documentation:

  • [beautyfulsoup4] to scrape information from web pages
  • [pandas] for data structuring