BBC_News_article_web_scraper

A crawler that collects articles from the BBC news website (BBC.com), cleans the responses, and stores the results in a MongoDB database.
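The cleaning step could look something like the sketch below. It is an illustration only, not the project's actual code: the function name `clean_text` and the exact rules (decode HTML entities, collapse whitespace) are assumptions, and it uses only the Python standard library.

```python
import html
import re


def clean_text(fragments):
    """Join extracted text fragments into one cleaned string.

    Hypothetical helper: the name and the cleaning rules are assumptions
    made for illustration, not taken from this repository.
    """
    # Join the pieces with a space so words from adjacent nodes do not merge.
    joined = " ".join(fragments)
    # Decode HTML entities such as &amp; or &quot; into plain characters.
    unescaped = html.unescape(joined)
    # Collapse every run of whitespace (including newlines and non-breaking
    # spaces) into a single space, then trim both ends.
    return re.sub(r"\s+", " ", unescaped).strip()
```

For example, `clean_text(["  Hello&amp;", "world\n\n"])` returns `"Hello& world"`.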

The crawler is built on the Scrapy framework. It follows page links recursively, uses CSS selectors on each response to extract article details and text, and stores them on an external MongoDB server.
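The storage step might be written as a Scrapy item pipeline along the lines of the sketch below. This is a guess at the general shape, not the code in this repository: the class name `MongoArticlePipeline`, the connection URI, the database and collection names, and the `url` field used as the unique key are all assumptions. Scrapy item pipelines are plain classes, so the sketch only needs `pymongo` when a spider actually opens.

```python
class MongoArticlePipeline:
    """Scrapy-style item pipeline that upserts articles into MongoDB.

    All names and defaults here are hypothetical placeholders.
    """

    def __init__(self, mongo_uri="mongodb://localhost:27017",
                 db_name="bbc_news", collection_name="articles"):
        self.mongo_uri = mongo_uri
        self.db_name = db_name
        self.collection_name = collection_name
        self.client = None
        self.collection = None

    def open_spider(self, spider):
        # Imported here so the class can be loaded without pymongo installed.
        import pymongo

        self.client = pymongo.MongoClient(self.mongo_uri)
        self.collection = self.client[self.db_name][self.collection_name]

    def close_spider(self, spider):
        # Release the connection when the crawl ends.
        if self.client is not None:
            self.client.close()

    def process_item(self, item, spider):
        doc = dict(item)
        # Upsert keyed on the article URL (assumed field) so that crawling
        # the same page twice does not create duplicate documents.
        self.collection.update_one(
            {"url": doc["url"]}, {"$set": doc}, upsert=True
        )
        return item
```

A pipeline like this would be enabled through the `ITEM_PIPELINES` setting in the project's `settings.py`. Upserting rather than inserting is a design choice that suits recursive crawls, where the same article is often reached through several links.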

Installation

pip install scrapy pymongo

Make sure you have a stable internet connection; a slow connection will slow down the crawl.

Open a terminal.
Navigate (change directory) to the BBC_News_article_web_scraper/NewsApp/ folder.
Run the command:

scrapy crawl bbc