/simple-web-scraper

A simple web scraper with axios and express

Primary LanguageJavaScript

Axios & Express Web Scraper

A simple web scraper that gets all articles from the guardian

Sources

Dependencies

  • axios v0.27.2
  • cheerio v1.0.0-rc.11
  • cors v2.8.5
  • express v4.18.1

Installation

  • clone this repo to your computer
  • cd simple-web-scraper
  • run npm install

Usage

  • run npm run start in the ./simple-web-scraper folder

Details

The app is configured to scrape the data from https://www.theguardian.com/uk

The app runs on http://localhost:8000

You can see the resulted json file on http://localhost:8000/results

You can see the resulted html by opening the index.html file in the browser