Basic Playwright/Apify-based web scraper!
- Clone repository and install dependencies
$ git clone git@github.com:ndom91/web-scraper-berlin.git
$ cd web-scraper-berlin
$ npm install
- Paste your list of URLs to be scraped into `sites.txt`
- Double-check the `SEARCH_TERM` variable towards the top of `index.js`. This is the term which will trigger sites to be written to `output.txt` during the scraping process.
- Run `npm run scrape` 🎉
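To make the role of the search term more concrete, here is a minimal sketch of how a match could end up in the output file. It is not the repository's actual `index.js`: the helper names `parseSiteList`, `pageMatches` and `scrape` are hypothetical, and it assumes one URL per line in the sites file and a case-insensitive substring match, neither of which is confirmed by the project.

```javascript
// Hypothetical sketch; the real index.js may be structured differently.
const fs = require('fs');

// Placeholder value; set this to the term you are looking for.
const SEARCH_TERM = 'example term';

// Assumption: the sites file holds one URL per line; blank lines are skipped.
function parseSiteList(text) {
  return text
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line.length > 0);
}

// Assumption: a case-insensitive substring match counts as a hit.
function pageMatches(pageText, term) {
  return pageText.toLowerCase().includes(term.toLowerCase());
}

async function scrape(sitesPath = 'sites.txt', outputPath = 'output.txt') {
  // Required lazily so the helpers above work without Playwright installed.
  const { chromium } = require('playwright');
  const urls = parseSiteList(fs.readFileSync(sitesPath, 'utf8'));
  const browser = await chromium.launch();
  try {
    const page = await browser.newPage();
    for (const url of urls) {
      try {
        await page.goto(url, { waitUntil: 'domcontentloaded' });
        const text = await page.innerText('body');
        if (pageMatches(text, SEARCH_TERM)) {
          // Append each matching URL on its own line.
          fs.appendFileSync(outputPath, url + '\n');
        }
      } catch (err) {
        // One unreachable site should not abort the whole run.
        console.error(`Skipping ${url}: ${err.message}`);
      }
    }
  } finally {
    await browser.close();
  }
}

module.exports = { parseSiteList, pageMatches, scrape };
```

Calling `scrape()` from a small script would then visit each listed URL in turn and append only the matching ones to the output file. The sketch drives Playwright directly and leaves Apify out.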
License: MIT