/wikiscrapy

Scraper searching for dates in wikipedia

Primary LanguageJavaScript

Wikiscraper

N|Solid

This is a simple phantomjs (http://phantomjs.org/) scraper I whipped up to scrape a wikipedia page for content containing a date (19xx or 20xx) and dump the related sentence into output.csv

Install phantomjs globally to run

npm install
phantomjs index.js <url to wikipedia site>