Creating this script to scrape stats from this URL: http://www.westcoastroasting.com/.
Requires Scrapy (http://scrapy.org/)
Developed using my scrapestat virtual environment.
-
Enter the scrapestat environment: $> workon scrapestat
-
Go to
$HOME/PyDevel/scrapestat/westcoastroasters and run the spider: $ > scrapy crawl wcr -
If you want to export to json: $> scrapy crawl wcr -o top_selling_beans.json -t json