/SA_Parl_sitting_dates

Dates for both SA parliaments for which Hansard has been published (Senate and House of Assembly)

Primary LanguageRuby

A scraper to read Hansard dates

Scraper is built using Poltergeist / Capybara / Rspec due to heavy AJAX insertion of content that doesn't appear when scraped with Mechanize

Notes to self: When debugging/running scraper on own ubuntu system, prerequisites are [code] sudo gem install poltergeist [/code] Then go to the PhantomJS website and manually wget the PhantomJS build .tar.bz2 because the Ubuntu package is built against the wrong graphics toolkit and crashes on load.

This is a scraper that runs on Morph. To get started see the documentation