
Parser for open government data

Primary language: Python

Please note that this parser is outdated: it does not support the new design of rnp.fas.gov.ru. The parsed data is available in scraped_items.csv. It was parsed on 26.08.2010; the site has changed since then.
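Since the spider no longer matches the site, the CSV snapshot is the practical way to get at the data. Below is a minimal sketch of loading it, assuming Python 3, a header row in the file, and UTF-8 encoding. None of these are confirmed by this README, and load_items is a hypothetical helper name, not part of the project.

```python
import csv


def load_items(path, encoding="utf-8"):
    """Read a scraped CSV file into a list of dicts keyed by the header row.

    Assumptions (not confirmed by the README): the file has a header row
    and is encoded as UTF-8. Pass another encoding if decoding fails.
    """
    # newline="" lets the csv module handle line breaks inside quoted fields.
    with open(path, "r", encoding=encoding, newline="") as handle:
        return list(csv.DictReader(handle))
```

Calling load_items("scraped_items.csv") returns one dict per row. The column names come from whatever the header row contains, so inspect the keys of the first row before relying on any particular field.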

The program runs on the Scrapy framework (scrapy.org), versions 0.8 and 0.9. To run the parser, type ./scrapy-ctl.py runspider fas/spiders/rnp.py

The program is cross-platform; the developer has tested it with the following configurations:

  • Linux Ubuntu 10.04 (Lucid Lynx) + Python 2.6, both Scrapy 0.8 and 0.9, Twisted 10.0.0
  • Windows Vista + Python 2.6, both Scrapy 0.8 and 0.9, Twisted 10.1.0

Note that it is not claimed to work with Scrapy 0.10!
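The version constraint can be checked before a run. The sketch below is an illustration only: is_supported_scrapy and SUPPORTED_SERIES are hypothetical names, and the version string is passed in by the caller rather than read from Scrapy, because this README does not say how these old releases expose their version.

```python
# Release series the README reports as tested: 0.8 and 0.9.
SUPPORTED_SERIES = ((0, 8), (0, 9))


def is_supported_scrapy(version):
    """Return True if a version string such as "0.9" is in a tested series."""
    parts = version.split(".")
    try:
        # Compare as integers: as plain strings, "0.10" would sort before "0.9".
        series = (int(parts[0]), int(parts[1]))
    except (IndexError, ValueError):
        # Unparseable strings are treated as unsupported.
        return False
    return series in SUPPORTED_SERIES
```

With this check, "0.8" and "0.9" are accepted and "0.10" is rejected. Parsing the components as integers is the point of the design: a naive string comparison would get the ordering of 0.9 and 0.10 wrong.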

Windows users should follow these instructions:

Note that the MSI packages are named explicitly just for convenience. The whole setup takes about 5 minutes.

Parsing takes about 40 minutes, so please be patient.

Author: Victor Mireyev

License: Creative Commons Attribution 3.0 Unported License