Primary language: Python. License: MIT.

Scrape EcoCyc

Get all the genes and summaries from EcoCyc.

Requirements:

  • Python 2.7 or 3.3+
  • Scrapy 1.x

Getting started:

Clone this repository and, from its top-level directory, run the following on the command line:

pip install -e .
./run ~/my-output-directory

This runs Scrapy and generates a new results.json file in ~/my-output-directory.

Example output

If you just want to see the output of this code, here is a gzip-compressed results.json file that was generated with it:

https://github.com/zakandrewking/y-ome/blob/master/sources/ecocyc/results.json.gz
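
To inspect the output from Python, a small loader along these lines may help. It is a minimal sketch and not part of this repository: the function name `load_results` is hypothetical, and it assumes only that the file holds a single UTF-8 encoded JSON document, either plain (results.json) or gzip-compressed (results.json.gz). It makes no assumption about the fields inside each record.

```python
import gzip
import io
import json


def load_results(path):
    """Parse a results file and return its JSON content.

    Hypothetical helper: assumes the file is one UTF-8 JSON document,
    gzip-compressed when the file name ends in ".gz".
    """
    # Pick the opener from the file extension; both accept binary mode.
    opener = gzip.open if path.endswith(".gz") else io.open
    with opener(path, "rb") as handle:
        # Read bytes and decode explicitly so this works on Python 2.7 and 3.
        return json.loads(handle.read().decode("utf-8"))
```

Calling `load_results("results.json.gz")` on a downloaded copy of the file above, or `load_results` on the results.json in your own output directory, returns whatever JSON structure the scraper wrote, which you can then explore interactively.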