scrapi

Scrapers + API

Getting started

  • You will need to:
    • Install the Python requirements
    • Install Elasticsearch
    • Install the consumers
    • Install RabbitMQ

Requirements

  • Create and activate a virtual environment for scrapi, then go to the top-level project directory. From there, run
$ pip install -r requirements.txt

and the Python requirements for the project will be downloaded and installed.
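
If you have not set up a virtual environment before, one way to do it (a sketch using virtualenv; the built-in venv module or virtualenvwrapper work just as well, and scrapi-env below is only an example name) is

$ pip install virtualenv
$ virtualenv scrapi-env
$ source scrapi-env/bin/activate

and then run the pip install command above from inside the activated environment.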

Installing Elasticsearch

Note: JDK 7 must be installed for Elasticsearch to run.

Mac OSX

$ brew install elasticsearch

Now, just run

$ elasticsearch

or

$ invoke elasticsearch

and you should be good to go.
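
Elasticsearch listens on port 9200 by default, so a quick way to check that it is actually running is

$ curl http://localhost:9200

which should return a short JSON document describing the node.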

Ubuntu

$ wget https://download.elasticsearch.org/elasticsearch/elasticsearch/elasticsearch-1.2.1.deb 
$ sudo dpkg -i elasticsearch-1.2.1.deb

Now, just run

$ sudo service elasticsearch start

or

$ invoke elasticsearch

and you should be good to go.
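
If you also want Elasticsearch to come back up after a reboot, you can register the init script (a sketch; the runlevel arguments below are the conventional defaults, so check them against the Elasticsearch documentation for your version):

$ sudo update-rc.d elasticsearch defaults 95 10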

Running the server

  • Just run
$ python main.py

from the scrapi/website/ directory, and the server should be up and running!
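
To double-check that the server is responding, you can hit it with curl. The address below is only an assumption; the actual host and port are printed when main.py starts, so substitute whatever appears there:

$ curl http://localhost:5000/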

Consumers

  • Just run
$ invoke install_consumers

and the consumers specified in the worker_manager manifest files, along with their requirements, will be installed.
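
If you are curious which consumers that covers, you can look at the manifest files themselves (assuming they live in a manifests/ directory under worker_manager; adjust the path if your checkout is laid out differently):

$ ls worker_manager/manifests/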

RabbitMQ

Mac OSX

$ brew install rabbitmq

Ubuntu

$ sudo apt-get install rabbitmq-server
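
The Ubuntu package normally starts the broker for you; with Homebrew you start it yourself (Homebrew installs rabbitmq-server under /usr/local/sbin, which may not be on your PATH). If the broker is not already running, start it with

$ rabbitmq-server -detached

and check that it is up (drop the sudo on OSX, where the broker runs as your own user) with

$ sudo rabbitmqctl status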

Running the scheduler

  • From the top-level project directory, run
$ invoke celery_beat

to start the scheduler, and

$ invoke celery_worker

to start the worker.
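
Note that celery_beat only schedules and queues tasks; a worker has to be running as well to actually execute them. In practice that means running the two commands in separate terminals, or backgrounding one of them:

$ invoke celery_beat &
$ invoke celery_worker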

Testing

  • To run the tests for the project, just type
$ invoke test

and all of the tests in the 'tests/' directory will be run.
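
If you want to run a single test module while developing, and the suite is nose-compatible (an assumption; invoke test may wrap a different runner), you can point the runner at one file, where tests/test_example.py stands in for a real module from the tests/ directory:

$ nosetests tests/test_example.py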