/SkyHub-Crawler-Python

Website Crawler written in Python using Scrapy

Primary LanguagePython

Installation steps for SkyHub Crawler in Python

install Miniconda

  1. https://conda.io/miniconda.html
  2. wget https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh
  3. bash Miniconda3-latest-Linux-x86_64.sh

In case you encounter problems with conda:

  1. Restart terminal
  2. rm -r miniconda3
  3. bash Miniconda3-latest-Linux-x86_64.sh

and follow the instructions

install other libraries

conda create -n crawler3
source activate crawler3
pip install scrapy
pip install dateparser
pip install parsel
pip install requests
pip install ujson
pip install urllib3
pip install fuzzywuzzy

Optionals

    pip install beautifulsoup4

Running

source activate crawler3 python3 main.py