/pycrawler

Powerful Python crawler: proxy IP, multiprocessing + Queue + YAML-configurable crawler, readability, bs4 (Beautiful Soup), pybloom, PooledDB, MySQLdb, selenium-webdriver-phantomjs, redis, anti-geetest, yaml, email

Primary Language: Python
