life website crawler

Primary language: Python

crawler_Life

Scrapes the life website (http://love.heima.com/).

The first commit is a raw, minimal implementation.

Later work will make it faster and more efficient:

step one: use multithreading or Twisted
step two: use the Scrapy framework
step three: support multiple storage backends, such as MongoDB and MySQL
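
As a rough illustration of step one, the sketch below fetches several pages concurrently on a thread pool from the standard library. It is not the repository's code: the names fetch and fetch_all, the fetcher parameter, and the default of 8 workers are all assumptions made for this example.

```python
from concurrent.futures import ThreadPoolExecutor
from urllib.request import urlopen


def fetch(url, timeout=10):
    """Download one page and return its raw bytes."""
    with urlopen(url, timeout=timeout) as response:
        return response.read()


def fetch_all(urls, fetcher=fetch, max_workers=8):
    """Fetch every URL on a thread pool.

    Returns a dict mapping each URL to its body, or to the exception
    raised while fetching it, so one bad page does not stop the crawl.
    `fetcher` is a hypothetical hook that lets callers swap in another
    download function (for example, a stub in tests).
    """
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit all downloads first so they run concurrently.
        futures = {url: pool.submit(fetcher, url) for url in urls}
        for url, future in futures.items():
            try:
                results[url] = future.result()
            except Exception as exc:  # record the failure and keep going
                results[url] = exc
    return results
```

Threads suit this job because the work is dominated by waiting on the network. Calling fetch_all(["http://love.heima.com/"]) would return a one-entry dict holding either the page bytes or the error that occurred.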

Requirements: sqlite3, SQLAlchemy, BeautifulSoup
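
For a sense of how crawled pages might be stored, here is a minimal sketch that uses only the standard-library sqlite3 module. The table name pages, its columns, and the helper names open_store, save_page and load_page are assumptions for illustration; the project's actual schema may differ, and it may go through SQLAlchemy instead.

```python
import sqlite3


def open_store(path=":memory:"):
    """Open (or create) the database and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages ("
        "url TEXT PRIMARY KEY, title TEXT, body TEXT)"
    )
    return conn


def save_page(conn, url, title, body):
    """Insert one crawled page, overwriting any earlier copy of the URL."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT OR REPLACE INTO pages (url, title, body) VALUES (?, ?, ?)",
            (url, title, body),
        )


def load_page(conn, url):
    """Return (title, body) for a URL, or None if it was never stored."""
    return conn.execute(
        "SELECT title, body FROM pages WHERE url = ?", (url,)
    ).fetchone()
```

Keeping storage behind a few small functions like these is one way to leave room for other backends later, since callers never touch SQL directly.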