/house

Use Scrapy to crawl Lianjia house price data

Primary language: Python

A spider that crawls Lianjia house prices

Uses Scrapy as the scraping framework

  • Covers Beijing, Shanghai, Guangzhou and Shenzhen
  • Crawls incrementally
  • Runs on multiple machines at the same time
  • Randomly switches user agents and proxies to avoid being banned
  • Stores data in HBase
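
The agent and proxy switching could be done with a Scrapy downloader middleware along the following lines. This is a minimal sketch, not the project's actual code: the class name `RandomAgentProxyMiddleware`, the `USER_AGENTS` and `PROXIES` pools, and the proxy addresses are all hypothetical placeholders. It relies only on the standard Scrapy conventions that a downloader middleware exposes `process_request(request, spider)` and that the proxy for a request is read from `request.meta["proxy"]`.

```python
import random

# Hypothetical pools for illustration; the project's real lists are not shown.
USER_AGENTS = [
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/120.0 Safari/537.36",
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 "
    "(KHTML, like Gecko) Version/17.0 Safari/605.1.15",
]
PROXIES = [
    "http://127.0.0.1:8001",  # placeholder address
    "http://127.0.0.1:8002",  # placeholder address
]


class RandomAgentProxyMiddleware:
    """Pick a random User-Agent and proxy for every outgoing request."""

    def __init__(self, user_agents=USER_AGENTS, proxies=PROXIES, rng=None):
        self.user_agents = list(user_agents)
        self.proxies = list(proxies)
        self.rng = rng or random.Random()

    def process_request(self, request, spider):
        # Overwrite the User-Agent header with a random entry from the pool.
        request.headers["User-Agent"] = self.rng.choice(self.user_agents)
        # Scrapy's built-in proxy handling reads the proxy URL from meta.
        if self.proxies:
            request.meta["proxy"] = self.rng.choice(self.proxies)
        # Returning None tells Scrapy to keep processing the request.
        return None
```

To enable a middleware like this, it would be registered in the `DOWNLOADER_MIDDLEWARES` setting in `settings.py`, using whatever module path the class lives at in the project.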
