Real-estate crawler: BeautifulSoup, MySQL

Primary language: Python

DYWHouse

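I have been following the Daya Bay property market lately and happen to be learning Python, so I used BeautifulSoup and MySQL to scrape more than 100,000 property registration records.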

URLs involved

Site home page

url_host = "http://61.142.120.214:9000/web/salepermit.jsp?"

Level 1

herf_host = "http://61.142.120.214:9000/web/"

allherf_content_1 = herf_host + "salepermit.jsp?page={0}"
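The `{0}` placeholder in `allherf_content_1` is filled in with a page number. A minimal sketch of how the level-1 listing URLs could be generated is shown below; the helper name `listing_page_urls` and the assumption that pages are numbered consecutively from 1 are illustrative and are not taken from the project code.

```python
# Constants copied from the README.
herf_host = "http://61.142.120.214:9000/web/"
allherf_content_1 = herf_host + "salepermit.jsp?page={0}"


def listing_page_urls(first_page, last_page):
    """Return level-1 listing URLs for first_page..last_page inclusive.

    Hypothetical helper: assumes the site numbers its pages consecutively.
    """
    return [allherf_content_1.format(page)
            for page in range(first_page, last_page + 1)]


if __name__ == "__main__":
    # Print the first three listing URLs without sending any request.
    for url in listing_page_urls(1, 3):
        print(url)
```

Building the list up front keeps URL construction separate from fetching, so the range can be checked before any request is sent.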

Level 2

allherf_content_2 = herf_host + "realestate_presale.jsp?licenceCode=#&ProjectCode=DYW00121014"

Level 3

allherf_content_3 = herf_host + "salestable.jsp?buildingcode=DYW0012101401&projectcode=DYW00121014"
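The level-3 URL above hard-codes one building code and one project code as an example. Assuming those two query parameters are all the page needs, a sketch of building the same URL for arbitrary codes might look like this; `salestable_url` is a hypothetical helper name, not part of the project.

```python
from urllib.parse import urlencode

# Constant copied from the README.
herf_host = "http://61.142.120.214:9000/web/"


def salestable_url(building_code, project_code):
    """Build a level-3 sales-table URL (hypothetical helper).

    urlencode keeps the dict's insertion order and escapes any
    characters that are not safe inside a query string.
    """
    query = urlencode({"buildingcode": building_code,
                       "projectcode": project_code})
    return herf_host + "salestable.jsp?" + query


if __name__ == "__main__":
    # Reproduces the example URL shown in the README.
    print(salestable_url("DYW0012101401", "DYW00121014"))
```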

Level 4

allherf_content_4 = herf_host + "House.jsp?id={0}&lcStr={1}"
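The level-4 template takes two values, `id` and `lcStr`. The README does not say what they look like, so the sketch below uses made-up values and percent-encodes both before substitution; whether the site actually needs that encoding is an assumption, and `house_url` is a hypothetical helper name.

```python
from urllib.parse import quote

# Constants copied from the README.
herf_host = "http://61.142.120.214:9000/web/"
allherf_content_4 = herf_host + "House.jsp?id={0}&lcStr={1}"


def house_url(house_id, lc_str):
    """Fill the level-4 template (hypothetical helper).

    Both values are percent-encoded so that spaces or non-ASCII
    characters cannot break the query string.
    """
    return allherf_content_4.format(quote(str(house_id), safe=""),
                                    quote(str(lc_str), safe=""))


if __name__ == "__main__":
    # The id and lcStr values here are invented for illustration only.
    print(house_url(12345, "A 1"))
```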

Example screenshots

Example 1

Example 2

Example 3

Example 4