v5tech/notes

Scraping Xunlei VIP Member Accounts with Python


Prerequisites

pip install lxml

xunlei.py

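#coding=utf-8
# Python 2 script: urllib2 does not exist in Python 3.
import urllib2
import lxml.html
import codecs

# Output file, written as UTF-8 text.
xunlei = codecs.open("xunlei.txt", "wb", "utf-8")

# Fetch the index page and take the link of the first article listed.
response = urllib2.urlopen("http://www.vipfenxiang.com/xunlei/")
doc = lxml.html.fromstring(response.read())
link = doc.xpath('//div/article[1]/header/h2/a/@href')

# Fetch that article and save the text of its first <p data-title> element.
response = urllib2.urlopen(link[0])
content = response.read()
doc = lxml.html.fromstring(content.decode("UTF-8"))
xunlei.write(doc.xpath('//article/p[@data-title]')[0].text_content())
xunlei.close()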

Result:

vip分享网迅雷会员账号905519478:2密码9250239
vip分享网迅雷会员账号NoNoupup:2密码9675229
vip分享网迅雷会员账号bwwysysalx:1密码9860279
vip分享网迅雷会员账号gcjforever:1密码9934039
vip分享网迅雷会员账号geonleon:1密码9018439
vip分享网迅雷会员账号wzn0625:2密码9809529
vip分享网迅雷会员账号903156860:1密码9984639
vip分享网迅雷会员账号szy931107:2密码9895529
vip分享网迅雷会员账号813970685:2密码9667389
vip分享网迅雷会员账号146805410:1密码9307809
vip分享网迅雷会员账号894704330:1密码9541829
vip分享网迅雷会员账号reisonc:1密码9064729
vip分享网迅雷会员账号127227731:1密码9114899
vip分享网迅雷会员账号419184398:1密码9568899
vip分享网迅雷会员账号jy00902582:1密码9276159
vip分享网迅雷会员账号117013172:1密码9904909
vip分享网迅雷会员账号notcoder:2密码9870249
vip分享网迅雷会员账号dong18929:2密码9481709
vip分享网迅雷会员账号893487924:1密码9251899
vip分享网迅雷会员账号842752587:1密码9982459
vip分享网迅雷会员账号xiaoxiaoyu0000:1密码9923669
vip分享网迅雷会员账号eveboylee:1密码9602979
vip分享网迅雷会员账号suzhongyue:2密码9042999
vip分享网迅雷会员账号goodboylin:1密码9953339
vip分享网迅雷会员账号sunkepeter:1密码9400599
vip分享网迅雷会员账号zgtzgtzgtzgt:1密码9010759
vip分享网迅雷会员账号113929376:1密码9474219
vip分享网迅雷会员账号898814782:1密码9479119
vip分享网迅雷会员账号395257188:1密码9898899
vip分享网迅雷会员账号tianyong729:1密码9304449
vip分享网迅雷会员账号yantao0721:2密码9401909
vip分享网迅雷会员账号272604661:2密码9725389
vip分享网迅雷会员账号835374392:1密码9210239
vip分享网迅雷会员账号833875698:1密码9917579
vip分享网迅雷会员账号mly023:1密码9449609
vip分享网迅雷会员账号309517578:2密码9066779
vip分享网迅雷会员账号shenmars:2密码9290099
vip分享网迅雷会员账号lanlan85930:1密码9065429
vip分享网迅雷会员账号rainslayer:1密码9174939
vip分享网迅雷会员账号123695743:2密码9941249
vip分享网迅雷会员账号893972032:2密码9592219
vip分享网迅雷会员账号687585690:2密码9547089
vip分享网迅雷会员账号dadele5:2密码9069749
vip分享网迅雷会员账号310899221:1密码9163679
vip分享网迅雷会员账号263715396:1密码9419289
vip分享网迅雷会员账号flybirdf:1密码9356249
vip分享网迅雷会员账号839914126:3密码9022439
vip分享网迅雷会员账号311105338:1密码9812869
vip分享网迅雷会员账号129533708:1密码9808289
vip分享网迅雷会员账号893103998:2密码9065239

🙀 🙀 🙀

This is great.

It throws an error:
Traceback (most recent call last):
  File "1.py", line 13, in <module>
    response = urllib2.urlopen(link[0])
IndexError: list index out of range
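
The IndexError means `doc.xpath(...)` returned an empty list, so `link[0]` has nothing to index. Most likely the page layout no longer matches the XPath expression, though that is a guess and not something confirmed in this thread. Below is a minimal sketch of a guard that fails with a clearer message. The helper name `first_match` and its `what` label are hypothetical and are not part of the original script, and a plain list stands in for the XPath result so that the snippet runs without lxml.

```python
def first_match(results, what):
    """Return the first XPath result, or raise a descriptive error if there is none.

    `results` is the list returned by doc.xpath(...); `what` is a
    human-readable label used only in the error message.
    """
    if not results:
        raise LookupError(
            "XPath matched nothing for %s; the page layout may have changed" % what
        )
    return results[0]


# An empty list is what doc.xpath(...) returns when nothing matches.
links = []
try:
    first_match(links, "article link")
except LookupError as exc:
    print(exc)
```

In the script, `link[0]` would become `first_match(link, "article link")`, and the same check applies to the second `doc.xpath(...)[0]` call.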