Issues
- 3
How to crawl images when a Weibo post contains them
#25 opened by AaronZhang2015 - 0
Fix simple typo: falese -> false
#66 opened by timgates42 - 5
Why does the process never exit after the job finishes?
#64 opened by brightgems - 2
- 2
Some settings in weibo.yaml are unclear; is there a detailed, item-by-item explanation?
#62 opened by xiaoleihuang - 0
Job state persistence problem: job state is saved under tmp, which is cleared when the PC reboots
#61 opened by tottilin - 3
Problem running weibosearch
#21 opened by shanshanpt - 1
weibosearch cannot run
#20 opened by shanshanpt - 1
Crawl suddenly hangs for three or four hours and Ctrl+C does not exit; probably a problem in mq handling
#53 opened by tottilin - 0
Primary/backup mq synchronization problem between workers in distributed crawling
#60 opened by tottilin - 0
The log looks the same as in the previous issue; probably the mq is not properly protected
#59 opened by tottilin - 3
- 2
Problem crawling the follow list
#57 opened by baduni - 4
How to set which users to crawl
#56 opened by baduni - 8
ValueError: No JSON object could be decoded
#55 opened by Yuanyuan199 - 2
- 0
Problem handling HTTP ERROR when fetching pages
#52 opened by tottilin - 0
Hangs indefinitely while getting the page HTML in the parser
#51 opened by tottilin - 0
Setting instances higher than the number of cores causes problems; crawling stops after a while
#50 opened by tottilin - 0
Cannot run on CentOS 6
#49 opened by fengkaijia - 5
Recent release status of cola
#43 opened by hitalex - 1
Package-not-found error when running weibosearch
#48 opened by liwei123o0 - 1
coca cannot start the distributed program on Windows
#47 opened by rena521 - 2
Cannot tell whether an account is an enterprise account when crawling user info
#40 opened by MythHack - 8
- 1
json.loads(br.response().read())["data"]
#46 opened by MingleiLI - 11
Name or Service not known
#45 opened by jlovedragon - 5
dev version: no budget left to process
#44 opened by hitalex - 7
dev version: weibo.yaml configuration problem
#42 opened by hitalex - 2
- 6
Parts of the user info module no longer work
#38 opened by MythHack - 5
develop branch cannot crawl newly posted weibos
#37 opened - 15
Crawler on the develop branch cannot terminate on its own
#34 opened by hitalex - 1
If the program raises an exception, how does cola handle it?
#36 opened by hitalex - 1
How cola handles multiple login accounts
#35 opened by hitalex - 7
Weibo parser error on the develop branch
#33 opened by hitalex - 2
Implement a maximum number of weibos crawled per user
#32 opened by hitalex - 4
Error when running stop.py on the develop branch
#31 opened by hitalex - 3
Question about the Sina Weibo crawler's page access pattern
#30 opened by hitalex - 4
weibo module login fails
#29 opened by hitalex - 11
Large number of "start to get None" messages
#28 opened by jiajunhuang - 1
weibo module always fails to log in when crawling
#27 opened by EdmundZhang - 8
A question: how do you parse the crawled pages?
#26 opened by liudonglei - 1
Could the README describe the approach to crawling Sina Weibo?
#24 opened by DashYang - 1
Why does running __init__.py under contrib/wiki often produce a UserWarning?
#23 opened by DashYang - 0
Question about choosing the number of threads for Weibo crawling
#22 opened by huntzhan - 9
Program locks up after an abnormal exit (not via Ctrl+C)
#19 opened by LiuChaofan - 11
Cannot restart after exiting with Ctrl+C
#18 opened by LiuChaofan - 3
stop.py does not run on develop
#17 opened by LiuChaofan - 5
Problem with FileBloomFilter on the develop branch
#16 opened by ddmbr