crawlab-team/crawlab
Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
GoBSD-3-Clause
Issues
- 1
- 2
SDK支持条用统计接口么?目前数据源有bug,我应该如何在爬虫完成时自己更新爬虫的总结果数
#1474 opened by TDxhfeng - 0
gitlab可以拉取代码,但是在本地开发完成提交后,平台重新点击拉去显示拉取成功,但进入文件并没有更新代码
#1476 opened by TDxhfeng - 0
- 8
当数据源选择MySQL时,存储失败
#1457 opened by suntanic - 2
- 0
创建python 虚拟环境
#1473 opened by BabyBoy-Yuan - 0
社区版本V0.6.0的节点名称能否通过环境变量指定呢?谢谢
#1472 opened by zhouwenjun0820 - 0
社区版v0.6.0爬虫任务数据导出功能不可用
#1470 opened by zhouwenjun0820 - 0
社区版v0.6.0爬虫任务数据导出功能不可用
#1471 opened by zhouwenjun0820 - 0
- 0
Standalone Runtime
#1468 opened by tikazyq - 0
Crawlab File System Optimization
#1458 opened by tikazyq - 0
docker 启动的时候报错
#1467 opened by luchatex - 3
v0.6.3 docker 增加worker一直报错[已解决]
#1464 opened by luchatex - 2
v0.6.3 docker 任务状态不显示[已解决]
#1463 opened by luchatex - 1
通过api获取任务日志,返回结果为None
#1466 opened by wz-farming - 0
v0.6.3 python安装库 worker节点,状态运行中,始终无法完成
#1465 opened by luchatex - 0
- 0
Local files getting deleted after git pull
#1461 opened by codewithraga - 0
Crawlab AI Assistant
#1436 opened by tikazyq - 1
Data Sources
#1445 opened by Vr1llon - 6
crawlab 重启之后,重启前 pending 的任务无法继续执行(After crawlab restart, tasks pending before restart cannot continue)
#1446 opened by ma-pony - 0
随机节点偶发调度到多slave上
#1455 opened by Shinku-Chen - 0
重置密码未在文档中看到
#1454 opened by riluo - 3
crawlab-server 占用内存过高问题
#1437 opened by mayuanyuan199625 - 3
Git同步似乎和SeaweedFS有冲突
#1453 opened by jasonz1360 - 6
SeaweedFS定期出现8888端口无法连接,爬虫的文件也丢失。
#1451 opened by jasonz1360 - 1
使用elastic数据源,存储数据失败
#1427 opened by aibots-team - 3
单节点安装以后总是显示requests依赖没有安装,导致爬虫运行失败
#1426 opened by perpetually - 0
点击取消后,节点任务并未实际结束的问题
#1441 opened by Lxingy - 0
希望增加爬虫任务失败重试执行次数设置
#1442 opened by 534146825 - 0
希望未来可以开放通知的api接口
#1443 opened by 534146825 - 1
container crash (Panic)
#1444 opened by saleh-hom - 1
Install crawlab with spesific python version
#1434 opened by kurniawanlucky - 2
不同爬虫项目之间的Python环境进行隔离
#1421 opened by ma-pony - 1
The background page can execute any command
#1440 opened by aomanbuaoman - 2
Installing Software Dependencies
#1433 opened by username-mike - 0
git 拉取远程代码失败却显示成功
#1432 opened by coder-gao - 2
Unable to launch Playwright
#1430 opened by username-mike - 0
环境依赖中,包实际已经安装完成,仍显示任务进行中
#1428 opened by zzyy951 - 2
github代码第一次拉取成功,之后无法拉取
#1425 opened by wesleywgu - 1
- 2
crawlab依赖管理界面 安装指定版本的依赖包
#1423 opened by whoishuhu - 0
爬虫删除,未级联删除相关数据,数据库冗余
#1416 opened by glacierck - 1
"增量同步文件" 开关的文档说明
#1420 opened by glacierck - 1
git获取文件自动请求web_logs其他网站?
#1418 opened by jdzgd - 0
是否考虑支持Git子项目管理功能?
#1417 opened by m220745 - 0
创建项目级数据库,项目下爬虫共享该库。
#1414 opened by glacierck - 2
pip 搜索trafilatura包,找不到
#1412 opened by glacierck