收藏夹无法抓取
FollowHeart007 opened this issue · 4 comments
2.4.0版本,windows系统,无论是自己的还是他人的,尝试抓取公开收藏夹失败。抓取用户回答,想法,文章功能正常。
同样的问题,不论是自己的还是别人的,私密的还是公开的都无法抓取。win10x64
2022-01-22 22:24:22.006: 开始工作
2022-01-22 22:24:22.007: 重新载入cookie配置
2022-01-22 22:24:22.008: 开始执行任务
2022-01-22 22:24:22.008: [DispatchCommand] command start
2022-01-22 22:24:22.009: [DispatchCommand] 检查更新
2022-01-22 22:24:22.009: [InitEnv] command start
2022-01-22 22:24:22.009: [InitEnv] 检查更新
2022-01-22 22:24:22.664: [InitEnv] 初始化文件夹
2022-01-22 22:24:22.666: [InitEnv] 文件夹初始化完毕
2022-01-22 22:24:22.667: [InitEnv] 初始化数据库
2022-01-22 22:24:22.678: [InitEnv] 数据库初始化完毕
2022-01-22 22:24:22.679: [InitEnv] command finish
2022-01-22 22:24:22.679: [DispatchCommand] 创建任务实例
2022-01-22 22:24:22.679: [DispatchCommand] 执行抓取命令
2022-01-22 22:24:22.680: [FetchCustomer] command start
2022-01-22 22:24:22.680: [FetchCustomer] 从C:\Program Files\zhihuhelp\resources\app\customer_task_config.json中读取配置文件
2022-01-22 22:24:22.680: [FetchCustomer] content =>{
"configList": [
{
"type": "collection",
"id": "60569287",
"rawInputText": "https://www.zhihu.com/collection/60569287",
"comment": "黑魔法",
"skipFetch": false,
"defaultTitle": "收藏夹_60569287(60569287)回答合集",
"lastId": "60569287"
}
],
"imageQuilty": "raw",
"bookTitle": "黑魔法",
"maxQuestionOrArticleInBook": 10000,
"orderByList": [
{
"orderBy": "createAt",
"order": "asc"
}
],
"comment": "",
"skipFetch": false
}
2022-01-22 22:24:22.682: [FetchCustomer] 开始进行自定义抓取, 共有1个任务
2022-01-22 22:24:22.682: [FetchCustomer] 合并抓取任务
2022-01-22 22:24:22.682: [FetchCustomer] 抓取任务合并完毕, 最终结果为=>{"collection":["60569287"]}
2022-01-22 22:24:22.683: [FetchCustomer] 开始派发自定义任务=>
2022-01-22 22:24:22.683: [BatchFetchCollection] 启动第1/1个抓取任务(60569287)
2022-01-22 22:24:22.683: [BatchFetchCollection] 开始抓取收藏夹60569287内的回答
2022-01-22 22:24:22.684: [BatchFetchCollection] 获取收藏夹信息
2022-01-22 22:24:23.104: [BatchFetchCollection] 话题undefined(undefined)信息获取完毕, 共有回答undefined个
2022-01-22 22:24:23.105: [BatchFetchCollection] 开始抓取回答列表
2022-01-22 22:24:23.105: 任务队列已满, 开始执行任务, 共1个任务待执行
2022-01-22 22:24:23.105: 队列已满, 休眠1s, 保护知乎服务器
2022-01-22 22:24:24.106: 任务队列内所有任务执行完毕
2022-01-22 22:24:24.107: [BatchFetchCollection] 全部回答列表抓取完毕
2022-01-22 22:24:24.107: [BatchFetchCollection] 开始抓取收藏夹undefined(undefined)的下所有回答,共0条
2022-01-22 22:24:24.107: [BatchFetchAnswer] 派发所有待抓取任务
2022-01-22 22:24:24.108: 任务队列已满, 开始执行任务, 共1个任务待执行
2022-01-22 22:24:24.108: 队列已满, 休眠1s, 保护知乎服务器
2022-01-22 22:24:25.114: 任务队列内所有任务执行完毕
2022-01-22 22:24:25.115: [BatchFetchAnswer] 所有抓取任务执行完毕
2022-01-22 22:24:25.115: [BatchFetchCollection] 收藏夹undefined(undefined)下所有回答抓取完毕
2022-01-22 22:24:25.115: [BatchFetchCollection] 第1/1个任务(60569287)执行完毕
2022-01-22 22:24:25.116: [BatchFetchCollection] 派发所有待抓取任务
2022-01-22 22:24:25.116: 任务队列已满, 开始执行任务, 共1个任务待执行
2022-01-22 22:24:25.116: 队列已满, 休眠1s, 保护知乎服务器
2022-01-22 22:24:26.122: 任务队列内所有任务执行完毕
2022-01-22 22:24:26.122: [BatchFetchCollection] 所有抓取任务执行完毕
2022-01-22 22:24:26.123: [FetchCustomer] 自定义任务抓取完毕
2022-01-22 22:24:26.124: [FetchCustomer] command finish
2022-01-22 22:24:26.124: [DispatchCommand] 抓取命令执行完毕
2022-01-22 22:24:26.124: [DispatchCommand] 执行生成电子书命令
2022-01-22 22:24:26.124: [GenerateCustomer] command start
2022-01-22 22:24:26.125: [GenerateCustomer] 从C:\Program Files\zhihuhelp\resources\app\customer_task_config.json中读取配置文件
2022-01-22 22:24:26.125: [GenerateCustomer] content =>{
"configList": [
{
"type": "collection",
"id": "60569287",
"rawInputText": "https://www.zhihu.com/collection/60569287",
"comment": "黑魔法",
"skipFetch": false,
"defaultTitle": "收藏夹_60569287(60569287)回答合集",
"lastId": "60569287"
}
],
"imageQuilty": "raw",
"bookTitle": "黑魔法",
"maxQuestionOrArticleInBook": 10000,
"orderByList": [
{
"orderBy": "createAt",
"order": "asc"
}
],
"comment": "",
"skipFetch": false
}
2022-01-22 22:24:26.126: [GenerateCustomer] 开始输出自定义电子书, 共有1个任务
2022-01-22 22:24:26.126: [GenerateCustomer] 将任务中的数据按照问题/文章/想法进行汇总
2022-01-22 22:24:26.126: [GenerateCustomer] 处理第1/1个任务, 任务类型:collection, 任务备注:黑魔法
2022-01-22 22:24:26.127: [GenerateCustomer] 获取收藏夹60569287下所有回答id
2022-01-22 22:24:26.129: [GenerateCustomer] 收藏夹60569287下回答id列表获取完毕
2022-01-22 22:24:26.129: [GenerateCustomer] 获取收藏夹60569287下回答列表
2022-01-22 22:24:26.130: [GenerateCustomer] 收藏夹60569287下回答列表获取完毕
2022-01-22 22:24:26.131: [GenerateCustomer] 所有数据获取完毕, 最终结果为=>
2022-01-22 22:24:26.131: [GenerateCustomer] 问题 => 0个
2022-01-22 22:24:26.131: [GenerateCustomer] 文章 => 0篇
2022-01-22 22:24:26.132: [GenerateCustomer] 想法 => 0条
2022-01-22 22:24:26.132: [GenerateCustomer] 按配置排序
2022-01-22 22:24:26.132: [GenerateCustomer] command finish
2022-01-22 22:24:26.133: [DispatchCommand] 生成电子书命令执行完毕
2022-01-22 22:24:26.133: [DispatchCommand] 所有命令执行完毕
2022-01-22 22:24:26.133: [DispatchCommand] command finish
2022-01-22 22:24:26.134: 所有任务执行完毕, 打开电子书文件夹 => C:\Program Files\zhihuhelp\resources\app\知乎助手输出的电子书
一模一样的问题。2.4.0. Windows10x64。他人公开收藏夹内/自己私密收藏夹内“回答”均无法抓取。其他抓取正常。
2022-02-09 19:00:37.111: 开始工作
2022-02-09 19:00:37.113: 重新载入cookie配置
2022-02-09 19:00:37.119: 开始执行任务
2022-02-09 19:00:37.120: [DispatchCommand] command start
2022-02-09 19:00:37.121: [DispatchCommand] 检查更新
2022-02-09 19:00:37.121: [InitEnv] command start
2022-02-09 19:00:37.122: [InitEnv] 检查更新
2022-02-09 19:00:38.191: [InitEnv] 初始化文件夹
2022-02-09 19:00:38.195: [InitEnv] 文件夹初始化完毕
2022-02-09 19:00:38.196: [InitEnv] 初始化数据库
2022-02-09 19:00:38.212: [InitEnv] 数据库初始化完毕
2022-02-09 19:00:38.213: [InitEnv] command finish
2022-02-09 19:00:38.213: [DispatchCommand] 创建任务实例
2022-02-09 19:00:38.214: [DispatchCommand] 执行抓取命令
2022-02-09 19:00:38.214: [FetchCustomer] command start
2022-02-09 19:00:38.215: [FetchCustomer] 从C:\Users\zxcha\AppData\Local\Programs\zhihuhelp\resources\app\customer_task_config.json中读取配置文件
2022-02-09 19:00:38.216: [FetchCustomer] content =>{
"configList": [
{
"type": "collection",
"id": "369876193",
"rawInputText": "https://www.zhihu.com/collection/369876193",
"comment": "",
"skipFetch": false,
"defaultTitle": "收藏夹_369876193(369876193)回答合集",
"lastId": "369876193"
}
],
"imageQuilty": "hd",
"bookTitle": "收藏夹_369876193(369876193)回答合集",
"maxQuestionOrArticleInBook": 500,
"orderByList": [
{
"orderBy": "createAt",
"order": "asc"
}
],
"comment": "",
"skipFetch": false
}
2022-02-09 19:00:38.216: [FetchCustomer] 开始进行自定义抓取, 共有1个任务
2022-02-09 19:00:38.217: [FetchCustomer] 合并抓取任务
2022-02-09 19:00:38.217: [FetchCustomer] 抓取任务合并完毕, 最终结果为=>{"collection":["369876193"]}
2022-02-09 19:00:38.218: [FetchCustomer] 开始派发自定义任务=>
2022-02-09 19:00:38.218: [BatchFetchCollection] 启动第1/1个抓取任务(369876193)
2022-02-09 19:00:38.219: [BatchFetchCollection] 开始抓取收藏夹369876193内的回答
2022-02-09 19:00:38.219: [BatchFetchCollection] 获取收藏夹信息
2022-02-09 19:00:38.641: [BatchFetchCollection] 话题undefined(undefined)信息获取完毕, 共有回答undefined个
2022-02-09 19:00:38.641: [BatchFetchCollection] 开始抓取回答列表
2022-02-09 19:00:38.642: 任务队列已满, 开始执行任务, 共1个任务待执行
2022-02-09 19:00:38.642: 队列已满, 休眠1s, 保护知乎服务器
2022-02-09 19:00:39.646: 任务队列内所有任务执行完毕
2022-02-09 19:00:39.646: [BatchFetchCollection] 全部回答列表抓取完毕
2022-02-09 19:00:39.647: [BatchFetchCollection] 开始抓取收藏夹undefined(undefined)的下所有回答,共0条
2022-02-09 19:00:39.647: [BatchFetchAnswer] 派发所有待抓取任务
2022-02-09 19:00:39.648: 任务队列已满, 开始执行任务, 共1个任务待执行
2022-02-09 19:00:39.648: 队列已满, 休眠1s, 保护知乎服务器
2022-02-09 19:00:40.649: 任务队列内所有任务执行完毕
2022-02-09 19:00:40.650: [BatchFetchAnswer] 所有抓取任务执行完毕
2022-02-09 19:00:40.650: [BatchFetchCollection] 收藏夹undefined(undefined)下所有回答抓取完毕
2022-02-09 19:00:40.651: [BatchFetchCollection] 第1/1个任务(369876193)执行完毕
2022-02-09 19:00:40.651: [BatchFetchCollection] 派发所有待抓取任务
2022-02-09 19:00:40.651: 任务队列已满, 开始执行任务, 共1个任务待执行
2022-02-09 19:00:40.652: 队列已满, 休眠1s, 保护知乎服务器
2022-02-09 19:00:41.654: 任务队列内所有任务执行完毕
2022-02-09 19:00:41.655: [BatchFetchCollection] 所有抓取任务执行完毕
2022-02-09 19:00:41.655: [FetchCustomer] 自定义任务抓取完毕
2022-02-09 19:00:41.656: [FetchCustomer] command finish
2022-02-09 19:00:41.656: [DispatchCommand] 抓取命令执行完毕
2022-02-09 19:00:41.657: [DispatchCommand] 执行生成电子书命令
2022-02-09 19:00:41.657: [GenerateCustomer] command start
2022-02-09 19:00:41.658: [GenerateCustomer] 从C:\Users\zxcha\AppData\Local\Programs\zhihuhelp\resources\app\customer_task_config.json中读取配置文件
2022-02-09 19:00:41.658: [GenerateCustomer] content =>{
"configList": [
{
"type": "collection",
"id": "369876193",
"rawInputText": "https://www.zhihu.com/collection/369876193",
"comment": "",
"skipFetch": false,
"defaultTitle": "收藏夹_369876193(369876193)回答合集",
"lastId": "369876193"
}
],
"imageQuilty": "hd",
"bookTitle": "收藏夹_369876193(369876193)回答合集",
"maxQuestionOrArticleInBook": 500,
"orderByList": [
{
"orderBy": "createAt",
"order": "asc"
}
],
"comment": "",
"skipFetch": false
}
2022-02-09 19:00:41.659: [GenerateCustomer] 开始输出自定义电子书, 共有1个任务
2022-02-09 19:00:41.660: [GenerateCustomer] 将任务中的数据按照问题/文章/想法进行汇总
2022-02-09 19:00:41.660: [GenerateCustomer] 处理第1/1个任务, 任务类型:collection, 任务备注:
2022-02-09 19:00:41.660: [GenerateCustomer] 获取收藏夹369876193下所有回答id
2022-02-09 19:00:41.664: [GenerateCustomer] 收藏夹369876193下回答id列表获取完毕
2022-02-09 19:00:41.665: [GenerateCustomer] 获取收藏夹369876193下回答列表
2022-02-09 19:00:41.666: [GenerateCustomer] 收藏夹369876193下回答列表获取完毕
2022-02-09 19:00:41.667: [GenerateCustomer] 所有数据获取完毕, 最终结果为=>
2022-02-09 19:00:41.667: [GenerateCustomer] 问题 => 0个
2022-02-09 19:00:41.667: [GenerateCustomer] 文章 => 0篇
2022-02-09 19:00:41.668: [GenerateCustomer] 想法 => 0条
2022-02-09 19:00:41.668: [GenerateCustomer] 按配置排序
2022-02-09 19:00:41.668: [GenerateCustomer] command finish
2022-02-09 19:00:41.669: [DispatchCommand] 生成电子书命令执行完毕
2022-02-09 19:00:41.669: [DispatchCommand] 所有命令执行完毕
2022-02-09 19:00:41.670: [DispatchCommand] command finish
2022-02-09 19:00:41.670: 所有任务执行完毕, 打开电子书文件夹 => C:\Users\zxcha\AppData\Local\Programs\zhihuhelp\resources\app\知乎助手输出的电子书
收藏夹内回答无法抓取 v2.4.0 系统为Win8.1x64
2022-02-21 20:11:05.935: 重新载入cookie配置
2022-02-21 20:11:10.866: 开始工作
2022-02-21 20:11:10.868: 重新载入cookie配置
2022-02-21 20:11:10.869: 开始执行任务
2022-02-21 20:11:10.869: [DispatchCommand] command start
2022-02-21 20:11:10.869: [DispatchCommand] 检查更新
2022-02-21 20:11:10.870: [InitEnv] command start
2022-02-21 20:11:10.871: [InitEnv] 检查更新
2022-02-21 20:11:12.089: [InitEnv] 初始化文件夹
2022-02-21 20:11:12.099: [InitEnv] 文件夹初始化完毕
2022-02-21 20:11:12.100: [InitEnv] 初始化数据库
2022-02-21 20:11:12.137: [InitEnv] 数据库初始化完毕
2022-02-21 20:11:12.138: [InitEnv] command finish
2022-02-21 20:11:12.139: [DispatchCommand] 创建任务实例
2022-02-21 20:11:12.140: [DispatchCommand] 执行抓取命令
2022-02-21 20:11:12.140: [FetchCustomer] command start
2022-02-21 20:11:12.141: [FetchCustomer] 从D:\Program Files\zhihuhelp\resources\app\customer_task_config.json中读取配置文件
2022-02-21 20:11:12.142: [FetchCustomer] content =>{
"configList": [
{
"type": "collection",
"id": "210843329",
"rawInputText": "https://www.zhihu.com/collection/210843329",
"comment": "",
"skipFetch": false,
"defaultTitle": "收藏夹_210843329(210843329)回答合集",
"lastId": "210843329"
}
],
"imageQuilty": "raw",
"bookTitle": "收藏夹_210843329(210843329)回答合集",
"maxQuestionOrArticleInBook": 100,
"orderByList": [
{
"orderBy": "createAt",
"order": "desc"
}
],
"comment": "",
"skipFetch": false
}
2022-02-21 20:11:12.144: [FetchCustomer] 开始进行自定义抓取, 共有1个任务
2022-02-21 20:11:12.145: [FetchCustomer] 合并抓取任务
2022-02-21 20:11:12.146: [FetchCustomer] 抓取任务合并完毕, 最终结果为=>{"collection":["210843329"]}
2022-02-21 20:11:12.146: [FetchCustomer] 开始派发自定义任务=>
2022-02-21 20:11:12.147: [BatchFetchCollection] 启动第1/1个抓取任务(210843329)
2022-02-21 20:11:12.148: [BatchFetchCollection] 开始抓取收藏夹210843329内的回答
2022-02-21 20:11:12.148: [BatchFetchCollection] 获取收藏夹信息
2022-02-21 20:11:12.430: [BatchFetchCollection] 话题undefined(undefined)信息获取完毕, 共有回答undefined个
2022-02-21 20:11:12.432: [BatchFetchCollection] 开始抓取回答列表
2022-02-21 20:11:12.433: 任务队列已满, 开始执行任务, 共1个任务待执行
2022-02-21 20:11:12.434: 队列已满, 休眠1s, 保护知乎服务器
2022-02-21 20:11:13.436: 任务队列内所有任务执行完毕
2022-02-21 20:11:13.437: [BatchFetchCollection] 全部回答列表抓取完毕
2022-02-21 20:11:13.438: [BatchFetchCollection] 开始抓取收藏夹undefined(undefined)的下所有回答,共0条
2022-02-21 20:11:13.440: [BatchFetchAnswer] 派发所有待抓取任务
2022-02-21 20:11:13.441: 任务队列已满, 开始执行任务, 共1个任务待执行
2022-02-21 20:11:13.442: 队列已满, 休眠1s, 保护知乎服务器
2022-02-21 20:11:14.444: 任务队列内所有任务执行完毕
2022-02-21 20:11:14.446: [BatchFetchAnswer] 所有抓取任务执行完毕
2022-02-21 20:11:14.447: [BatchFetchCollection] 收藏夹undefined(undefined)下所有回答抓取完毕
2022-02-21 20:11:14.448: [BatchFetchCollection] 第1/1个任务(210843329)执行完毕
2022-02-21 20:11:14.449: [BatchFetchCollection] 派发所有待抓取任务
2022-02-21 20:11:14.450: 任务队列已满, 开始执行任务, 共1个任务待执行
2022-02-21 20:11:14.451: 队列已满, 休眠1s, 保护知乎服务器
2022-02-21 20:11:15.454: 任务队列内所有任务执行完毕
2022-02-21 20:11:15.456: [BatchFetchCollection] 所有抓取任务执行完毕
2022-02-21 20:11:15.457: [FetchCustomer] 自定义任务抓取完毕
2022-02-21 20:11:15.458: [FetchCustomer] command finish
2022-02-21 20:11:15.459: [DispatchCommand] 抓取命令执行完毕
2022-02-21 20:11:15.460: [DispatchCommand] 执行生成电子书命令
2022-02-21 20:11:15.461: [GenerateCustomer] command start
2022-02-21 20:11:15.462: [GenerateCustomer] 从D:\Program Files\zhihuhelp\resources\app\customer_task_config.json中读取配置文件
2022-02-21 20:11:15.464: [GenerateCustomer] content =>{
"configList": [
{
"type": "collection",
"id": "210843329",
"rawInputText": "https://www.zhihu.com/collection/210843329",
"comment": "",
"skipFetch": false,
"defaultTitle": "收藏夹_210843329(210843329)回答合集",
"lastId": "210843329"
}
],
"imageQuilty": "raw",
"bookTitle": "收藏夹_210843329(210843329)回答合集",
"maxQuestionOrArticleInBook": 100,
"orderByList": [
{
"orderBy": "createAt",
"order": "desc"
}
],
"comment": "",
"skipFetch": false
}
2022-02-21 20:11:15.467: [GenerateCustomer] 开始输出自定义电子书, 共有1个任务
2022-02-21 20:11:15.468: [GenerateCustomer] 将任务中的数据按照问题/文章/想法进行汇总
2022-02-21 20:11:15.469: [GenerateCustomer] 处理第1/1个任务, 任务类型:collection, 任务备注:
2022-02-21 20:11:15.471: [GenerateCustomer] 获取收藏夹210843329下所有回答id
2022-02-21 20:11:15.477: [GenerateCustomer] 收藏夹210843329下回答id列表获取完毕
2022-02-21 20:11:15.478: [GenerateCustomer] 获取收藏夹210843329下回答列表
2022-02-21 20:11:15.480: [GenerateCustomer] 收藏夹210843329下回答列表获取完毕
2022-02-21 20:11:15.481: [GenerateCustomer] 所有数据获取完毕, 最终结果为=>
2022-02-21 20:11:15.482: [GenerateCustomer] 问题 => 0个
2022-02-21 20:11:15.483: [GenerateCustomer] 文章 => 0篇
2022-02-21 20:11:15.483: [GenerateCustomer] 想法 => 0条
2022-02-21 20:11:15.484: [GenerateCustomer] 按配置排序
2022-02-21 20:11:15.484: [GenerateCustomer] command finish
2022-02-21 20:11:15.485: [DispatchCommand] 生成电子书命令执行完毕
2022-02-21 20:11:15.486: [DispatchCommand] 所有命令执行完毕
2022-02-21 20:11:15.486: [DispatchCommand] command finish
2022-02-21 20:11:15.487: 所有任务执行完毕, 打开电子书文件夹 => D:\Program Files\zhihuhelp\resources\app\知乎助手输出的电子书
遇到一样的问题,但作者好像不维护了:(