Zhihu-Spider: A JavaScript repository from starkwang

#Zhihu Social Graph Spider

#Usage

1. Initialize

git clone https://github.com/starkwang/Spider.git && cd Spider

npm run init

2. Configure

Using server.config.example.js and spider.config.example.js as references, create your own server.config.js and spider.config.js.

3. Build and start

npm run build

npm run start // Server runs at localhost:3000

#Configuration

1. spider.config.js

Zhihu's API is rather unstable, so a high concurrency value may cause the spider to hang. On a poor network connection, set concurrency to 2 or 1.
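To illustrate what a concurrency limit does, here is a minimal sketch of a task runner that keeps at most `concurrency` requests in flight at once. It is not taken from the spider's source: the helper name `runWithConcurrency` and its signature are hypothetical, and the project may implement the limit differently.

```javascript
// Hypothetical helper, not part of Zhihu-Spider: runs async tasks with
// at most `concurrency` of them in flight at any time.
// `tasks` is an array of functions that each return a Promise.
async function runWithConcurrency(tasks, concurrency) {
  const results = new Array(tasks.length);
  let next = 0; // index of the next task to start

  // Each worker repeatedly claims the next unstarted task until none remain.
  async function worker() {
    while (next < tasks.length) {
      const i = next++;
      results[i] = await tasks[i]();
    }
  }

  // Start no more workers than there are tasks, and at least one.
  const workerCount = Math.max(1, Math.min(concurrency, tasks.length));
  await Promise.all(Array.from({ length: workerCount }, worker));
  return results; // results keep the order of `tasks`
}
```

With a value of 1 the tasks run strictly one after another, which is the gentlest setting for an unstable API; larger values trade stability for speed.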

2. server.config.js

###Appendix: how to obtain the cookie and _xsrf values

Open the followers page of any Zhihu user, for example https://www.zhihu.com/people/starkwei/followers

Open the browser's developer console and select the Network tab.

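Scroll down the page and more followers load automatically; you can see repeated requests to the /node/ProfileFollowersListV2 endpoint. Open the details of one of these requests to find the Cookie and _xsrf values.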
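As a rough sketch of how the two copied values might be attached to a request, the function below only builds a request description and sends nothing. The function name `buildFollowersRequest`, the POST method, the form encoding, and the shape of the returned object are all assumptions for illustration; only the endpoint path, the Cookie, and _xsrf come from the steps above, and a real request likely carries additional fields.

```javascript
// Hypothetical helper, not part of Zhihu-Spider. It shows where the Cookie
// and _xsrf values copied from the browser would go in a request.
function buildFollowersRequest(cookie, xsrf) {
  // Assumption: _xsrf travels in a form-encoded body. Other body fields
  // that the real endpoint expects are deliberately left out here.
  const body = new URLSearchParams({ _xsrf: xsrf });

  return {
    url: 'https://www.zhihu.com/node/ProfileFollowersListV2',
    method: 'POST', // assumption, check the request details in the Network tab
    headers: {
      Cookie: cookie,
      'Content-Type': 'application/x-www-form-urlencoded',
    },
    body: body.toString(),
  };
}
```

Compare the result against the request shown in the Network tab and copy any missing fields from there rather than guessing them.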

#Known bugs or limitations
