/GithubCrawler

Crawl the project information on github to customize the amount of information, such as crawling content, quantity, and sorting methods

Primary LanguagePython

Crawl the project information on github to customize the amount of information, such as crawling content, quantity, and sorting methods

从github上爬取处理的流程大致为:根据定义的量化信息爬取页面、解析html页面取到相关内容信息、进行内容存储。

爬虫项目

爬虫项目

爬虫流程图

爬虫流程图