carrotshub's Stars
JerryZhong/LDA
A modified GibbsLDA++ that uses perplexity to evaluate the topic model.
goto456/stopwords
Commonly used Chinese stopword lists (the Harbin Institute of Technology stopword list, the Baidu stopword list, and others)
HIT-SCIR/ltp
Language Technology Platform
TheAlgorithms/Python
All Algorithms implemented in Python
luyishisi/Anti-Anti-Spider
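More and more websites have anti-crawler features: some hide key data in images, and some use user-hostile CAPTCHAs. This repository collects anti-anti-crawler code and aims to improve technique by taking on websites with different defenses (no malicious intent). (Submissions of hard-to-scrape websites are welcome.) (Project paused for work reasons.)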
carrotshub/Chinese-sentiment-analysis-with-Doc2Vec
using jieba and doc2vec to implement sentiment analysis for Chinese docs
lybroman/Chinese-sentiment-analysis-with-Doc2Vec
using jieba and doc2vec to implement sentiment analysis for Chinese docs
carrotshub/ivideo
A client (Mac, Windows, Linux) for watching all videos from the mainstream video platforms in China
phobal/ivideo
A client (Mac, Windows, Linux) for watching all videos from the mainstream video platforms in China
carrotshub/AHP
Analytic hierarchy process (AHP), implemented in Python
Chalarangelo/30-seconds-of-python
Short Python code snippets for all your development needs
liangliangyy/DjangoBlog
🍺 A blog system based on Django
tensorflow/tensorflow
An Open Source Machine Learning Framework for Everyone
DigAPieceOfDay/DataAnalysisCommonProblem
Common data analysis interview questions
xiaoyichao/-python-gensim-LDA-
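LDA for Chinese text analysis, based on the Python gensim library. Examples like this are hard to find: nearly everything online targets English and almost nothing covers Chinese. It requires installing jieba for word segmentation; stopwords are then removed before LDA can be applied.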
Zephery/weiboflask
Weibo sentiment analysis with a RESTful API built with Flask; a spin-off of a graduation project
SpiderClub/smart_login
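Login methods for major websites: some log in through selenium, others simulate login directly from captured packets (no longer maintained, for lack of time)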
ageron/handson-ml
⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.
liangweiSam/Sproject
zhihu(scrapy)
xchaoinfo/fuck-login
Simulated login for some well-known websites, to make it easier to crawl sites that require login
carrotshub/lda
Topic modeling with latent Dirichlet allocation using Gibbs sampling
chaoming0625/SentimentPolarityAnalysis
Sentiment polarity analysis, repository 1: sentiment polarity analysis based on sentiment lexicons, k-NN, Bayes, maximum entropy, and SVM.
sloria/TextBlob
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
lda-project/lda
Topic modeling with latent Dirichlet allocation using Gibbs sampling
harvardnlp/sent-conv-torch
Text classification using a convolutional neural network.
memect/hao
好东西传送门 (Good Stuff Portal)
ZixuanKe/Ch2r_ood_understanding
SpiderClub/weibospider
:zap: A distributed crawler for weibo, built with celery and requests.
majinliang123/zhihu-crawler
A crawler for zhihu. If it fails to run, roll back to version da2f0d9e5488ec23bb91a2b6b24bcfff29d0304d