graus's Stars
mit-ccc/RadioTalk
The RadioTalk dataset of talk radio transcripts
polyval/goodreads-crawler
知乎回答:https://www.zhihu.com/question/19929256/answer/72976430
brendan-w/python-OBD
OBD-II serial module for reading engine data
mjugo/StreamingRec
A news recommendation evaluation framework
textpipe/textpipe
Textpipe: clean and extract metadata from text
blue-yonder/tsfresh
Automatic extraction of relevant features from time series:
clips/pattern
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
niderhoff/nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
gasevi/pyreclab
pyRecLab is a library for quickly testing and prototyping of traditional recommender system methods, such as User KNN, Item KNN and FunkSVD Collaborative Filtering. It is developed and maintained by Gabriel Sepúlveda and Vicente Domínguez, advised by Prof. Denis Parra, all of them in Computer Science Department at PUC Chile, IA Lab and SocVis Lab.
glample/tagger
Named Entity Recognition Tool
fxsjy/jieba
结巴中文分词
lalinsky/mbslave
DEPRECATED: use mbdata instead
Qfusion/qfusion
Source code for cross-platform OpenGL gaming engine
nltk/nltk
NLTK Source
larsmans/seqlearn
Sequence learning toolkit for Python
wimmuskee/readability-score
A Python library to calculate the readability score of a text.
dnouri/nolearn
Combines the ease of use of scikit-learn with the power of Theano/Lasagne
knowitall/reverb
Web-Scale Open Information Extraction
larsmans/weighwords
Python library for creating word clouds from text
piskvorky/gensim
Topic Modelling for Humans
ptwobrussell/Recipes-for-Mining-Twitter
Adaptations and Extensions of Twitter-Related Examples from Mining the Social Web