jcyan's Stars
mikemaccana/python-docx
Reads, queries and modifies Microsoft Word 2007/2008 docx files.
Mendeley/mrec
A recommender systems development and evaluation package by Mendeley
timvieira/arsenal
Arsenal of python utilities.
luispedro/BuildingMachineLearningSystemsWithPython
Source Code for the book Building Machine Learning Systems with Python
getpelican/pelican
Static site generator that supports Markdown and reST syntax. Powered by Python.
bmcmurray/hekyll
a Jekyll generator for Impress.js presentations
jdan/cleaver
30-second slideshows for hackers
paulrouget/dzslides
DZSlides is a one-file HTML template to build slides in HTML5 and CSS3.
nakajima/slidedown
Generate syntax-highlighted slides from Markdown
joemccann/dillinger
The last Markdown editor, ever.
harryaskham/Twitter-L-LDA
A set of tools for performing Labeled Latent Dirichlet Allocation on textual datasets, with an emphasis on Twitter profiles. Contains tools for analysing the results of model training and inference.
shuyo/iir
Machine Learning / Natural Language Processing / Information Retrieval
VowpalWabbit/vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
shravanmn/Yahoo_LDA
Yahoo!'s topic modelling framework using Latent Dirichlet Allocation
d3/d3
Bring data to life with SVG, Canvas and HTML. :bar_chart::chart_with_upwards_trend::tada:
pprett/bolt
Bolt Online Learning Toolbox
scalala/Scalala
Scalala has been superseded by dlwh/breeze. Scalala is a high performance numeric linear algebra library for Scala, with rich Matlab-like operators on vectors and matrices; a library of numerical routines; support for plotting.
Yelp/mrjob
Run MapReduce jobs on Hadoop or Amazon Web Services
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
OmerShapira/Syntactic
Lexical categorization engine for large datasets. Good for NLP and Data Mining.
brendano/ark-tweet-nlp
CMU ARK Twitter Part-of-Speech Tagger
ogrisel/pignlproc
Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.
aritter/twitter_nlp
Twitter NLP Tools
Shopify/liquid
Liquid markup language. Safe, customer facing template language for flexible web apps.
python-mode/python-mode
Vim python-mode. PyLint, Rope, Pydoc, breakpoints from box.
dbamman/latex
LaTex examples
h2oai/h2o-2
Please visit https://github.com/h2oai/h2o-3 for latest H2O
kachayev/fn.py
Functional programming in Python: implementation of missing features to enjoy FP
jcyan/jcyan.github.com
personal website
douban/dpark
Python clone of Spark, a MapReduce alike framework in Python