blaiszik's Stars
NSchrading/intro-spacy-nlp
An introduction to using spaCy for NLP and machine learning
globus/globus-sample-data-portal
A Python (Flask-based) web application implementing the Modern Research Data Portal pattern using the Globus platform.
NaturalNode/natural
General natural language facilities for Node.js
objectfox/PyBot
Resources for Slack Bots in Python
howdyai/botkit
Botkit is an open source developer tool for building chat bots, apps and custom integrations for major messaging platforms.
3Top/word2vec-api
Simple web service providing a word embedding model
Trindaz/EFZP
Parse an email to get properties like salutation, body, signature, reply.
tdhopper/notes-on-dirichlet-processes
🎲 Notes explaining Dirichlet Processes, HDPs, and Latent Dirichlet Allocation
vladsandulescu/topics
Topic modeling with gensim and LDA
keenlabs/KeenClient-Python
Official Python client for the Keen IO API. Build analytics features directly into your Python apps.
explosion/sense2vec
🦆 Contextually-keyed word vectors
NickleDave/ML-comparison-birdsong
Comparison of machine learning algorithms applied to classification of elements of birdsong
BVLC/caffe
Caffe: a fast open framework for deep learning.
matplotlib/matplotlib
matplotlib: plotting with Python
machinalis/iepy
Information Extraction in Python
jmschrei/pomegranate
Fast, flexible and easy to use probabilistic modelling in Python.
captainsafia/notebook-toc
A Jupyter Notebook extension for generating a table of contents at the top of a notebook
ydataai/ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
rlabbe/ThinkBayes
Jupyter Notebooks derived from Allen Downey's book Think Bayes.
awentzonline/image-analogies
Generate image analogies using neural matching and blending.
brandomr/document_cluster
A guide to document clustering in Python
nexpy/nexpy
NeXpy provides a high-level Python interface to NeXus data contained within a simple GUI. It is designed to provide an intuitive interactive toolbox allowing users both to access existing NeXus files and to create new NeXus-conforming data structures without expert knowledge of the file format.
dat-ecosystem/dat
💾 peer-to-peer sharing & live synchronization of files via command line
attardi/wikiextractor
A tool for extracting plain text from Wikipedia dumps
bwbaugh/wikipedia-extractor
This is a mirror of the script by Giuseppe Attardi, and contains history before the official repo started: https://github.com/attardi/wikiextractor --- Extracts and cleans text from Wikipedia database dump and stores output in a number of files of similar size in a given directory.
rspeer/python-ftfy
Fixes mojibake and other glitches in Unicode text, after the fact.
jbwhit/jupyter-tips-and-tricks
Using Project Jupyter for data science.
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
jakevdp/wpca
Weighted Principal Component Analysis (PCA) in Python
natesilva/jayschema
[Unmaintained] - A comprehensive JSON Schema validator for Node.js