Pinned Repositories
bigdata
Introduction to Big Data
data-science-ipython-notebooks
Continually updated Data Science Python Notebooks: Spark, Hadoop MapReduce, HDFS, AWS, Kaggle, scikit-learn, matplotlib, pandas, NumPy, SciPy, and various command lines.
dedupe
A python library for accurate and scaleable data deduplication and entity-resolution.
DeepLearningMovies
Kaggle's competition for using Google's word2vec package for sentiment analysis
h2o
h2o = fast statistical, machine learning & math runtime for bigdata
langid.py
Stand-alone language identification system
mincemeatpy
Lightweight MapReduce in python
python-LDA
lda模型的python实现
nkhuyu's Repositories
nkhuyu/arrow-1
Better dates & times for Python
nkhuyu/cookiecutter
A command-line utility that creates projects from cookiecutters (project templates). E.g. Python package projects, jQuery plugin projects.
nkhuyu/demo-image-recognizer
nkhuyu/demo-lead-scoring
Lead scoring with ScienceOps Batch.
nkhuyu/falcon
Falcon is a low-level, high-performance Python framework for building HTTP APIs, app backends, and higher-level frameworks.
nkhuyu/fastText
Library for fast text representation and classification.
nkhuyu/google-analytics
A command-line interface and Python API wrapper for Google Analytics.
nkhuyu/jpmml-xgboost
Java library and command-line application for converting XGBoost models to PMML
nkhuyu/kaggle-talkingdata-visualization
Source code for blog post: Interactive Data Visualization of Geospatial Data using D3.js, DC.js, Leaflet.js and Python
nkhuyu/LightGBM
LightGBM is a fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
nkhuyu/models
Models built with TensorFlow
nkhuyu/moment
Parse, validate, manipulate, and display dates in javascript.
nkhuyu/moodle
Moodle - the world's open source learning platform
nkhuyu/mozaik
Mozaïk is a tool based on nodejs / react / d3 / stylus to easily craft beautiful dashboards.
nkhuyu/nameko
Python framework for building microservices
nkhuyu/ng2-admin
Angular 2 admin dashboard framework
nkhuyu/og-aws
📙 Amazon Web Services — a practical guide
nkhuyu/pachyderm
Containerized Data Analytics
nkhuyu/Paddle
PArallel Distributed Deep LEarning
nkhuyu/pandas-profiling
Create HTML profiling reports from pandas DataFrame objects
nkhuyu/paratext
A library for reading text files over multiple cores.
nkhuyu/pysparnn
Approximate Nearest Neighbor Search for Sparse Data in Python!
nkhuyu/pytrends
Pseudo API for Google Trends
nkhuyu/rancher
A Platform for Operating Docker in Production
nkhuyu/ranger
A VIM-inspired filemanager for the console
nkhuyu/RStartHere
A guide to some of the most useful R Packages that we know about
nkhuyu/SparkADMM
nkhuyu/speaker-recognition
A Real-time Speaker Recognition System with GUI
nkhuyu/speech-to-text-nodejs
:microphone: Sample Node.js Application for the IBM Watson Speech to Text Service
nkhuyu/tpot
A Python tool that automatically creates and optimizes machine learning pipelines using genetic programming.