alexeygrigorev
Running @DataTalksClub and hacking some personal projects
@DataTalksClub Berlin, Germany
alexeygrigorev's Stars
openimaj/openimaj
The OpenIMAJ source code repository
scikit-learn-contrib/category_encoders
A library of sklearn compatible categorical variable encoders
aol/cyclops
An advanced, but easy to use, platform for writing functional applications in Java 8.
rushter/heamy
A set of useful tools for competitive data science.
coreylynch/pyFM
Factorization machines in python
ibayer/fastFM
fastFM: A Library for Factorization Machines
helvalius/nominatim-docker
Standalone nominatim server in a docker container
ChenglongChen/kaggle-HomeDepot
3rd Place Solution for HomeDepot Product Search Results Relevance Competition on Kaggle.
abhishekkrthakur/automl_gpu
ru-de/faq
Полезная информация о жизни в Германии
aanilpala/toy-reco
a toy recommender engine
ChenglongChen/kaggle-CrowdFlower
1st Place Solution for CrowdFlower Product Search Results Relevance Competition on Kaggle.
alexeygrigorev/project-mlp
a machine learning approach for processing mathematical language in scientific documents
jodaiber/Annotated-WikiExtractor
Simple Wikipedia plain text extractor with article link annotations and Hadoop support.
gereleth/kaggle-telstra
My code for Telstra Network Disruptions Kaggle competition
rhiever/datacleaner
A Python tool that automatically cleans data sets and readies them for analysis.
entron/entity-embedding-rossmann
yandex/rep
Machine Learning toolbox for Humans
jacopofar/wikidump-tools
en.wiktionary Part of Speech extractor for Italian, Wikipedia XML dump to plain text converter and tagger
robert-bor/aho-corasick
Java implementation of the Aho-Corasick algorithm for efficient string matching
markdregan/Bayesian-Modelling-in-Python
A python tutorial on bayesian modeling techniques (PyMC3)
rasbt/python-machine-learning-book
The "Python Machine Learning (1st edition)" book code repository and info resource
rouseguy/intro2stats
Introduction to Statistics using Python
jhclark/bigfatlm
Hadoop MapReduce training of modified Kneser-Ney smoothed language models
linkedin/FeatureFu
Library and tools for advanced feature engineering
donnemartin/data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.