alexeygrigorev
Running @DataTalksClub and hacking some personal projects
@DataTalksClub Berlin, Germany
Pinned Repositories
data-science-interviews
Data science interview questions and answers
datascience-recruitment-challenges
Home assignments for data science positions
llm-rag-workshop
Chat with your own data - LLM+RAG workshop
mlbookcamp-code
The code from the Machine Learning Bookcamp book
outbrain-click-prediction-kaggle
Solution to the Outbrain Click Prediction competition
data-engineering-zoomcamp
Free Data Engineering course!
llm-zoomcamp
LLM Zoomcamp - a free online course about building a Q&A system
machine-learning-zoomcamp
Learn ML engineering for free in 4 months!
mlops-zoomcamp
Free MLOps course from DataTalks.Club
alexeygrigorev's Repositories
alexeygrigorev/outbrain-click-prediction-kaggle
Solution to the Outbrain Click Prediction competition
alexeygrigorev/avito-duplicates-kaggle
Solution for Avito Duplicate Ads Detection competition
alexeygrigorev/unpossibly-instagram-challenge
Predicting the number of likes an Instagram post will receive in 24 hours - winning solution
alexeygrigorev/cikm-cup-2016-cross-device
Solution for the Cross-Device linking challenge from CIKM CUP 2016
alexeygrigorev/projects
Various projects
alexeygrigorev/avito-page-view-prediction-boosters
Solution for the Avito Page View prediction competition (Avito BI contest task 3 on Boosters)
alexeygrigorev/kaggle
Scripts from Kaggle competitions
alexeygrigorev/many-stop-words
Stop word lists in several languages
alexeygrigorev/mahout
Mirror of Apache Mahout
alexeygrigorev/maven-repo
Artifacts not available on Maven Central
alexeygrigorev/yt8m-kaggle
The solution to the YouTube-8M Video Understanding Challenge
alexeygrigorev/barololometer
Search engine results tracker and comparer
alexeygrigorev/ds-toolbox
Data Science toolbox for Java
alexeygrigorev/namespacediscovery-pipeline
Mathematical namespace discovery
alexeygrigorev/notebooks
IPython notebooks
alexeygrigorev/project-mlp
A machine learning approach for processing mathematical language in scientific documents
alexeygrigorev/rest-crawler
A REST API for crawling web pages
alexeygrigorev/rseq
Sequence pattern matching library
alexeygrigorev/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
alexeygrigorev/allen-qa
The Allen AI Science Challenge
alexeygrigorev/boosters-evotor-gettingstarted
Getting started code for the Evotor competition on Boosters (Java)
alexeygrigorev/it4bi-ufrt-ir-project
Information Retrieval project at UFRT
alexeygrigorev/khb-mortality-analysis
Analysis of mortality and temperature data from Khabarovsk, Russia
alexeygrigorev/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
alexeygrigorev/MediawikiHighlight
highlight.js for MediaWiki
alexeygrigorev/namespacediscovery
Master thesis "Identifier Namespaces in Mathematical Notation"
alexeygrigorev/SimpleMathJax
alexeygrigorev/smile
Statistical Machine Intelligence & Learning Engine
alexeygrigorev/stolzen
Automatically exported from code.google.com/p/stolzen
alexeygrigorev/wiki-figures
Figures to post on wiki and other online resources