Pinned Repositories
amazon-reviews
LASSO + XGBoost + text2vec ensemble to predict sentiment in R
are-bots-more-emotional
Sentiment Analysis of Twitter Spam
Business-Analyst-Nanodegree
Udacity Business Analyst Nanodegree
concrete_NLP_tutorial
An NLP workshop about concrete solutions to real problems
DeepNLP-Course
Deep NLP Course at ABBYY
franz-plugins
Franz Plugin Repository
geostat18-links
Links and slides from GEOSTAT2018
gis-projects
GIS and Remote Sensing repo
SeekingAlpha_project
Collection and Analysis of text from SeekingAlpha.com in financial project
Severity_models_light_gbm
Modelling Average cost for claims using Light_GBM - as a Tutorial on this algorithm
prokopyev's Repositories
prokopyev/amazon-reviews
LASSO + XGBoost + text2vec ensemble to predict sentiment in R
prokopyev/franz-plugins
Franz Plugin Repository
prokopyev/are-bots-more-emotional
Sentiment Analysis of Twitter Spam
prokopyev/Business-Analyst-Nanodegree
Udacity Business Analyst Nanodegree
prokopyev/Severity_models_light_gbm
Modelling Average cost for claims using Light_GBM - as a Tutorial on this algorithm
prokopyev/twitter_scraping
Grab all a user's tweets (and get past 3200 limit)
prokopyev/gis-projects
GIS and Remote Sensing repo
prokopyev/active_stream
Active learning support for targeted Twitter stream
prokopyev/analyze-ods
prokopyev/datacleaner
A Python tool that automatically cleans data sets and readies them for analysis.
prokopyev/facebook-news
prokopyev/facebook-page-post-scraper
Data scraper for Facebook Pages, and also code accompanying the blog post How to Scrape Data From Facebook Page Posts for Statistical Analysis
prokopyev/GBM-tune
Tuning GBMs (hyperparameter tuning) and impact on out-of-sample predictions
prokopyev/gunsales
Statistical analysis of monthly background checks of gun purchases
prokopyev/heamy
A set of useful tools for competitive data science.
prokopyev/kaggle-house-prices
House Prices: Advanced Regression Techniques
prokopyev/Kaggle-Quora
Kaggle Quora Questions Pairs Competition
prokopyev/kaggle-quora-dup
Solution to Kaggle's Quora Duplicate Question Detection Competition
prokopyev/kaggle-quora-question-pairs
My solution to Kaggle Quora Question Pairs competition (Top 2%, Private LB log loss 0.13497).
prokopyev/Machine-Learning-Engineer-Nanodegree
Projects done on Udacity: Machine Learning Engineer Nanodegree
prokopyev/messenger-platform-samples
Messenger Platform samples for sending and receiving messages. Walk through the Get Started with this code. https://developers.facebook.com/docs/messenger-platform/quickstart
prokopyev/nics-firearm-background-checks
Monthly data from the FBI's National Instant Criminal Background Check System, converted from PDF to CSV.
prokopyev/NLP_HSE_school
HSE MIEM NLP school project
prokopyev/py-jump
prokopyev/sql-samples
prokopyev/states-of-fragility.github.io
prokopyev/timeseries-rady
Time-series analysis in R
prokopyev/tpot
A Python tool that automatically creates and optimizes machine learning pipelines using genetic programming.
prokopyev/wikipedia-word-frequency
Gather modern English word frequencies from all enwiki articles.
prokopyev/yelp-challenge-2017
Project for a Data Mining and Predictive Analytics Class