Practicum by Yandex Projects

These are Data Science projects I completed during the Data Science training course at Practicum by Yandex. They cover Data Preprocessing, Analysis and Visualization, Statistical Analysis, Machine Learning, Machine Learning for Texts, Time Series, and Computer Vision, among other topics.

| Project | Description | Libraries |
| --- | --- | --- |
| Credit Score Analysis | Utilizing different metrics to determine the likelihood that a customer defaults on a loan | pandas, nltk |
| Vehicle Price Analysis | Studying vehicle ads to determine which factors influence vehicle price | pandas, matplotlib, numpy |
| Phone Plan Analysis | Selecting the plan with the highest potential revenue for different target markets and conducting hypothesis testing | pandas, matplotlib, scipy, numpy, nltk |
| Video Game Analysis | Identifying patterns that determine a game's success, creating a user profile for each target market, and conducting hypothesis testing | pandas, matplotlib, scipy, numpy, nltk |
| Taxi Company Analysis | Identifying the top taxi companies and the neighborhoods with the most drop-offs, and conducting hypothesis testing | pandas, matplotlib, scipy, numpy, nltk |
| Telecom Plans ML Model | Studying customer behavior and building a machine learning model that recommends phone plans | pandas, sklearn |
| Bank Customers Model | Creating a classification model to predict customer churn from an imbalanced dataset | pandas, sklearn |
| Oil Well Model | Analyzing data from several petroleum reservoir regions and oil wells to calculate the profitability and risk of loss of developing a new well in each region | pandas, numpy, matplotlib, seaborn |
| Gold Extraction Model | Building a machine learning model that predicts the amount of gold recovered from the extraction process in order to optimize production | pandas, matplotlib, numpy, sklearn |
| Insurance Company Data Masking | Developing a data transformation algorithm for data masking, ensuring that it does not impact the quality of a machine learning model | pandas, numpy, sklearn |
| Car Price Model | Building a model using historical car data for an app feature that can determine the market value of a user's car | pandas, numpy, sklearn, time, catboost, lightgbm |
| Taxi Orders Model | Using historical data on taxi orders at airports to create a model that predicts the number of taxi orders for any given time frame | pandas, numpy, matplotlib, sklearn, catboost, lightgbm, statsmodels |
| Movie Sentiment Analysis | Training a model that classifies reviews as positive or negative | pandas, numpy, math, re, nltk, matplotlib, seaborn, tqdm |
| Supermarket Chain Model | Training and evaluating a model to automatically detect the age of a customer at a checkout counter | pandas, numpy, matplotlib, seaborn, tensorflow |
| Telecom Client Churn Model | Training a model to forecast whether a client is going to leave | pandas, numpy, sklearn, catboost, lightgbm |