VolhaP87
Data Scientist and Machine Learning Engineer who was a criminalist, with a passion for puzzles and curiosity for solving problems through analytics!
New York
Pinned Repositories
als-recommender-system-pyspark-lab
amazon-sagemaker-keras-text-classification
A step-by-step guide that shows how to do text classification by run training/inference for a custom model in Amazon SageMaker
Cryptocurrency_Prediction_Analysis
Forecasted price trends for the top two cryptocurrencies for a half year out starting September 2022. Detrended the data through subtracting the EWMA and differencing transformations. Modeled the data using various strategies including different orders of AR, MA, ARIMA, and SARIMA. Used the models with the lowest AIC for forecasting.
ds-classification_workflow_pipelines-efd32
ds-convolutional_neural_networks-kvm32
House_Sales_Price_Analysis
Developed an algorithm that predicted the best price for selling houses in the King County, Washington. Built various linear regression models based on statistically significant features, p-values and multicollinearity. Applied Recursive Feature Elimination and Brute Force approaches. Investigated linear regression assumptions.
Microsoft_Movie_Analysis
Performed descriptive analyses of movie datasets and provided three business recommendations for Microsoft in terms of what movies were doing the best at the box offices. Worked with compressed CSV files and SQL database. Cleaned and merged the data to use for analysis.
Office_Supplies_Recommendation_System
Recommended office supplies based on reviews of purchased products and advised if it would be valuable to offer products as a two-pack. Worked in surprise library and Spark programming environment. Built recommendation systems using SVD and ALS models. Performed A/B testing to determine the probability of success for a two-pack.
Stroke_Prediction_Analysis
Predicted if patients would develop stroke in their lifetime given clinical features of the patients. Applied one hot encoding and SMOTE-NC. Modeled the data using baseline and tuned Logistic Regression, Decision Tree, Bagged Trees, Random Forest, AdaBoost, Gradient Boosting, XGBoost, Naïve Bayes, KNN, SVM. Achieved recall score of 97%.
Twitter_Sentiment_Analysis
Conducted Twitter sentiment analysis on Google and Apple products. Employed a range of supervised ML algorithms and neural networks to tackle an imbalanced NLP multiclass classification problem. Selected the model with the highest macro F1 score for evaluation.
VolhaP87's Repositories
VolhaP87/Twitter_Sentiment_Analysis
Conducted Twitter sentiment analysis on Google and Apple products. Employed a range of supervised ML algorithms and neural networks to tackle an imbalanced NLP multiclass classification problem. Selected the model with the highest macro F1 score for evaluation.
VolhaP87/Cryptocurrency_Prediction_Analysis
Forecasted price trends for the top two cryptocurrencies for a half year out starting September 2022. Detrended the data through subtracting the EWMA and differencing transformations. Modeled the data using various strategies including different orders of AR, MA, ARIMA, and SARIMA. Used the models with the lowest AIC for forecasting.
VolhaP87/ds-convolutional_neural_networks-kvm32
VolhaP87/House_Sales_Price_Analysis
Developed an algorithm that predicted the best price for selling houses in the King County, Washington. Built various linear regression models based on statistically significant features, p-values and multicollinearity. Applied Recursive Feature Elimination and Brute Force approaches. Investigated linear regression assumptions.
VolhaP87/Office_Supplies_Recommendation_System
Recommended office supplies based on reviews of purchased products and advised if it would be valuable to offer products as a two-pack. Worked in surprise library and Spark programming environment. Built recommendation systems using SVD and ALS models. Performed A/B testing to determine the probability of success for a two-pack.
VolhaP87/dsc-classification-with-word-embeddings-codealong
VolhaP87/dsc-dash-deployment
VolhaP87/dsc-dash-intro
VolhaP87/dsc-facebook-prophet-lab
VolhaP87/dsc-flask-deployment
VolhaP87/dsc-generating-word-embeddings-lab
VolhaP87/dsc-graph-theory-shortest-path
VolhaP87/dsc-graph-theory-shortest-path-lab
VolhaP87/dsc-network-clustering
VolhaP87/dsc-network-clustering-lab
VolhaP87/dsc-network-community-detection-lab
VolhaP87/dsc-network-recomendation-systems-lab
VolhaP87/dsc-network-recommendation-systems
VolhaP87/dsc-networkX-intro
VolhaP87/dsc-networkX-intro-lab
VolhaP87/dsc-node-centrality
VolhaP87/dsc-node-centrality-lab
VolhaP87/dsc-sarima-models-lab
VolhaP87/dsc-using-pretrained-networks
VolhaP87/dsc-using-pretrained-networks-codealong
VolhaP87/dsc-visualizing-activation-functions-lab
VolhaP87/Food_App_Analysis
A 1-Week Live Data Project for Data Scientist by HiCounselor. Included Preprocessing and cleaning the data using Python as well as running SQL queries to solve business problems.
VolhaP87/Interview_Sample_Problem
VolhaP87/ML_Algorithms_Course
Public repo for 365 Data Science ML Algorithms Course
VolhaP87/VolhaP87