RudraChatterjee
Data scientist with extensive experience in analyzing large datasets, visual storytelling, and interpreting results to make better data-driven decisions
Toronto, Ontario
Pinned Repositories
A-B_Testing_StatisticalAnalysis
E-news Express, an online news portal, aims to expand its business by acquiring new subscribers. This python based project performs EDA, data visualization and conducts a no of statistical tests to determine key business insights: in particular whether the new landing page can get greater viewership duration and subscriber conversion
Coursera_MachineLearningSpecialization_UW
Contains codes as part of the ML specialization offered by UW in Coursera
Data-Analysis-with-Py
Hotel-Cancellation_Prediction_Classification
A hotel chain is having issues with cancellations. This project analyzes customer booking data to identify which factors significantly influence cancellations, build models using logistic regression and decision trees to predict cancellations in advance, and help formulate profitable policies for cancellations and refunds for the hotel group
Machine-Failure_Prediction_EnsembleMethods_ModelTuning
This project predicts wind turbine failure using numerous sensor data by applying classification based ML models that improves prediction by tuning model hyperparameters and addressing class imbalance through over and under sampling data. Final model is productionized using a data pipeline
PricePrediction_Regression
Explored the dataset of a company that specializes in the reselling of used and refurbished devices. The objective of this project was to determine the future price of used phones and identify the factors that significantly influence them using a linear regression model with python
Unsupervised_Learning-Clustering
This project explores and analyzes financial data of a number of securities, applies Hierarchical and K-means clustering to group securities and create cluster profiles to develop personalized portfolios and investment strategies for clients
Visa-approval-prediction-EnsembleMethodsML
This project presents a ML based solution using Ensemble methods to predict which visa applications will be approved and thus recommend a suitable profile for applicants whose visa have a high chance of approval
RudraChatterjee's Repositories
RudraChatterjee/Machine-Failure_Prediction_EnsembleMethods_ModelTuning
This project predicts wind turbine failure using numerous sensor data by applying classification based ML models that improves prediction by tuning model hyperparameters and addressing class imbalance through over and under sampling data. Final model is productionized using a data pipeline
RudraChatterjee/PricePrediction_Regression
Explored the dataset of a company that specializes in the reselling of used and refurbished devices. The objective of this project was to determine the future price of used phones and identify the factors that significantly influence them using a linear regression model with python
RudraChatterjee/Visa-approval-prediction-EnsembleMethodsML
This project presents a ML based solution using Ensemble methods to predict which visa applications will be approved and thus recommend a suitable profile for applicants whose visa have a high chance of approval
RudraChatterjee/A-B_Testing_StatisticalAnalysis
E-news Express, an online news portal, aims to expand its business by acquiring new subscribers. This python based project performs EDA, data visualization and conducts a no of statistical tests to determine key business insights: in particular whether the new landing page can get greater viewership duration and subscriber conversion
RudraChatterjee/Coursera_MachineLearningSpecialization_UW
Contains codes as part of the ML specialization offered by UW in Coursera
RudraChatterjee/Data-Analysis-with-Py
RudraChatterjee/Hotel-Cancellation_Prediction_Classification
A hotel chain is having issues with cancellations. This project analyzes customer booking data to identify which factors significantly influence cancellations, build models using logistic regression and decision trees to predict cancellations in advance, and help formulate profitable policies for cancellations and refunds for the hotel group
RudraChatterjee/Unsupervised_Learning-Clustering
This project explores and analyzes financial data of a number of securities, applies Hierarchical and K-means clustering to group securities and create cluster profiles to develop personalized portfolios and investment strategies for clients
RudraChatterjee/Food-Aggregator-Customer-Behavior-EDA-DataViz_Python
Data Analysis and visualization using Python on customer data for different orders placed by registered customers in an online portal for a food aggregator company