Pinned Repositories
llm_forecasting
Forecasting with LLMs
AWS-Redshift-ETL-Pipeline-Development-for-Music-Streaming-Startup
Data-Cleaning-Assistant
Data Cleaning Assistant is a web application that uses OpenAI's GPT 3.5 model to suggest data cleaning tasks for a given dataset.
Finding_donors_for_a_charity
Comparing Random Forest, Gradient Boosting, and XGBoost to select the best model to predict potential donors for a Charity.
K-Means_Project_Banknote_Authentication
Built a K-Means Model to detect if a banknote is genuine or forged.
Marketing_Analytics
Using Python to conduct EDA, perform statistical analysis, visualize insights, and present data-driven solutions to Chief Marketing Officer in the company
Random_Forest_Project_Predicting_Popularity_of_Online_News_Articles
Telco_Customer_Churn_Analysis
Use Multiple Linear Regression, Python, Pandas, and Matplotlib to analyze the lifetime value and the key factors of the ‘Telco Customer Churn’ dataset.
Video_Game_Sales_Analysis
Analyze sales data from more than 16,500 games.
WeRateDogs-Twitter-Analysis
Use Twitter Api and Pandas to gather and conduct data cleaning
YuehHanChen's Repositories
YuehHanChen/Marketing_Analytics
Using Python to conduct EDA, perform statistical analysis, visualize insights, and present data-driven solutions to Chief Marketing Officer in the company
YuehHanChen/Telco_Customer_Churn_Analysis
Use Multiple Linear Regression, Python, Pandas, and Matplotlib to analyze the lifetime value and the key factors of the ‘Telco Customer Churn’ dataset.
YuehHanChen/K-Means_Project_Banknote_Authentication
Built a K-Means Model to detect if a banknote is genuine or forged.
YuehHanChen/Finding_donors_for_a_charity
Comparing Random Forest, Gradient Boosting, and XGBoost to select the best model to predict potential donors for a Charity.
YuehHanChen/Random_Forest_Project_Predicting_Popularity_of_Online_News_Articles
YuehHanChen/Video_Game_Sales_Analysis
Analyze sales data from more than 16,500 games.
YuehHanChen/AWS-Redshift-ETL-Pipeline-Development-for-Music-Streaming-Startup
YuehHanChen/Data-Cleaning-Assistant
Data Cleaning Assistant is a web application that uses OpenAI's GPT 3.5 model to suggest data cleaning tasks for a given dataset.
YuehHanChen/WeRateDogs-Twitter-Analysis
Use Twitter Api and Pandas to gather and conduct data cleaning
YuehHanChen/AB_Testing
Conducted A/B tests and regression analysis to examine the conversion rate difference between the control and experiment groups.
YuehHanChen/Data-Modeling-for-a-Music-Streaming-App
Modeled the data with Python and Apache Cassandra, which enabled the analytics team to write queries and answer questions like “Give me every username who listened to a given song”
YuehHanChen/DO-LEFT-HANDED-PEOPLE-REALLY-DIE-YOUNG
Use pandas, Bayesian statistics, and Hypothesis testing to see if left-handed people actually die earlier than righties.
YuehHanChen/Mall_Customer_Segmentation
Comparing KMeans, Hierarchical Clustering, and GMMs and selecting the best model to segment customer information using the Silhouette score.
YuehHanChen/PISA-Analysis
Use Pandas, Matplotlib, and Seaborn to conduct exploratory data analysis by Univariate, Bivariate, and Multivariate visual exploration.
YuehHanChen/Analysis_template
YuehHanChen/Analyze-A-B-Test-Results
Use Regression to perform A/B testing.
YuehHanChen/Crawler-for-Famous-Entrepreneurs-popular-videos
YuehHanChen/Crawler-for-gaming-articles
YuehHanChen/Crawler-for-picture-from-SSENCE
A crawler for downloading all the pictures of men jackets, which is higher than 2000 dollars from SSENSE(fashion website)
YuehHanChen/CS165
YuehHanChen/Data-Wrangling
Data-Wrangling-Practice
YuehHanChen/fxxxxw
YuehHanChen/Identify-Customer-Segments
Unsupervised learning techniques to identify segments of the population that form the core customer base for a mail-order sales company in Germany
YuehHanChen/Marketing-Job-Market-Analysis
Used python(numpy, pandas, matplotib, and seaborn) to analyze the data from h1bdata.info to understand the situation of H1B for marketing jobs.
YuehHanChen/Product_Roles_Job_Market_Analysis
YuehHanChen/TMDb-movie-data-analysis
Focus on Exploratory Data Analysis
YuehHanChen/Trending-YouTube-Video-Analysis
Use Python, Pandas, and Matplotlib to analyize “Trending YouTube Video Statistics”, including Data Assessing, Data Cleaning, EDA, Visualization and Drawing conclusion.