quocduyenanhnguyen
Welcome to my GitHub profile. I enjoy working with data and learning everyday.
Ho Chi Minh city, Vietnam
Pinned Repositories
8-Week-SQL-Challenge
I wrote SQL queries to answer questions from multiple case studies from the '8 Week SQL Challenge' created by Danny Ma.
Accidental-Drug-Related-Deaths-2012-2018-data-exploration-and-analysis
I wrote SQL to explore Accidental Drug Related Deaths in Connecticut 2012-2018 dataset.
Airlines_Web_Scrapping
I scrapped airline data from a Wiki page with Python, did some data cleaning with Google Sheet and SQL, then visualized the data with Tableau.
Analysis-of-Santa-Clara-Valley-bank
In this project, I built logistic regression model in RStudio to predict the probability of customers enroll in direct payroll deposit so that the bank can save their money and resources by contacting only customers that exhibit high probability and how best the bank can use my logistic regression model to help them strategize business goals.
California-Crime-Data-Analysis
I analyzed crime incident-based data in California in the year 2022. I used SQL for analysis and Tableau for visualization.
California-Gas-Prices
In this project, I scrapped data from a website to collect different types of gas data and their prices in California.
Human-trafficking-analysis
I analyzed human trafficking data
Optimization-Project
In this project, I built several optimization models to determine production level, manage shipment, and maximize thesis points. I also built queueing models to analyze queueing systems in a hospital and made recommendations to meet the criteria, and simulation models to analyze system performance under uncertainty.
Unix-commands-cheat-sheet
I compiled a list of helpful Unix commands
Yelp-analysis
Yelp data analysis of business rating, categories, any trends/patterns, correlation, etc.
quocduyenanhnguyen's Repositories
quocduyenanhnguyen/Optimization-Project
In this project, I built several optimization models to determine production level, manage shipment, and maximize thesis points. I also built queueing models to analyze queueing systems in a hospital and made recommendations to meet the criteria, and simulation models to analyze system performance under uncertainty.
quocduyenanhnguyen/8-Week-SQL-Challenge
I wrote SQL queries to answer questions from multiple case studies from the '8 Week SQL Challenge' created by Danny Ma.
quocduyenanhnguyen/Accidental-Drug-Related-Deaths-2012-2018-data-exploration-and-analysis
I wrote SQL to explore Accidental Drug Related Deaths in Connecticut 2012-2018 dataset.
quocduyenanhnguyen/Airlines_Web_Scrapping
I scrapped airline data from a Wiki page with Python, did some data cleaning with Google Sheet and SQL, then visualized the data with Tableau.
quocduyenanhnguyen/Analysis-of-Santa-Clara-Valley-bank
In this project, I built logistic regression model in RStudio to predict the probability of customers enroll in direct payroll deposit so that the bank can save their money and resources by contacting only customers that exhibit high probability and how best the bank can use my logistic regression model to help them strategize business goals.
quocduyenanhnguyen/ANOVA-of-baseball-league-team-Individual-Project
In this project, I wrote R codes to conduct ANOVA test to test for any differences in the mean attendance and among the mean attendance of the three divisions for the 14 baseball teams so that the management team can rearrange 14 teams into different divisions to improve attendance rate.
quocduyenanhnguyen/azure-docs
Open source documentation of Microsoft Azure
quocduyenanhnguyen/California-Crime-Data-Analysis
I analyzed crime incident-based data in California in the year 2022. I used SQL for analysis and Tableau for visualization.
quocduyenanhnguyen/California-Gas-Prices
In this project, I scrapped data from a website to collect different types of gas data and their prices in California.
quocduyenanhnguyen/Classify-range-of-mobile-phone-price
In this project, I visualized training dataset about mobile phone of various companies and built a logistic regression model to predict a range of mobile phone price for the stakeholder’s company.
quocduyenanhnguyen/Customer-Churn-Analytics-project
In this project, I visualized and built Random Forest model for gas and electricity datasets to investigate whether customer churn is affected by price sensitivity and what are the other factors that affect customer churn.
quocduyenanhnguyen/Data-Mining-Case-Study-Project
In this project, for supervised learning, I used regression and decision tree techniques to build predictive models and tested model accuracy by evaluating MSE and misclassification cost. For unsupervised learning, I performed cluster analysis on Iris dataset to identify subgroups and I used association rules to analyze transaction details in the Groceries dataset.
quocduyenanhnguyen/Data-Mining-Employee-Attrition-Project
In this project, I built predictive models to predict employee attrition rate and how best this fictional company can use my “best" model to improve their employee retention rate. The techniques I used to build my “best” model are forward stepwise, Lasso, and classification.
quocduyenanhnguyen/Human-trafficking-analysis
I analyzed human trafficking data
quocduyenanhnguyen/Item-Receipt-Python-Project
In this project, I wrote Python codes to develop simple item receipt by making multiple functions to ask users to select items and pay for the items. The items that I came up with are baby products.
quocduyenanhnguyen/Lung-Cancer-Mortality-data-exploration-and-analysis
I explored lung cancer mortality Kaggle dataset with SQL to generate insights.
quocduyenanhnguyen/Twitter-Despicable-Me-4-hashtag-engagement-analysis
In this project, I explored Despicable Me 4 hashtag on Twitter to gather engagement metrics for data analysis over a one week period.
quocduyenanhnguyen/Unix-commands-cheat-sheet
I compiled a list of helpful Unix commands
quocduyenanhnguyen/Yelp-analysis
Yelp data analysis of business rating, categories, any trends/patterns, correlation, etc.
quocduyenanhnguyen/quocduyenanhnguyen
quocduyenanhnguyen/ROI-modeling-and-Analysis-of-Sports-Dataset
In this project, you will find my ROI model for retirement savings and PowerPoint presentation of my ROI model, as well as my data analysis/visualization of Sports Ticket Sales dataset that I concluded with a PDF group written report
quocduyenanhnguyen/SQL-practice
In this project, I wrote SQL queries to manipulate and transform data from the Restaurant and Library database.
quocduyenanhnguyen/Statistical-Analysis-of-terminated-recalls
In this project, I wrote R codes to conduct hypothesis test of two population proportions on the terminated food recalls dataset to see if the year 2021 has less cases of terminated food recalls than the previous year.