chineseballer06
Data scientist with a passion for solving problems through analytics. Academic background in mathematics along with a career in professional basketball.
Denver, Colorado
Pinned Repositories
applying-gradient-descent-data-science-intro-000
applying-gradient-descent-lab-data-science-intro-000
applying-nearest-neighbors-data-science-intro-000
calculating-distance-data-science-intro-000
King-County-Housing
Predicted the sales prices of homes in King County using multivariate linear regression models.
Lending-Club-Loan-Defaults
Applied deep learning models to classify which types of loans would be more likely to default. Cleaned and analyzed loan data using Pandas and NumPy to prepare data for classification models. Explored different features of a loan with Matplotlib and Seaborn to gain further insight for feature selection. Built different MLP models from Keras in order to determine which loans would default.
NBA-Shot-Analysis
Analyzed NBA data to gain insight on which shooting features indicated a made field goal attempt. Mined data for important trends and features using Pandas, NumPy, and Matplotlib to be plugged into machine learning models. Ran various machine learning models such as XGBoost, Random Forest, and Logistic Regression to predict whether a shot would be successful or not. Optimized different machine learning models using Scikit-Learn’s GridSearch in order to tune hyperparameters.
Statistical-Analysis-of-Northwind-Database
Performed various hypothesis testing to answer questions regarding statistical significance using the Northwind database. Used SQLAlchemy and Python to query and examine the data from the Northwind database in order to perform statistical analysis. Goal was to answer if discounts have a statistically significant effect on the number of products customers order by conducting hypothesis testing.
Stock-Analysis
Predicted and analyzed future stock prices using Monte Carlo simulations. Used pandas-datareader package to scrape historical stock prices of various companies. Calculated CAGR and Annualized Volatility to be fed into a Monte Carlo simulation.
Traffic-Accidents
Predicted where accidents in the UK are more likely to occur using satellite imagery. Extracted important traffic data features using machine learning algorithms to be used in deep learning models. Scraped satellite images throughout the UK using Google Maps Static API to train different CNN models. Constructed a mixed-input deep learning model using Keras functional API to predict where dangerous accidents are likely to occur.
chineseballer06's Repositories
chineseballer06 doesn’t have any repository yet.