ethan-eplee
Mechanical engineer turned finance professional, now a data science and machine learning enthusiast. Excited to have begun this journey and not looking back!
Singapore
Pinned Repositories
120-DS-Interview-Questions
My Answers to 120 Data Science Interview Questions
coursera-challenge-lab
Placed Top 50 in the first Coursera Challenge Labs to predict retention probabilities of Coursera users based on their subscription data. Achieved a ROC AUC score of 0.74 using a Random Forest model.
data-science-aws
Applied data science project using purpose-built ML tools in the AWS cloud, including AutoML, Amazon SageMaker and custom BERT models to build, train and deploy scalable, end-to-end ML pipelines for sentiment analysis
HorseRacePrediction
Using machine learning models to predict the outcome of a horse race, and running backtests to see whether betting on the predictions is profitable
Housing-Price-Prediction
Building a linear regression model to predict housing prices in Ames, USA - GA Data Science Immersive Project
Inventory-Forecasting
Completed this project as part of Cognizant's Artificial Intelligence Virtual Experience Program to help a business client optimize the level of inventory
PortfolioThatBeatsTheMarket
Brainstorming Folder
PySpark-Mini-Projects
A series of mini ML projects I have done to familiarize myself with PySpark
Subreddit-Classification
Using web scraping, APIs, NLP and classification modelling to classify posts from two subreddits
West-Nile-Virus-Prediction
Building a prediction model to help the city of Chicago decide where to spray pesticide to control the mosquito population, with a cost-benefit analysis covering projected annual costs at various levels of pesticide coverage (cost) and the effect of each coverage level (benefit).
ethan-eplee's Repositories
ethan-eplee/HorseRacePrediction
Using machine learning models to predict the outcome of a horse race, and running backtests to see whether betting on the predictions is profitable
ethan-eplee/PortfolioThatBeatsTheMarket
Brainstorming Folder
ethan-eplee/PySpark-Mini-Projects
A series of mini ML projects I have done to familiarize myself with PySpark
ethan-eplee/Subreddit-Classification
Using web scraping, APIs, NLP and classification modelling to classify posts from two subreddits
ethan-eplee/West-Nile-Virus-Prediction
Building a prediction model to help the city of Chicago decide where to spray pesticide to control the mosquito population, with a cost-benefit analysis covering projected annual costs at various levels of pesticide coverage (cost) and the effect of each coverage level (benefit).
ethan-eplee/coursera-challenge-lab
Placed Top 50 in the first Coursera Challenge Labs to predict retention probabilities of Coursera users based on their subscription data. Achieved a ROC AUC score of 0.74 using a Random Forest model.
ethan-eplee/data-science-aws
Applied data science project using purpose-built ML tools in the AWS cloud, including AutoML, Amazon SageMaker and custom BERT models to build, train and deploy scalable, end-to-end ML pipelines for sentiment analysis
ethan-eplee/Housing-Price-Prediction
Building a linear regression model to predict housing prices in Ames, USA - GA Data Science Immersive Project
ethan-eplee/Inventory-Forecasting
Completed this project as part of Cognizant's Artificial Intelligence Virtual Experience Program to help a business client optimize the level of inventory
ethan-eplee/cracking-the-data-science-interview
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
ethan-eplee/customer-flight-bookings
Data science project offered by British Airways as part of their virtual internship program to analyze customer feedback and to do predictive modelling on flight bookings
ethan-eplee/Deep-Learning-Specialization
Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai
ethan-eplee/DesigningMLSystems
Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)
ethan-eplee/Document-Summarizer
ethan-eplee/ethan-eplee
ethan-eplee/ethan-eplee.github.io
ethan-eplee/How-To-Repo
A text file describing some of the more important how-tos when setting up my personal GitHub
ethan-eplee/Kaggle-Fraud-Detection
It is important that credit card companies are able to recognize fraudulent credit card transactions so that customers are not charged for items that they did not purchase. In this Kaggle case, we will use various classification-based models to detect whether a transaction is a normal payment or fraud.
ethan-eplee/links
Links
ethan-eplee/Machine-Learning-Specialization-Coursera
Contains solutions and notes for the Machine Learning Specialization by Stanford University and Deeplearning.ai on Coursera (2022), taught by Prof. Andrew Ng
ethan-eplee/ML-for-algo-trading
Code for Machine Learning for Algorithmic Trading
ethan-eplee/Number-Digits-Classifier
Using the MNIST dataset to train classifier algorithms to visually predict number digits
ethan-eplee/PCA-Eigen-Portfolio
ethan-eplee/Personal-Repo
A place to store all my templates and code!
ethan-eplee/Plotly-Dashboards-with-Dash
This is the repo for the Udemy Course Python Dashboards with Plotly's Dash
ethan-eplee/python-fundamentals
A Python folder with a few code files collated from past courses on standard libraries (pandas, numpy, matplotlib, seaborn etc)
ethan-eplee/Standardised-Test-Participation-Analysis
An analysis of SAT and ACT participation rates - GA Data Science Immersive Project
ethan-eplee/Stock-Price-Prediction
Predicting stock price and returns using a supervised regression framework. Independent variables used are historical price data, correlated stocks, spot currency rates and price data of major indices.
ethan-eplee/StocksClustering
ethan-eplee/SVM-Facial-Recognition
Using the SVM algorithm for Facial Recognition