kohjiaxuan

Data Scientist. I enjoy doing data science projects in my free time (update: no longer true because I'm doing my master's XD)

Singapore

Pinned Repositories

100-pandas-puzzles
100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
Language: Jupyter Notebook
0 1 00
Data-Science-Competition-for-Revenue-Maximization
Data Science Competition that challenged teams to come up with creative ways to increase the revenue of an e-commerce company. Won 1st place! Write-up in repository
1 1 01
Fraud-Detection-Pipeline
A structured data science pipeline for classification problems that performs scaling, sampling, and k-fold cross-validation with evaluation metrics
Language: Jupyter Notebook
1 2 00
lstm_basics
Basics of LSTM and autoencoders
Language: Jupyter Notebook
0 1 00
Machine-Learning-with-sklearn
Practice Jupyter Notebooks for my machine learning journey with Python. Please refer to other repositories for completed projects!
Language: Jupyter Notebook
0 1 00
NLP-Model-for-Corpus-Similarity
An NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
Language: Python
9 2 01
Predicting-HDB-Price-with-Machine-Learning
Data project on predicting HDB resale flat prices with data cleaning, feature engineering, and machine learning. Models used: Random Forest, XGBoost, Neural Networks, Decision Tree, Support Vector Regressors, Linear Regression
Language: Jupyter Notebook
2 2 00
Stock-Market-Dashboard
Creating a stock market dashboard from an external API that tracks daily performance of stocks
Language: Python
15 2 01
Visualisation-of-Gradient-Descent
By visualizing the gradient descent algorithm applied to a set of points that fit a quadratic equation, we better understand how the algorithm works in machine learning
Language: Jupyter Notebook
2 1 00
Wikipedia-Article-Scraper
A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics, and integrate it into a data pipeline without writing excessive code.
Language: Python
19 2 07

kohjiaxuan's Repositories

kohjiaxuan/Wikipedia-Article-Scraper
A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics, and integrate it into a data pipeline without writing excessive code.
Language: Python
19 2 07
kohjiaxuan/Stock-Market-Dashboard
Creating a stock market dashboard from an external API that tracks daily performance of stocks
Language: Python
15 2 01
kohjiaxuan/NLP-Model-for-Corpus-Similarity
An NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
Language: Python
9 2 01
kohjiaxuan/Predicting-HDB-Price-with-Machine-Learning
Data project on predicting HDB resale flat prices with data cleaning, feature engineering, and machine learning. Models used: Random Forest, XGBoost, Neural Networks, Decision Tree, Support Vector Regressors, Linear Regression
Language: Jupyter Notebook
2 2 00
kohjiaxuan/Visualisation-of-Gradient-Descent
By visualizing the gradient descent algorithm applied to a set of points that fit a quadratic equation, we better understand how the algorithm works in machine learning
Language: Jupyter Notebook
2 1 00
kohjiaxuan/Data-Science-Competition-for-Revenue-Maximization
Data Science Competition that challenged teams to come up with creative ways to increase the revenue of an e-commerce company. Won 1st place! Write-up in repository
1 1 01
kohjiaxuan/Fraud-Detection-Pipeline
A structured data science pipeline for classification problems that performs scaling, sampling, and k-fold cross-validation with evaluation metrics
Language: Jupyter Notebook
1 2 00
kohjiaxuan/100-pandas-puzzles
100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
Language: Jupyter Notebook
0 1 00
kohjiaxuan/lstm_basics
Basics of LSTM and autoencoders
Language: Jupyter Notebook
0 1 00
kohjiaxuan/Machine-Learning-with-sklearn
Practice Jupyter Notebooks for my machine learning journey with Python. Please refer to other repositories for completed projects!
Language: Jupyter Notebook
0 1 00
