manchhui
Aerospace engineer who has a passion in solving complex problems using scientific methods and processes to extract insights from data
Bristol, United Kingdom
Pinned Repositories
bathbib
BibTeX and biblatex styles for the University of Bath's Harvard referencing style
Enigma
A program that fully simulates the Enigma machine, encrypting and decrypting plaintext / cipher-text
HarvardX-CapStoneP-MovieLens
HarvardX DataScience Professional Certificate - Final Capstone Project. A movie recommedation system was created and tuned using Machine Learning algorithms.
HarvardX-CapStoneP-PulsarStar
HarvardX DataScience Professional Certificate - Final Capstone IDV Project - Predicting Pulsars using machine learning algorithms. In this project we determine which machine learning algorithm has the highest prediction accuracy in predicting Pulsars.
Kaggle-OpenVaccine-Competition-Entry
In this competition, the organisers were looking to leverage the data science expertise of the Kaggle community to develop models and design rules for RNA degradation. The developed models will predict likely degradation rates at each base of an RNA molecule, trained on a subset of an Eterna dataset comprising over 3000 RNA molecules (which span a panoply of sequences and structures) and their degradation rates at each position.
Udacity-DENG-Capstone
Udacity Data Engineering Nanodegree Programme - Capstone Project: Using Apache Airflow, we create a data warehouse from ground up with high grade ETL pipelines that are automated, easily monitored and have data quality checks that catch any discrepancies in the datasets.
Udacity-DENG-P4-Data-Lake
Udacity Data Engineering Nanodegree Programme - Project 4 - Data Lake - Using Apache Spark we create a Data Lake for start up Sparkify that allows dynamic ELT from and to AWS S3.
Udacity-DENG-P5-Airflow
Udacity Data Engineering Nanodegree Programme - Project 5 - Airflow - Using Apache Airflow to create high grade data pipelines that are dynamic and built from reusable tasks, can be monitored, and allow easy backfills.
Udemy-PythonBCamp-BorrowerDefault
Udemy Python for Data Science and Machine Learning Bootcamp - Final Project - Using "Artifical Neural Networks" (ANN) to predict borrower defaults.
manchhui's Repositories
manchhui/Udacity-DENG-Capstone
Udacity Data Engineering Nanodegree Programme - Capstone Project: Using Apache Airflow, we create a data warehouse from ground up with high grade ETL pipelines that are automated, easily monitored and have data quality checks that catch any discrepancies in the datasets.
manchhui/bathbib
BibTeX and biblatex styles for the University of Bath's Harvard referencing style
manchhui/Enigma
A program that fully simulates the Enigma machine, encrypting and decrypting plaintext / cipher-text
manchhui/HarvardX-CapStoneP-MovieLens
HarvardX DataScience Professional Certificate - Final Capstone Project. A movie recommedation system was created and tuned using Machine Learning algorithms.
manchhui/HarvardX-CapStoneP-PulsarStar
HarvardX DataScience Professional Certificate - Final Capstone IDV Project - Predicting Pulsars using machine learning algorithms. In this project we determine which machine learning algorithm has the highest prediction accuracy in predicting Pulsars.
manchhui/Kaggle-OpenVaccine-Competition-Entry
In this competition, the organisers were looking to leverage the data science expertise of the Kaggle community to develop models and design rules for RNA degradation. The developed models will predict likely degradation rates at each base of an RNA molecule, trained on a subset of an Eterna dataset comprising over 3000 RNA molecules (which span a panoply of sequences and structures) and their degradation rates at each position.
manchhui/Udacity-DENG-P4-Data-Lake
Udacity Data Engineering Nanodegree Programme - Project 4 - Data Lake - Using Apache Spark we create a Data Lake for start up Sparkify that allows dynamic ELT from and to AWS S3.
manchhui/Udacity-DENG-P5-Airflow
Udacity Data Engineering Nanodegree Programme - Project 5 - Airflow - Using Apache Airflow to create high grade data pipelines that are dynamic and built from reusable tasks, can be monitored, and allow easy backfills.
manchhui/Udemy-PythonBCamp-BorrowerDefault
Udemy Python for Data Science and Machine Learning Bootcamp - Final Project - Using "Artifical Neural Networks" (ANN) to predict borrower defaults.
manchhui/manchhui.github.io
manchhui/nlp-roadmap
ROADMAP(Mind Map) and KEYWORD for students those who have interest in learning NLP
manchhui/Pneumonia-Classification
manchhui/Policy-Gradient
manchhui/RSNA-Screening-Mammography
manchhui/STP
The Songbird Testing Proposal repository
manchhui/styles
Official repository for Citation Style Language (CSL) citation styles.
manchhui/sui
Sui, a next-generation smart contract platform with high throughput, low latency, and an asset-oriented programming model powered by the Move programming language
manchhui/Udacity-DENG-P1-Data-Modelling-With-Postgres
Udacity Data Engineering Nanodegree Programme - Project 1 - Data Modelling With Postgres
manchhui/Udacity-DENG-P2-Data-Modelling-With-Cassandra
Udacity Data Engineering Nanodegree Programme - Project 2 - Data Modelling With Cassandra
manchhui/Udacity-DENG-P3-Data-Warehouse
Udacity Data Engineering Nanodegree Programme - Project 3 - Data Warehouse
manchhui/Udemy-MySQL-Tableau
Udemy MySQL for Data Analytics and Business Intelligence Course - Final Project.
manchhui/US-Presidential-Election-Sentiment-Analysis
manchhui/Youtube-Code-Repository
Repository for most of the code from my YouTube channel