haocai1992
Data Scientist living in Toronto. PhD in Biophysics. Passionate about ML and big data.
@ShopifyToronto
Pinned Repositories
AI-drug
Dashboard of AI startups in the drug discovery field
Big-Data-Analysis-with-Scala-and-Spark
My homework repository for Coursera's "Big Data Analysis with Scala and Spark" taught by EPFL.
GPT2-News-Classifier
A Streamlit app running a GPT-2 language model for text classification, built with PyTorch, Transformers and AWS SageMaker.
MovieRecommender
A movie recommendation system built using Scala, Spark and Hadoop
projectStructure
proteinpy
A bioinformatics package for parsing and analyzing protein structure data (PDB format) using pandas DataFrames.
PScore-online
ML-powered bioinformatics tool that predicts protein phase separation from sequence
SparkDemo
A demo Spark project created using IntelliJ IDEA
Venturescope
Venturescope: NLP-powered web app that predicts a startup's survival from tweets
LLPhyScore
An interpretable machine learning algorithm to predict disordered protein phase separation based on biophysical interactions
haocai1992's Repositories
haocai1992/GPT2-News-Classifier
A Streamlit app running a GPT-2 language model for text classification, built with PyTorch, Transformers and AWS SageMaker.
haocai1992/MovieRecommender
A movie recommendation system built using Scala, Spark and Hadoop
haocai1992/proteinpy
A bioinformatics package for parsing and analyzing protein structure data (PDB format) using pandas DataFrames.
haocai1992/PScore-online
ML-powered bioinformatics tool that predicts protein phase separation from sequence
haocai1992/AI-drug
Dashboard of AI startups in the drug discovery field
haocai1992/Big-Data-Analysis-with-Scala-and-Spark
My homework repository for Coursera's "Big Data Analysis with Scala and Spark" taught by EPFL.
haocai1992/projectStructure
haocai1992/SparkDemo
A demo Spark project created using IntelliJ IDEA
haocai1992/Venturescope
Venturescope: NLP-powered web app that predicts a startup's survival from tweets
haocai1992/Deep-Learning-Specialization
Course notes, quizzes, and programming assignments for DeepLearning.AI's "Deep Learning Specialization"
haocai1992/haocai1992.github.io
A blog about big data & machine learning
haocai1992/ML-algorithms-from-scratch
Implementation of popular ML algorithms from scratch
haocai1992/MLOps-Specialization
Course notes, quizzes, and programming assignments for DeepLearning.AI's "Machine Learning Engineering for Production (MLOps) Specialization"