Pinned Repositories
chav-ngvyen.github.io
Repo for my portfolio page https://www.chaungvyen.com/
clustering_D3
Light-hearted "scrollytelling" tutorial of DBSCAN and HDBSCAN (clustering algorithms) using D3.js. My first stab at web dev/ HTML/ SVG/ D3/ JavaScript https://www.chaungvyen.com/clustering_D3/
colorblind
Colblindor within a notebook - Tutorial on how to setup and use the colorblind package inside a Jupyter Notebook to check data visualizations for colorblind accessibility. https://www.chaungvyen.com/colorblind/
forest_fire_prediction
hugging_face_course
massive_data
Repo for Massive Data Fundamentals with SQL Specialization
neural_nets_deep_learning
This repo houses my scripts for Neural Nets & Deep Learning (ANLY-590) where I code a universal Machine Learning workflow, gradient descent, loss functions, optimizer, linear and logistic regressions from scratch using numpy, then move onto building neural nets pipelines with hyper-parameter tuning in keras-Tensorflow and PyTorch.
NFL_home_field_advantage_dyad
This repo houses my project where I study home field advantage in the NFL by pairing teams into dyads in a novel dataset I created from scratch. Included are scripts I created to scrape the web for game data (time, location, fan attendance and outcome), GPS coordinates of NFL stadiums, teams' ratings, and codes clean & merge data sets, pre-process features, train models, generate visualizations, and compile the report in LaTeX.
NLP_tvtropes
What is a dragon? When is a dragon *not* a dragon? Standard NLP tools are not sufficient to study a corpus like TVTropes, one full of subtext and sarcasm. Therefore, I train custom word embeddings using the TVTropes corpus.
PySpark_reddit
Used SparkSQL + SparkML + PySpark to train & tune an imbalanced classifier for on 8 millions rows of comment data from subreddit /r/pcmasterrace.
chav-ngvyen's Repositories
chav-ngvyen/clustering_D3
Light-hearted "scrollytelling" tutorial of DBSCAN and HDBSCAN (clustering algorithms) using D3.js. My first stab at web dev/ HTML/ SVG/ D3/ JavaScript https://www.chaungvyen.com/clustering_D3/
chav-ngvyen/chav-ngvyen.github.io
Repo for my portfolio page https://www.chaungvyen.com/
chav-ngvyen/colorblind
Colblindor within a notebook - Tutorial on how to setup and use the colorblind package inside a Jupyter Notebook to check data visualizations for colorblind accessibility. https://www.chaungvyen.com/colorblind/
chav-ngvyen/forest_fire_prediction
chav-ngvyen/hugging_face_course
chav-ngvyen/massive_data
Repo for Massive Data Fundamentals with SQL Specialization
chav-ngvyen/neural_nets_deep_learning
This repo houses my scripts for Neural Nets & Deep Learning (ANLY-590) where I code a universal Machine Learning workflow, gradient descent, loss functions, optimizer, linear and logistic regressions from scratch using numpy, then move onto building neural nets pipelines with hyper-parameter tuning in keras-Tensorflow and PyTorch.
chav-ngvyen/NFL_home_field_advantage_dyad
This repo houses my project where I study home field advantage in the NFL by pairing teams into dyads in a novel dataset I created from scratch. Included are scripts I created to scrape the web for game data (time, location, fan attendance and outcome), GPS coordinates of NFL stadiums, teams' ratings, and codes clean & merge data sets, pre-process features, train models, generate visualizations, and compile the report in LaTeX.
chav-ngvyen/NLP_tvtropes
What is a dragon? When is a dragon *not* a dragon? Standard NLP tools are not sufficient to study a corpus like TVTropes, one full of subtext and sarcasm. Therefore, I train custom word embeddings using the TVTropes corpus.
chav-ngvyen/PySpark_reddit
Used SparkSQL + SparkML + PySpark to train & tune an imbalanced classifier for on 8 millions rows of comment data from subreddit /r/pcmasterrace.
chav-ngvyen/PPOL563_data_viz
This repo will be used for my code and data for Data Visualization (PPOL 563)
chav-ngvyen/resume
Repo for my resume. Web accessible at https://www.chaungvyen.com/resume/
chav-ngvyen/unsupervised_machine_learning
This repo houses my codes for Advanced Modeling Techniques/ Unsupervised Machine Learning.
chav-ngvyen/Yelp_NLP