Pinned Repositories
applying-gradient-descent-data-science-intro-000
applying-gradient-descent-lab-data-science-intro-000
applying-nearest-neighbors-data-science-intro-000
Box_Office_Success
Predict movie profitability given movie budget, genres, facebook likes and many more features using logistic regression, an assortment of trees and svm.
College_Scorecard
Linear regression on college student completion rate from Department of Education's College Scorecard.
Dress-Image-Recognition
Pattern Classification of Dresses
Mass-Shootings-Time-Series
Time series forecasting with SARIMA, VAR, Fast Fourier Transform, Exponential Smoothing, Prophet and LSTM Network on US gun violence incidents that result in multiple casualties.
TripAdvisor_Recommender
Single Vector Decomposition recommender system with Surprise library
Twitter-Data-Visualization
Visual exploration of Democratic presidential candidate tweets' metadata.
Udemine-Scraper
It scrapes course description and reviews from Udemy.
thomsu's Repositories
thomsu/Udemine-Scraper
It scrapes course description and reviews from Udemy.
thomsu/Box_Office_Success
Predict movie profitability given movie budget, genres, facebook likes and many more features using logistic regression, an assortment of trees and svm.
thomsu/Dress-Image-Recognition
Pattern Classification of Dresses
thomsu/Mass-Shootings-Time-Series
Time series forecasting with SARIMA, VAR, Fast Fourier Transform, Exponential Smoothing, Prophet and LSTM Network on US gun violence incidents that result in multiple casualties.
thomsu/TripAdvisor_Recommender
Single Vector Decomposition recommender system with Surprise library
thomsu/Twitter-Data-Visualization
Visual exploration of Democratic presidential candidate tweets' metadata.
thomsu/Bellman_Ford_Negative_Cycle_Detection
thomsu/Bike-Sharing-Analysis
Analyze one year of bike sharing data to uncover insights on casual (non-member) riders in order to formulate marketing strategies aimed at converting casual riders into annual members.
thomsu/City-Employee-Payroll
Cross Analysis of the Payroll in America's 2 Largest Cities, New York City and Los Angeles with PySpark
thomsu/Dijkstra_Shortest_Path
thomsu/dsc-0-05-14-grouping-data-lab-nyc-ds-career-012819
thomsu/dsc-04-40-05-deeper-neural-networks-lab-nyc-ds-career-012819
thomsu/dsc-04-43-03-convolutional-neural-networks-code-along-nyc-ds-career-012819
thomsu/dsc-2-19-15-confidence-intervals-lab-nyc-ds-career-012819
thomsu/dsc-2-20-19-anova-nyc-ds-career-012819
thomsu/dsc-3-29-07-visualizing-confusion-matrices-lab-nyc-ds-career-012819
thomsu/dsc-4-39-03-collaborative-filtering-singular-value-decomposition-nyc-ds-career-012819
thomsu/dsc-database-admin-101-lab
thomsu/dsc-more-practice-with-sql-queries-lab
thomsu/dsc-sql-interview-questions-lab
thomsu/Ebay-Car-Ads
thomsu/Graph-Feature-Generator
thomsu/IT-Salary
Regression Model on IT Professional Salary in Europe
thomsu/Jeopardy-Challenge
thomsu/Missing-Values-Experiment
A fun mini experiment to test the predictions of tree ensembles without missing value replacement against the prediction from logistic regression using median and mode imputations.
thomsu/NQueens
Variation of the original n queens problem where the position of the first queen is fixed and the function has to place the remaining queens such that no other queens should be in the same row, column and diagonal axes. Instead of just a standard chessboard, the board size used for this simulation can be 5x5, 6x6, ... up to 10x10.
thomsu/nyc-mhtn-ds-012819-lectures
Lecture repo for 012819 cohort
thomsu/Online_Retail_w_PySpark
Using PySpark to perform EDA and Customer Segmentation.
thomsu/SQuAD-Question-Answering
Predict answer to question given a context text where the answer may be found with Stanford question answering dataset, SQuAD.
thomsu/Style-Transfer-PyTorch
Generate images to match the image style of another.