Pinned Repositories
amazon-redshift-query-patterns-and-optimizations
In this workshop you will launch an Amazon Redshift cluster in your AWS account and load about 100 GB of sample data from the TPC-H dataset. You will learn which query patterns affect Redshift performance and how to optimize them. The lab also provides a framework to simulate workload management (WLM) queues, run concurrent queries at regular intervals, and measure performance metrics such as query throughput and query duration. It also covers some use cases for Redshift Spectrum, querying data in S3 stored in columnar formats such as Parquet.
AssociationRule
Unsupervised learning
Clustering
CreditCardPaymentDefault
Predict customer credit card defaults for the next month. Keywords: R, logistic regression, decision tree
CS61B
Notes for UC Berkeley's Fall 2021 section of CS 61B
datasharing
The Leek group guide to data sharing
eCommerceProductClassifiaction
Classify products into the right categories. Keywords: Python, scikit-learn, neural network
PredictForestFireBurnArea
Predict the burned area of forest fires in Portugal. Keywords: R, forward selection, ridge regression, lasso regression
R-Programming-Assignment-2
RenterChoices
Simulate how landlords might evaluate renters
tanghanrui's Repositories
tanghanrui/amazon-redshift-query-patterns-and-optimizations
In this workshop you will launch an Amazon Redshift cluster in your AWS account and load about 100 GB of sample data from the TPC-H dataset. You will learn which query patterns affect Redshift performance and how to optimize them. The lab also provides a framework to simulate workload management (WLM) queues, run concurrent queries at regular intervals, and measure performance metrics such as query throughput and query duration. It also covers some use cases for Redshift Spectrum, querying data in S3 stored in columnar formats such as Parquet.
tanghanrui/AssociationRule
Unsupervised learning
tanghanrui/Clustering
tanghanrui/CreditCardPaymentDefault
Predict customer credit card defaults for the next month. Keywords: R, logistic regression, decision tree
tanghanrui/CS61B
Notes for UC Berkeley's Fall 2021 section of CS 61B
tanghanrui/datasharing
The Leek group guide to data sharing
tanghanrui/eCommerceProductClassifiaction
Classify products into the right categories. Keywords: Python, scikit-learn, neural network
tanghanrui/PredictForestFireBurnArea
Predict the burned area of forest fires in Portugal. Keywords: R, forward selection, ridge regression, lasso regression
tanghanrui/R-Programming-Assignment-2
tanghanrui/RenterChoices
Simulate how landlords might evaluate renters
tanghanrui/SalesProAnalysis
Power BI visualization
tanghanrui/SlotMachine
A small program that lets you play a slot machine
tanghanrui/TMDBoxOfficeRev
Predict international box office revenue
tanghanrui/Wordfrequency
My first repo