Pinned Repositories
danube_submit_jobs
danube_submit_jobs is an in-house Python application that automates the message ingestion process after a major deployment.
hadoopbook-demo
hadoopbook-demo collects a full-fledged project, comedy_comparison_etl, as well as other Hadoop applications.
kafka-streams-examples
kafka-streams-examples collects small projects that I worked on using Kafka Streams. It was inspired by the kafka-src streams-examples.
labs
This is a collection of tutorials for learning how to use Docker with various tools. Contributions welcome.
ml-spark-sklearn-tensor
Practice examples from "Hands-on Machine Learning with Scikit-Learn & TensorFlow" and ports of them to Spark.
scala_practice2
A collection of various Scala applications I wrote over time.
spark-structured-streaming2
Collects various applications using Spark Structured Streaming aggregations, windows, and joins.
spark2_emr
It collects Spark 2 projects deployed to AWS EMR. The whole process is automated, and the ALS recommendation jobs on 26 million MovieLens records finished in 15 minutes using limited AWS resources.
spark2_review
This collects various Spark projects gathered over the years from my work and personal projects.
spark_tutorial_2
spark_tutorial_2 collects what I worked on using the Spark 2 Dataset/DataFrame API in Scala, including A/B analysis on the TiVo Danube pipeline and a MovieLens recommendation system.
threecuptea's Repositories
threecuptea/kafka-streams-examples
kafka-streams-examples collects small projects that I worked on using Kafka Streams. It was inspired by the kafka-src streams-examples.
threecuptea/ml-spark-sklearn-tensor
Practice examples from "Hands-on Machine Learning with Scikit-Learn & TensorFlow" and ports of them to Spark.
threecuptea/spark-structured-streaming2
Collects various applications using Spark Structured Streaming aggregations, windows, and joins.
threecuptea/spark2_emr
It collects Spark 2 projects deployed to AWS EMR. The whole process is automated, and the ALS recommendation jobs on 26 million MovieLens records finished in 15 minutes using limited AWS resources.
threecuptea/spark_tutorial_2
spark_tutorial_2 collects what I worked on using the Spark 2 Dataset/DataFrame API in Scala, including A/B analysis on the TiVo Danube pipeline and a MovieLens recommendation system.
threecuptea/danube_submit_jobs
danube_submit_jobs is an in-house Python application that automates the message ingestion process after a major deployment.
threecuptea/hadoopbook-demo
hadoopbook-demo collects a full-fledged project, comedy_comparison_etl, as well as other Hadoop applications.
threecuptea/labs
This is a collection of tutorials for learning how to use Docker with various tools. Contributions welcome.
threecuptea/scala_practice2
A collection of various Scala applications I wrote over time.
threecuptea/spark2_review
This collects various Spark projects gathered over the years from my work and personal projects.
threecuptea/moby
Moby Project - a collaborative project for the container ecosystem to assemble container-based systems
threecuptea/python-docs-samples
Code samples used on cloud.google.com
threecuptea/python3
threecuptea/spark-learning
threecuptea/spark-python_16
spark-python_16 collects my rework of labs from the edx.org BerkeleyX course "CS120x: Distributed Machine Learning with Apache Spark".
threecuptea/spark_tutorial_16