Pinned Repositories
advanced-transformation
This is katas for week 2 comprising of:
basic-aws-infrastructure
Golden Copy to help you set up a basic AWS environment with an EMR cluster behind a VPC.
basic-transformations
This repository has been built to help people learn how to do basic transformations on a single DataFrame in Spark + Scala.
data-transformations
Started code base for Spark + Scala project.
infra-twdu2b
Basic infrastructure for TWDU 2 Team B
join-transformations
This repository will walk you through several katas for learning how to do joins with Spark+Scala.
semi-structured-data-transformations
This repository has been built to help people learn how to work with semi-structured data sources with Spark+Scala.
transformations
Katas for transforming data with Spark + Scala.
twdu2b
Streaming Data Pipeline for twdu2b
chandnipatelTW's Repositories
chandnipatelTW/twdu2b
Streaming Data Pipeline for twdu2b
chandnipatelTW/basic-aws-infrastructure
Golden Copy to help you set up a basic AWS environment with an EMR cluster behind a VPC.
chandnipatelTW/data-transformations
Started code base for Spark + Scala project.
chandnipatelTW/infra-twdu2b
Basic infrastructure for TWDU 2 Team B
chandnipatelTW/join-transformations
This repository will walk you through several katas for learning how to do joins with Spark+Scala.
chandnipatelTW/advanced-transformation
This is katas for week 2 comprising of:
chandnipatelTW/basic-transformations
This repository has been built to help people learn how to do basic transformations on a single DataFrame in Spark + Scala.
chandnipatelTW/infra-twdu2a
Infrastructure set up for TWDU2 group A
chandnipatelTW/semi-structured-data-transformations
This repository has been built to help people learn how to work with semi-structured data sources with Spark+Scala.
chandnipatelTW/transformations
Katas for transforming data with Spark + Scala.
chandnipatelTW/transformations-with-pyspark
Repository with transformations with pyspark.
chandnipatelTW/tw-twdu-2a
Streaming Data Pipeline Code from TwoWheelers for TWDU-2a
chandnipatelTW/Airflow-Example-DAGS
DAGS used to automate citibike and wordcount apps
chandnipatelTW/airflow_examples
Repo of airflow examples people have made to automate the word count or citibike examples locally and in AWS.
chandnipatelTW/ci-workshop-app
chandnipatelTW/crime-data-transformations
A repository that analyzes crime data using Spark + Scala
chandnipatelTW/docker-curriculum
:dolphin: A comprehensive tutorial on getting started with Docker!
chandnipatelTW/docker-spark-demo
chandnipatelTW/essential-data-developer
Course outline, learning resources and some code for "essential data developer" course
chandnipatelTW/infra-gcube
Infrastructure repo for GCube
chandnipatelTW/kube-airflow
A docker image and kubernetes config files to run Airflow on Kubernetes
chandnipatelTW/ML-workshop
chandnipatelTW/python-getting-started
Getting Started with Python on Heroku.
chandnipatelTW/reddit-fraction-227
chandnipatelTW/ReduxSimpleStarter
Starter pack for an awesome Udemy course
chandnipatelTW/specifying-schema
Show different ways to specify schema with Spark + Scala, and potential issues that can occur.
chandnipatelTW/spring-security-saml
SAML extension for the Spring Security project
chandnipatelTW/streaming-data-pipeline
Streaming pipeline repo for data engineering training program
chandnipatelTW/streaming-pipeline
Golden copy of the streaming data pipeline used for the TwoWheelers client simulation session in the data engineering training program.
chandnipatelTW/wonderland-fsharp-katas
self-contained F# script, and a companion instructions file for katas