sanjyotb
Sr. Data Engineer (EPFL-certified Scala developer) with 7.5+ years of experience in Scala, Java, and Akka, and with Big Data technologies such as Spark, Kafka, and AWS.
Thoughtworks Inc., Pune
Pinned Repositories
basic-aws-infrastructure
Repo to help you set up a basic AWS environment with an EMR cluster behind a VPC.
basic-transformations
This repository has been built to help people learn how to do basic transformations on a single DataFrame in Spark + Scala.
bigdata-fun
A complete (distributed) BigData stack, running in containers
clean-code-workshop
crime-data-transformations
A repository that analyzes crime data using Spark + Scala
data-eng-bootcamp
data-transformations
Starter code base for a Spark + Scala project.
docker-hadoop-secure
Secure Hadoop docker image
github-slideshow
A robot powered training repository :robot:
hello-world
sanjyotb's Repositories
sanjyotb/basic-aws-infrastructure
Repo to help you set up a basic AWS environment with an EMR cluster behind a VPC.
sanjyotb/basic-transformations
This repository has been built to help people learn how to do basic transformations on a single DataFrame in Spark + Scala.
sanjyotb/bigdata-fun
A complete (distributed) BigData stack, running in containers
sanjyotb/clean-code-workshop
sanjyotb/crime-data-transformations
A repository that analyzes crime data using Spark + Scala
sanjyotb/data-eng-bootcamp
sanjyotb/data-transformations
Starter code base for a Spark + Scala project.
sanjyotb/docker-hadoop-secure
Secure Hadoop docker image
sanjyotb/github-slideshow
A robot powered training repository :robot:
sanjyotb/hello-world
sanjyotb/helloworld
Just a sample repository
sanjyotb/join-transformations
This repository will walk you through several katas for learning how to do joins with Spark+Scala.
sanjyotb/kafka
A distributed publish/subscribe messaging service
sanjyotb/kafka-examples
Apache Kafka examples
sanjyotb/kafka-streams-examples
Demo applications and code examples for Apache Kafka's Streams API.
sanjyotb/my-repo
sanjyotb/patchwork
All the Git-it Workshop completers!
sanjyotb/scala-spark-tutorial
Project for James' Apache Spark with Scala course
sanjyotb/semi-structured-data-transformations
This repository has been built to help people learn how to work with semi-structured data sources with Spark+Scala.
sanjyotb/SparkStreamingExample
Word count example to demonstrate Spark Streaming
sanjyotb/training-kit
Open source on demand courses and cheat sheets for Git and GitHub