Pinned Repositories
100-pandas-puzzles
100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
aadhaar-dataset-analysis
An analysis of the Aadhaar dataset using MapReduce and Spark
advanced-data-engineering-with-databricks
ApacheSpark_with_Scala
This repository hosts all the notebooks for Apache Spark using Scala
azsynapselabsny
Analytics in a Day is an Azure Synapse-focused workshop
azure-synapse-analytics-workshop-400
Big-Data-Analysis-with-Scala-and-Spark
Course offered by EPFL via Coursera
bigdata-notebook
BigDataAnalyticswithSpark
Code for my videos on big data analytics with Apache Spark using Scala.
coding-interview-university
A complete computer science study plan to become a software engineer.
Babadook007's Repositories
Babadook007/ApacheSpark_with_Scala
This repository hosts all the notebooks for Apache Spark using Scala
Babadook007/fintank
Architectural POC for real-time market data and portfolio order processing using Storm, Kafka, InfluxDB, Grafana, ooh and Python!
Babadook007/Flight-data-analysis
Developed an Oozie/Hadoop-based workflow to process and analyze a large volume (11 GB) of flight data
Babadook007/HadoopWithPython
Repository for Hadoop with Python including example source code
Babadook007/HBase-SparkStreaming
Simple Spark Streaming project which reads from HBase Table and writes to HBase Table
Babadook007/IMDBMovieBigData
A big data project that applies Hadoop MapReduce to derive statistics from IMDB movie data.
Babadook007/kafka-spark-streaming
Project for reading data from Kafka and writing to Kafka and HBase with Kerberos
Babadook007/kafka-sparkstreaming
Babadook007/learning-spark-examples
Examples for learning Spark
Babadook007/PythonCrashCourse
A quick introduction to Python for Scientists and Engineers
Babadook007/SOLID-1
Demonstrating the SOLID design principles in Java
Babadook007/spark-datetime
functionstest
Babadook007/spark-kafka-avro
POC: Spark consumer for bottledwater-pg Kafka Avro topics
Babadook007/spark-kafka-streaming
Custom Spark Kafka consumer based on Kafka SimpleConsumer API.
Babadook007/spark-knowledgebase
Spark Knowledge Base
Babadook007/Spark-Streaming-DirectKafka-Examples
DirectKafka examples for Spark Streaming: 1. with checkpointing 2. custom offset management
Babadook007/Spark-Streaming-Examples
Spark Streaming with Flume, Kafka, Kinesis, SparkSQL, Socket, Custom Receiver, Handling Twitter Data Read/Write, Machine Learning,
Babadook007/spark-streaming-twitter-kafka
Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag
Babadook007/spark-unit-testing
A tutorial on Apache Spark Unit Testing
Babadook007/SparkStreaming.Sessionization
NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase
Babadook007/SparkStreamingHBaseExample
Spark Streaming HBase Example