This is the code repository for Data Stream Development with Apache Spark, Kafka, and Spring Boot [Video], published by Packt. It contains all the supporting project files necessary to work through the video course from start to finish.
For video and source code matching please check the wiki page
Learn to build a data stream pipeline using Apache Spark and Kafka from scratch. Start with a blueprint architecture: developing a completely functional data streaming pipeline. With live coding sessions, get hands-on with architecting every tier of the pipeline.
- Attain a solid foundation in the most powerful and versatile technologies involved in data streaming: Apache Spark and Apache Kafka
- Form a robust and clean architecture for a data streaming pipeline
- Implement the correct tools to bring your data streaming architecture to life
- Isolate the most problematic tradeoff for each tier involved in a data streaming pipeline
- Query, analyze, and apply machine learning algorithms to collected data
- Display analyzed pipeline data via Google Maps on your web browser
- Discover and resolve difficulties in scaling and securing data streaming applications
To fully benefit from the coverage included in this course, you will need:
No prior knowledge required
This course has the following software requirements:
- Java 8
- OpenSSL installed (optional)
- Java compatible IDE (e.g., Visual Studio Code, NetBeans, etc)
For video and source code matching please check the wiki page