Pinned Repositories
akka-cqrs-es-demo
Demo project to implement the CQRS and Event Sourcing patterns in Scala-Akka
akka-typed-distributed-state-blog
Companion repo for Lightbend blog post - How To Distribute Application State with Akka Cluster
clickstream-tutorial
Code for Tutorial on designing clickstream analytics application using Hadoop
data-algorithms-with-spark
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
data-generator
User web sessions data generator written in Python, for Kafka, Kinesis or local file system sinks
fraud-detection-tutorial
hadoop-arch-book
Code repository for O'Reilly Hadoop Application Architectures book
spark-playground
Code snippets used in demos recorded for the blog.
spark-scala-playground
Sample processing code using Spark 2.1+ and Scala
mrenau's Repositories
mrenau/akka-typed-distributed-state-blog
Companion repo for Lightbend blog post - How To Distribute Application State with Akka Cluster
mrenau/data-generator
User web sessions data generator written in Python, for Kafka, Kinesis or local file system sinks
mrenau/reactive-eplf-course
mrenau/amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
mrenau/amundsendatabuilder
Data ingestion library for Amundsen to build graph and search index
mrenau/awesome-compose
Awesome Docker Compose samples
mrenau/cdc_deltaLake
Docker compose and Google Colab demo to build a CDC with Delta Lake
mrenau/Databricks-academy
mrenau/dataframe-rules-engine
Extensible Rules Engine for custom Dataframe / Dataset validation
mrenau/datahack_docker
mrenau/dbt-data-ai-summit
Code that was used as an example during the Data+AI Summit 2020
mrenau/debezium-examples
Examples for running Debezium (Configuration, Docker Compose files etc.)
mrenau/dedupe_spark_sample
mrenau/demo-mlops
mrenau/demo-scene
Scripts and samples to support Confluent Platform talks. May be rough around the edges. For automated tutorials and QA'd code, see https://github.com/confluentinc/examples/
mrenau/docker-images
Docker images for Trino integration testing
mrenau/egeria
Open Metadata and Governance
mrenau/lunatech-scala-2-to-scala3-course
Lunatech course - "Moving forward from Scala 2 to Scala 3"
mrenau/modern-data-stack
This repo helps bootstrap the infrastructures with a modern data stack on Google Cloud Platform using Terraform.
mrenau/neosemantics-python-examples
examples of use of the neosemantics plugin
mrenau/open-data-fabric
Open protocol for decentralized exchange and transformation of data
mrenau/rancher
Complete container management platform
mrenau/snippets
mrenau/spark-dockerfile-multi-stage
Dockerfile for Spark applications using docker layers
mrenau/spark-essentials
The official repository for the Rock the JVM Spark Essentials with Scala course
mrenau/spark-optimization
The official repository for the Rock the JVM Spark Optimization with Scala course
mrenau/spark-performance-tuning
The official repository for the Rock the JVM Spark Optimization 2 course
mrenau/trino-minio-docker
Minimal example to run Trino, Minio, and Hive standalone metastore on docker
mrenau/udemy-spark-streaming
For Udemy students: the official repository of Rock the JVM's Spark Streaming course
mrenau/voluble
Intelligent data generator for Apache Kafka. Generates streams of realistic data with support for cross-topic relationships, tombstoning, configurable rates, and more.