Pinned Repositories
akka-cqrs-es-demo
Demo project to implement the CQRS and Event Sourcing patterns in Scala-Akka
akka-typed-distributed-state-blog
Companion repo for Lightbend blog post - How To Distribute Application State with Akka Cluster
clickstream-tutorial
Code for Tutorial on designing clickstream analytics application using Hadoop
data-algorithms-with-spark
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
data-generator
User web sessions data generator written in Python, for Kafka, Kinesis or local file system sinks
fraud-detection-tutorial
hadoop-arch-book
Code repository for O'Reilly Hadoop Application Architectures book
spark-playground
Code snippets used in demos recorded for the blog.
spark-scala-playground
Sample processing code using Spark 2.1+ and Scala
mrenau's Repositories
mrenau/data-algorithms-with-spark
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
mrenau/spark-playground
Code snippets used in demos recorded for the blog.
mrenau/a-kafka-story
Kafka ecosystem ... but step by step!
mrenau/acid-file-formats
Code for Apache Hudi, Apache Iceberg and Delta Lake analysis
mrenau/akka-cassandra-demo
The repository for the demonstration of Akka & Cassandra integration
mrenau/awesome-data-engineering
A curated list of data engineering tools for software developers
mrenau/bigdata_stack
Dockerized Hadoop/Minio/Hive/Presto stack
mrenau/code
Example application code for the python architecture book
mrenau/CursoIntroPython
Curso de introducción a la programación con python para Launch X de Innovacción Virtual
mrenau/data-product-analytics
mrenau/data-product-batch
mrenau/data-product-streaming
data-product-streaming
mrenau/datamesh
Material for the DataMesh presentation at GoDataFest 2021
mrenau/docker-spark-iceberg_fork
mrenau/efficient_data_processing_spark_fork
Code for "Efficient Data Processing in Spark" Course
mrenau/etl-with-airflow
ETL best practices with airflow, with examples
mrenau/examples
mrenau/first-rust-project
mrenau/incubator-pekko-samples_fork
Apache Pekko Sample Projects
mrenau/kafka-playground
mrenau/kubeflow-spark
Orchestrate Spark Jobs from Kubeflow Pipelines and poll for the status.
mrenau/machine-learning-engineering-for-production-public
Public repo for DeepLearning.AI MLEP Specialization
mrenau/ml-deployment
Repo for post
mrenau/nessie-demos
Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.
mrenau/OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
mrenau/presto-workload-analyzer
The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them
mrenau/spark-daria
Essential Spark extensions and helper methods ✨😲
mrenau/talos
Lawful circuit breakers for Scala. Akka and monix circuit breaker implementations with monitoring.
mrenau/the_data_must_flow
mrenau/trino-getting-started