italo-batista's Stars
pyenv/pyenv
Simple Python version management
rxhanson/Rectangle
Move and resize windows on macOS with keyboard shortcuts and snap areas
mlflow/mlflow
Open source platform for the machine learning lifecycle
kyleneideck/BackgroundMusic
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
apache/druid
Apache Druid: a high performance real-time analytics database.
msiemens/tinydb
TinyDB is a lightweight document oriented database optimized for your happiness :)
redpanda-data/console
Redpanda Console is a developer-friendly UI for managing your Kafka/Redpanda workloads. Console gives you a simple, interactive approach for gaining visibility into your topics, masking data, managing consumer groups, and exploring real-time data with time-travel debugging.
adilkhash/Data-Engineering-HowTo
A list of useful resources to learn Data Engineering from scratch
prometheus/jmx_exporter
A process for collecting metrics using JMX MBeans for Prometheus consumption
kmikiy/SpotMenu
Spotify and iTunes in your menu bar
google-deepmind/educational
dssg/hitchhikers-guide
The Hitchhiker's Guide to Data Science for Social Good
zalando/nakadi
A distributed event bus that implements a RESTful API abstraction on top of Kafka-like queues
shshankar1/ebooks
awslabs/data-on-eks
DoEKS is a tool to build, deploy and scale Data & ML Platforms on Amazon EKS
streaming-with-flink/examples-java
Stream Processing with Apache Flink - Java Examples
zhangjunhd/reading-notes
张俊的读书笔记
AbsaOSS/ABRiS
Avro SerDe for Apache Spark structured APIs.
aws-samples/amazon-kinesis-data-analytics-examples
Example applications in Java, Python and SQL for Kinesis Data Analytics, demonstrating sources, sinks, and operators.
awslabs/aws-glue-schema-registry
AWS Glue Schema Registry Client library provides serializers / de-serializers for applications to integrate with AWS Glue Schema Registry Service. The library currently supports Avro, JSON and Protobuf data formats. See https://docs.aws.amazon.com/glue/latest/dg/schema-registry.html to get started.
natbusa/data-engineering
How to build an awesome data engineering team
FIWARE/data-models
:capital_abcd: Code and specifications to support harmonized data models
aws-solutions-library-samples/real-time-analytics-spark-streaming
A solution describing data-processing design pattern for streaming data through Kinesis and Spark Streaming at real-time.
rogeriomm/labtools-k8s
Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,Airflow, Kafka Strimzi, Datahub, OpenMetadata,Zeppelin, Jupyter, JFrog Container Registry
jgrier/FilteringExample
Flink stream filtering examples
publicissapient-france/spark-structured-streaming-blog
maziyarpanahi/spark2-template
Intellij template to develop Apache Spark 2.x applications
FightPandemics/DataModel
FightPandemics data model documentation and scripts
aws-samples/transactional-data-lake-debezium-cdc
avishekrk/solve_eda
EDA materials for Solve for Good Webinar