Pinned Repositories
flink
Apache Flink
kafka
Mirror of Apache Kafka
druid
Apache Druid: a high performance real-time analytics database.
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
spark
Apache Spark - A unified analytics engine for large-scale data processing
hudi
Upserts, Deletes And Incremental Processing on Big Data.
babyfish
airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
akka
Build highly concurrent, distributed, and resilient message-driven applications on the JVM
akka-http
The Streaming-first HTTP server/module of Akka
ArvinZheng's Repositories
ArvinZheng/druid
Apache Druid (Incubating) - Column oriented distributed data store ideal for powering interactive applications
ArvinZheng/spring-framework
The Spring Framework
ArvinZheng/parquet-java
Apache Parquet Java
ArvinZheng/caffeine
A high performance caching library for Java
ArvinZheng/flink
Apache Flink
ArvinZheng/spark
Apache Spark - A unified analytics engine for large-scale data processing
ArvinZheng/JCTools
ArvinZheng/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
ArvinZheng/presto
Official home of the community managed version of Presto, the distributed SQL query engine for big data, under the auspices of the Presto Software Foundation.
ArvinZheng/prometheus
The Prometheus monitoring system and time series database.
ArvinZheng/God-Of-BigData
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
ArvinZheng/opentelemetry-collector-contrib
Contrib repository for the OpenTelemetry Collector
ArvinZheng/kafka
Mirror of Apache Kafka
ArvinZheng/opentelemetry-proto
Protobuf definitions for the OpenTelemetry protocol (OTLP)
ArvinZheng/opentelemetry-collector
OpenTelemetry Collector
ArvinZheng/opentelemetry-js
OpenTelemetry JavaScript Client
ArvinZheng/starrocks
StarRocks is a next-gen sub-second MPP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics and ad-hoc query.
ArvinZheng/interactive_latencies
Jeff Dean's latency numbers plotted over time
ArvinZheng/airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
ArvinZheng/bigdata-file-viewer
A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
ArvinZheng/alpakka-kafka
Alpakka Kafka connector - Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.
ArvinZheng/akka
Build highly concurrent, distributed, and resilient message-driven applications on the JVM
ArvinZheng/akka-http
The Streaming-first HTTP server/module of Akka
ArvinZheng/hudi
Upserts, Deletes And Incremental Processing on Big Data.
ArvinZheng/codestyle
Code style for Airlift projects
ArvinZheng/guava
Google core libraries for Java
ArvinZheng/hive
Apache Hive
ArvinZheng/flink-on-k8s-operator
Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
ArvinZheng/flinkk8soperator
Kubernetes operator that provides control plane for managing Apache Flink applications
ArvinZheng/mysql-binlog-connector-java
MySQL Binary Log connector