xiaoyuyao
Apache Hadoop PMC and Committer. Working on Open Source Software: Hadoop/HDFS/Ozone.
Cloudera Inc.
Pinned Repositories
hadoop
Apache Hadoop
ozone
Scalable, redundant, and distributed object store for Apache Hadoop
ratis
Open source Java implementation of the Raft consensus protocol.
ratis-thirdparty
Third-party dependencies for Apache Ratis
caochong
Set up a Hadoop and/or Spark cluster running within Docker containers on a single physical machine
flink
Apache Flink
grpc-java
The Java gRPC implementation. HTTP/2 based RPC
hadoop-ozone
Apache Hadoop Ozone
ozone-0.4
rook
Storage Orchestration for Kubernetes
xiaoyuyao's Repositories
xiaoyuyao/flink
Apache Flink
xiaoyuyao/grpc-java
The Java gRPC implementation. HTTP/2 based RPC
xiaoyuyao/hadoop-ozone
Apache Hadoop Ozone
xiaoyuyao/ozone-0.4
xiaoyuyao/rook
Storage Orchestration for Kubernetes
xiaoyuyao/kubernetes
Production-Grade Container Scheduling and Management
xiaoyuyao/ratis-thirdparty-test
Apache Ratis Thirdparty test
xiaoyuyao/connectors
Connectors for Delta Lake
xiaoyuyao/curve
Curve is a high-performance, lightweight-operation, cloud-native open source distributed storage system. Curve can be applied to: 1) mainstream cloud-native infrastructure platforms OpenStack and Kubernetes; 2) high-performance storage for cloud-native databases; 3) cloud storage middleware using S3-compatible object storage as a data storage engine, providing cost-effective shared file storage.
xiaoyuyao/databricks-copy-into
xiaoyuyao/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
xiaoyuyao/disruptor
High Performance Inter-Thread Messaging Library
xiaoyuyao/docker
xiaoyuyao/docker-hadoop
docker compose for various setups
xiaoyuyao/geohash-java
Implementation of GeoHashes in java. We try to be/stay compliant to the spec, as far as possible.
xiaoyuyao/gravitino
A high-performance, geo-distributed and federated metadata lake
xiaoyuyao/hadoop
Mirror of Apache Hadoop
xiaoyuyao/hadoop-3.2
xiaoyuyao/hbase
Apache HBase
xiaoyuyao/incubator-iceberg
Apache Iceberg (Incubating)
xiaoyuyao/incubator-ratis
Mirror of Apache Ratis (Incubating)
xiaoyuyao/incubator-ratis-thirdparty
Apache Ratis Thirdparty
xiaoyuyao/OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
xiaoyuyao/ozoneconf
Ozoneconf
xiaoyuyao/ozonedemo
Demos of programming with the Ozone Java RPC
xiaoyuyao/paimon
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
xiaoyuyao/pipeline
PipelineIO: End-to-End ML and AI Platform for Real-time Spark and TensorFlow Data Pipelines
xiaoyuyao/spark-sql-perf
xiaoyuyao/vectordb-recipes
High quality resources & applications for LLMs, multi-modal models and VectorDBs
xiaoyuyao/velox
A C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.