Pinned Repositories
amoro
Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
spark
Apache Spark - A unified analytics engine for large-scale data processing
itachi
A library that brings useful functions from various modern database management systems to Apache Spark
multi-tenancy-spark
A Fully HiveServer2-like Multi-tenancy Spark Thrift Server Supporting Impersonation and Multi-SparkContext with Ranger Authorization (GO TO https://github.com/NetEase/kyuubi INSTEAD)
spark-authorizer
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apache Kyuubi
spark-postgres
PostgreSQL and GreenPlum Data Source for Apache Spark
spark-ranger
已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.
tpcds-for-spark
yaooqinn's Repositories
yaooqinn/ranger
Mirror of Apache Ranger
yaooqinn/spark-docker
Official Dockerfile for Apache Spark
yaooqinn/aircompressor
A port of Snappy, LZO, LZ4, and Zstandard to Java
yaooqinn/cloudberry
Cloudberry Database - Open source alternative to Greenplum Database. Created by the original Greenplum developers.
yaooqinn/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
yaooqinn/duckdb
DuckDB is an analytical in-process SQL database management system
yaooqinn/gluten
yaooqinn/grammars-v4
Grammars written for ANTLR v4; expectation that the grammars are free of actions.
yaooqinn/hive
Apache Hive
yaooqinn/iceberg
Apache Iceberg
yaooqinn/incubator-kyuubi
Apache Kyuubi is a distributed multi-tenant JDBC server for large-scale data processing and analytics, built on top of Apache Spark
yaooqinn/incubator-streampark
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
yaooqinn/kafka
Mirror of Apache Kafka
yaooqinn/libpg_query
C library for accessing the PostgreSQL parser outside of the server environment
yaooqinn/mongo
The MongoDB Database
yaooqinn/official-images
Primary source of truth for the Docker "Official Images" program
yaooqinn/orc
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
yaooqinn/parquet-format
Apache Parquet Format
yaooqinn/parquet-mr
Apache Parquet
yaooqinn/polaris
The interoperable, open source catalog for Apache Iceberg
yaooqinn/postgres
Mirror of the official PostgreSQL GIT repository. Note that this is just a *mirror* - we don't work with pull requests on github. To contribute, please see https://wiki.postgresql.org/wiki/Submitting_a_Patch
yaooqinn/spark
Apache Spark - A unified analytics engine for large-scale data processing
yaooqinn/spark-connect-go
Apache Spark Connect Client for Golang
yaooqinn/spark-kubernetes-operator
Apache Spark Kubernetes Operator
yaooqinn/spark-website
Apache Spark Website
yaooqinn/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
yaooqinn/unitycatalog
Open, Multi-modal Catalog for Data & AI
yaooqinn/yaooqinn
We get to decide what our story is.
yaooqinn/yaooqinn.github.io
yaooqinn/zstd-jni
JNI binding for Zstd