Pinned Repositories
incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
orc
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
spark
Apache Spark - A unified analytics engine for large-scale data processing
data-faker
Fake Data Generation in Scala
datafusion-comet
Apache DataFusion Comet Spark Accelerator
external-storage
External storage plugins, provisioners, and helper libraries
gluten
Gluten: Plugin to Double SparkSQL's Performance
goofys
a high-performance, POSIX-ish Amazon S3 file system written in Go
hadoop
Mirror of Apache Hadoop
hello-world
my first rpository
PengleiShi's Repositories
PengleiShi/data-faker
Fake Data Generation in Scala
PengleiShi/datafusion-comet
Apache DataFusion Comet Spark Accelerator
PengleiShi/external-storage
External storage plugins, provisioners, and helper libraries
PengleiShi/gluten
Gluten: Plugin to Double SparkSQL's Performance
PengleiShi/goofys
a high-performance, POSIX-ish Amazon S3 file system written in Go
PengleiShi/hadoop
Mirror of Apache Hadoop
PengleiShi/hello-world
my first rpository
PengleiShi/iceberg
Apache Iceberg
PengleiShi/kubernetes
Production-Grade Container Scheduling and Management
PengleiShi/orc
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
PengleiShi/PengleiShi.github.io
PengleiShi/spark
Apache Spark - A unified analytics engine for large-scale data processing