mcdull-zhang's Stars
jaywcjlove/awesome-mac
Now we have become very big, Different from the original idea. Collect premium software in various categories.
google/guava
Google core libraries for Java
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
greyireland/algorithm-pattern
算法模板,最科学的刷题方式,最快速的刷题路径,你值得拥有~
apache/hudi
Upserts, Deletes And Incremental Processing on Big Data.
facebookincubator/velox
A C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.
apache/kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
zackelia/bclm
macOS command-line utility to limit max battery charge
janino-compiler/janino
Janino is a super-small, super-fast Java™ compiler.
kwai/blaze
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
substrait-io/substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
apache/incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
apache/celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
groupon/sparklint
A tool for monitoring and tuning Spark jobs for efficiency.
cubefs/compass
Compass is a task diagnosis platform for bigdata
Tencent/Firestorm
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shuffle data on remote servers
bytedance/CloudShuffleService
Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.
housepower/spark-clickhouse-connector
Spark ClickHouse Connector build on DataSourceV2 API
apache/spark-website
Apache Spark Website
holdenk/spark-flowchart
Flowchart for debugging Spark applications
mcdull-zhang/gluten