dcoliversun's Stars
github/gitignore
A collection of useful .gitignore templates
xai-org/grok-1
Grok open release
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
shimohq/chinese-programmer-wrong-pronunciation
Words that Chinese programmers commonly mispronounce
akka/akka
Build highly concurrent, distributed, and resilient message-driven applications on the JVM
databrickslabs/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
async-profiler/async-profiler
Sampling CPU and HEAP profiler for Java featuring AsyncGetCallTrace + perf_events
tjy-gitnub/win12
Windows 12 web edition; try it online
XuehaiPan/nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
ByConity/ByConity
ByConity is an open source cloud data warehouse
apecloud/kubeblocks
KubeBlocks is open-source control plane software that runs and manages databases, message queues and other stateful applications on K8s.
apache/arrow-ballista
Apache Arrow Ballista Distributed Query Engine
kwai/blaze
Blazing-fast query execution engine that speaks the Apache Spark language and has Arrow DataFusion at its core.
apache/incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
microsoft/DiskANN
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
apache/celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
pyspark-ai/pyspark-ai
English SDK for Apache Spark
PacktPublishing/Java-Coding-Problems
Java Coding Problems, published by Packt
EnricoMi/publish-unit-test-result-action
GitHub Action to publish unit test results on GitHub
LucaCanali/Miscellaneous
Includes notes on using Apache Spark in general, notes on using Spark for physics, how to run TPCDS on PySpark, how to create histograms with Spark, tools for performance testing CPUs, Jupyter notebook examples for Spark, and examples for Oracle and other DB systems.
xinrong-meng/knowledge-sharing
Hub for curated insights and resources on software systems and technologies
apache/arrow-datafusion-comet
Apache Arrow DataFusion Comet Spark Accelerator
oap-project/gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
neoremind/kraps-rpc
An RPC framework leveraging the Spark RPC module
apple/batch-processing-gateway
The gateway component to make Spark on K8s much easier for Spark users.
holdenk/spark-flowchart
Flowchart for debugging Spark applications
sundy-li/strawboat
A native storage format for Apache Arrow
apache/spark-kubernetes-operator
Apache Spark Kubernetes Operator
rockthejvm/spark-performance-tuning
The official repository for the Rock the JVM Spark Optimization 2 course