DamonZhao-sfu

DamonZhao-sfu's Stars

amazon-science/redset
Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshift fleet. We provide query metadata for 200 provisioned and serverless instances each.
401
manuzhang/awesome-streaming
a curated list of awesome streaming frameworks, applications, etc
2.7k297
chdb-io/chdb
chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse
Language:C++2.1k74
wagjamin/inkfuse
InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution.
Language:C++422
duckdblabs/db-benchmark
reproducible benchmark of database-like ops
Language:R14830
bheisler/iai
Experimental one-shot benchmarking/profiling harness for Rust
Language:Rust58223
hyrise/tpch_paper
Online Resources for the Paper 'Quantifying TPC-H Choke Points and Their Optimizations'
83
smola/spark-glusterfs-example
An example of Apache Spark integration with GlusterFS.
Language:Scala4
uber-common/jvm-profiler
JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter
Language:Java1.8k342
pgvector/pgvector
Open-source vector similarity search for Postgres
Language:C12.4k573
intel/BDTK
A modular acceleration toolkit for big data analytic engines
Language:C++6725
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Language:Rust30.1k1.9k
gregrahn/join-order-benchmark
Join Order Benchmark (JOB)
29185
HigherOrderCO/Bend
A massively parallel, high-level programming language
Language:Rust17.4k429
apache/datafusion-benchmarks
Apache DataFusion Benchmarks
Language:Python73
hpides/autovec-db
Code for our paper "Evaluating SIMD Compiler-Intrinsics for Database Systems"
Language:C++143
duckdb/duckdb
DuckDB is an analytical in-process SQL database management system
Language:C++23.9k1.9k
bigstepinc/SparkBench
Terasort-like benchmark for spark 2.x that uses dataframes, saves files in parquet etc for a more realistic testing.
Language:Scala52
abstools/timsort-benchmark
Java TimSort Benchmarking
Language:Java1
intel/PerTaskMemBWMonitoring
Language:Python107
intel/pcm
Intel® Performance Counter Monitor (Intel® PCM)
Language:C++2.8k475
chipsalliance/chisel
Chisel: A Modern Hardware Design Language
Language:Scala4k595
open-mpi/hwloc
Hardware locality (hwloc)
Language:C575173
oneapi-src/unified-memory-framework
A library for constructing allocators and memory pools. It also contains broadly useful abstractions and utilities for memory management. UMF allows users to manage multiple memory pools characterized by different attributes, allowing certain allocation types to be isolated from others and allocated using different hardware resources as required.
Language:C3628
apache/datafusion-comet
Apache DataFusion Comet Spark Accelerator
Language:Rust809158
apache/datafusion
Apache DataFusion SQL Query Engine
Language:Rust6.2k1.2k
chukonu-team/polars
Fast multi-threaded, hybrid-out-of-core query engine focussing on DataFrame front-ends
1
Azure-Samples/azure-sparkcruise-samples
Docs for Azure HDInsight
Language:Jupyter Notebook43
LucaCanali/sparkMeasure
This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination of Spark metrics, making it a practical choice for both developers and data engineers.
Language:Scala705145
tomsisso/spark-profiling-plugin
Spark plugin implementation for profiling a Spark app with context
Language:Java8