zhouyuan

database, storage, big data analytics, LLM, views on my own

@apache

Pinned Repositories

Spark-PMoF
Spark Shuffle Optimization with RDMA+AEP
Language:C++30 11 2322
gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
Language:Scala257 17 55075
bitcask
Simple KV based on bitcask
Language:C++2 3 01
ceph
Ceph is a distributed object, block, and file storage platform
Language:C++2 6 02
cosbench-kits
Language:Groff2 2 00
fio
Flexible I/O Tester
Language:C1 2 00
HDCS
Hyper-converged Distributed Cache Store
Language:CSS1 3 016
native-sql-engine
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
Language:Scala1 1 00
tongjithesis
TongjiThesis is the abbreviation of Tongji University(P.R.C) Thesis LaTeX Template. This macro package aimed at creating a simple-to-use LaTeX dissertation template, including undergraduate thesis, master's thesis, doctoral dissertation.
7 2 14

zhouyuan's Repositories

zhouyuan/native-sql-engine
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
Language:Scala1 1 00
zhouyuan/AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Language:Python1 0
zhouyuan/arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
Language:C++1 0
zhouyuan/arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
Language:Rust1 0
zhouyuan/arrow-rs
Official Rust implementation of Apache Arrow
Language:Rust1 0
zhouyuan/client
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
zhouyuan/dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.
Language:C++0 0
zhouyuan/extension-script
Example repository for custom C++/CUDA operators for TorchScript
Language:Python1 0
zhouyuan/flashinfer
FlashInfer: Kernel Library for LLM Serving
Language:Cuda1 0
zhouyuan/gluten
Language:Scala1 21
zhouyuan/gluten-it
Intergration testing for Gluten
Language:Scala1 0
zhouyuan/gluten-te
Portable test envrionment of Gluten
Language:Shell1 0
zhouyuan/Gluten-Trino
Gluten: Plugin to Boost Trino's Performance
zhouyuan/libgsasl
https://www.gnu.org/software/gsasl/
Language:C1 0
zhouyuan/libhdfs3
HDFS file read access for ClickHouse
Language:C++1 0
zhouyuan/llama2.c
Inference Llama 2 in one file of pure C
Language:Python1 0
zhouyuan/llm-continuous-batching-benchmarks
Language:Python1 0
zhouyuan/LMCache
Prefill LLMs only once, re-use KV across instances
zhouyuan/Nanoflow
A throughput-oriented high-performance serving framework for LLMs
Language:Cuda0 0
zhouyuan/protobuf
Protocol Buffers - Google's data interchange format
zhouyuan/PyGithub
Typed interactions with the GitHub API v3
Language:Python0 0
zhouyuan/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:C++1 0
zhouyuan/s3select
library for processing s3select queries and execute them on CSV files (current phase)
zhouyuan/spark
Apache Spark - A unified analytics engine for large-scale data processing
Language:Scala1 0
zhouyuan/tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
zhouyuan/triton
Development repository for the Triton language and compiler
Language:C++1 0
zhouyuan/velox
A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.
Language:C++1 01
zhouyuan/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python1 0
zhouyuan/x86-simd-sort
C++ header file library for high performance SIMD based sorting algorithms for primitive datatypes
Language:C++1 0
zhouyuan/zhouyuan.github.io
zhouyuan.github.io
Language:JavaScript