photoszzt's Stars
snuspl/nimble
Lightweight and Parallel Deep Learning Framework
rtenlab/gcaps-super-repo
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
facebookresearch/HolisticTraceAnalysis
A library to analyze PyTorch traces.
pytorch/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
jungmair/rewiring-lkm
A Loadable Kernel Module for Efficient Rewiring
jungmair/START
Self-Tuning Adaptive Radix Tree
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
punica-ai/punica
Serving multiple LoRA finetuned LLM as one
efeslab/Nanoflow
A throughput-oriented high-performance serving framework for LLMs
apolukhin/pfr_non_boost
Boost.PFR without the boost namespaces
rttrorg/rttr
C++ Reflection Library
NVIDIA/nccl-tests
NCCL Tests
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
juicedata/juicefs
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
fluid-cloudnative/fluid
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)
RWTH-ACS/cricket
cricket is a virtualization solution for GPUs
geohot/cuda_ioctl_sniffer
Sniff CUDA ioctls
thustorage/Sherman
Sherman: A Write-Optimized Distributed B+Tree Index on Disaggregated Memory
bytedance/eurosys24-artifacts
Artifacts of EuroSys'24 paper "Exploring Performance and Cost Optimization with ASIC-Based CXL Memory"
coldfunction/qCUDA
qCUDA: GPGPU Virtualization at a New API Remoting Method with Para-virtualization
trailofbits/deepstate
A unit test-like interface for fuzzing and symbolic execution
NVIDIA/open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
pallas/ioucontext
A coöperative multitasking framework based on `liburing` and `libucontext`
microsoft/durabletask-netherite
A new engine for Durable Functions. https://microsoft.github.io/durabletask-netherite
jbleners/Falcon
Falcon: Fast and lethal component observation network
uclasystem/DRust
josh-hildred/Caerus
vertexclique/lever
Pillars for Transactional Systems and Data Grids
kanidm/concread
Concurrently Readable Data Structures for Rust