ZhangZhiPku's Stars
efeslab/Nanoflow
A throughput-oriented high-performance serving framework for LLMs
reed-lau/cute-gemm
HazyResearch/flash-fft-conv
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
vim/vim
The official Vim repository
daquexian/faster-rwkv
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
OpenPPL/ppl.llm.kernel.cuda
OpenPPL/ppl.nn.llm
OpenPPL/ppl.llm.serving
OpenPPL/ppl.pmx
jundaf2/CUDA-INT8-GEMM
CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
open-mmlab/mmpose
OpenMMLab Pose Estimation Toolbox and Benchmark.
triple-Mu/YOLOv8-TensorRT
YOLOv8 accelerated with TensorRT
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
onnx/optimizer
Actively maintained ONNX Optimizer
inisis/brocolli
Everything in Torch Fx
higham/what-is
Important concepts in numerical linear algebra and related areas
meituan/YOLOv6
YOLOv6: a single-stage object detection framework dedicated to industrial applications.
taichi-dev/taichi
Productive, portable, and performant GPU programming in Python.
OpenPPL/CuAssembler
An unofficial CUDA assembler, for all generations of SASS, hopefully :)
intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
open-mmlab/mmyolo
OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOX, PPYOLOE, etc.
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
CompVis/stable-diffusion
A latent text-to-image diffusion model
megvii-research/Sparsebit
A model compression and acceleration toolbox based on PyTorch.
carbon-language/carbon-lang
Carbon Language's main repository: documents, design, implementation, and related tools. (NOTE: Carbon Language is experimental; see README)