zyang37's Stars
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—foundation models
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
LukeMathWalker/zero-to-production
Code for "Zero To Production In Rust", a book on API development using Rust.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
karpathy/build-nanogpt
Video+code lecture on building nanoGPT from scratch
pytorch/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
wdndev/llm_interview_note
Notes on knowledge and interview questions relevant to large language model (LLM) algorithm/application engineers
cuda-mode/lectures
Material for cuda-mode lectures
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
zwang4/awesome-machine-learning-in-compilers
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
harvard-edge/cs249r_book
A collaboratively written book on Machine Learning Systems
UMass-Foundation-Model/3D-LLM
Code for 3D-LLM: Injecting the 3D World into Large Language Models
ysymyth/awesome-language-agents
List of language agents based on paper "Cognitive Architectures for Language Agents"
efeslab/Nanoflow
A throughput-oriented high-performance serving framework for LLMs
xdit-project/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters
mit-han-lab/distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
deeperlearning/professional-cuda-c-programming
Xiuyu-Li/q-diffusion
[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.
mit-han-lab/Quest
[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
quantumlib/Qualtran
Qualtran is a Python library for expressing and analyzing fault-tolerant quantum algorithms.
NVIDIA/mig-parted
MIG Partition Editor for NVIDIA GPUs
snu-comparch/InfiniGen
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
xdit-project/DistVAE
A parallelized VAE that avoids OOM during high-resolution image generation
MDK8888/vllmini
A minimal implementation of vLLM.
amazon-science/piperag
PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design
MaoZiming/papers
Paper-reading notes for Berkeley OS prelim exam.