This repo is a curated list of papers on machine learning systems, inspired by awesome-tensor-compilers.
- Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning OSDI'22
- Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel Training
- HET: Scaling out Huge Embedding Model Training via Cache-enabled Distributed Framework VLDB'22
- Bamboo: Making Preemptible Instances Resilient for Affordable Training of Large DNNs NSDI'23
- Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression ASPLOS'23
- AMP: Automatically Finding Model Parallel Strategies with Heterogeneity Awareness NeurIPS'22
- Varuna: Scalable, Low-cost Training of Massive Deep Learning Models EuroSys'22
- Megatron-LM SC'21
- Chimera: Efficiently Training Large-Scale Neural Networks with Bidirectional Pipelines SC'21
- Piper: Multidimensional Planner for DNN Parallelization NeurIPS'21
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models ICML'21
- DAPPLE: An Efficient Pipelined Data Parallel Approach for Large Models Training PPoPP'21
- TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models ICML'21
- SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient
- ModelKeeper: Accelerating DNN Training via Automated Training Warmup NSDI'23
- STRONGHOLD: Fast and Affordable Billion-scale Deep Learning Model Training SC'22
- Whale: Efficient Giant Model Training over Heterogeneous GPUs ATC'22
- GeePS: Scalable Deep Learning on Distributed GPUs with a GPU-Specialized Parameter Server EuroSys'16
- Paella: Low-latency Model Serving with Virtualized GPU Scheduling SOSP'23
- Beta: Statistical Multiplexing with Model Parallelism for Deep Learning Serving OSDI'23
- Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access EuroSys'23
- MPCFormer: Fast, Performant, and Private Transformer Inference with MPC ICLR'23
- High-throughput Generative Inference of Large Language Models with a Single GPU
- Cocktail: A Multidimensional Optimization for Model Serving in Cloud NSDI'22
- Serving Heterogeneous Machine Learning Models on Multi-GPU Servers with Spatio-Temporal Sharing ATC'22
- Abacus SC'21
- Serving DNNs like Clockwork: Performance Predictability from the Bottom Up OSDI'20
- Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving ATC'19
- Nexus: A GPU Cluster Engine for Accelerating DNN-based Video Analysis SOSP'19
- MegaBlocks: Efficient Sparse Training with Mixture-of-Experts MLSys'23
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
- Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs ASPLOS'23
- Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning NSDI'23
- Synergy: Looking Beyond GPUs for DNN Scheduling on Multi-Tenant Clusters OSDI'22
- Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning OSDI'21
- Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads OSDI'20
- Chronus: A Novel Deadline-aware Scheduler for Deep Learning Training Jobs SoCC'21
- ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning ASPLOS'23
- Multi-Resource Interleaving for Deep Learning Training SIGCOMM'22
- Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training arXiv
- Out-of-order Backprop: An Effective Scheduling Technique for Deep Learning EuroSys'22
- KungFu: Making Training in Distributed Machine Learning Adaptive OSDI'20
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications OSDI'20
- Spada: Accelerating Sparse Matrix Multiplication with Adaptive Dataflow ASPLOS'23
- MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clusters SoCC'22
- AccPar: Tensor Partitioning for Heterogeneous Deep Learning Accelerators HPCA'20
- Hidet: Task-Mapping Programming Paradigm for Deep Learning Tensor Programs ASPLOS'23
- iGniter: Interference-Aware GPU Resource Provisioning for Predictable DNN Inference in the Cloud TPDS'22
- Efficient Quantized Sparse Matrix Operations on Tensor Cores SC'22
- Pets ATC'22
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections OSDI'21
- APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores SC'21
- iGUARD SOSP'21
- Baechi: Fast Device Placement on Machine Learning Graphs SoCC'20
- Data Movement Is All You Need: A Case Study on Optimizing Transformers
- Legion: Automatically Pushing the Envelope of Multi-GPU System for Billion-Scale GNN Training ATC'23
- TC-GNN: Accelerating Sparse Graph Neural Network Computation Via Dense Tensor Core on GPUs ATC'23
- COGNN SC'22
- GNNAdvisor: An Efficient Runtime System for GNN Acceleration on GPUs OSDI'21
- Marius: Learning Massive Graph Embeddings on a Single Machine OSDI'21
- Accelerating Large Scale Real-Time GNN Inference Using Channel Pruning VLDB'21
- Reducing Communication in Graph Neural Network Training SC'20
- Fine-tuning Giant Neural Networks on Commodity Hardware with Automatic Pipeline Model Parallelism ATC'21
- Zeus: Understanding and Optimizing GPU Energy Consumption of DNN Training NSDI'23
- EnvPipe: Performance-preserving DNN Training Framework for Saving Energy ATC'23
- Characterizing Variability in Large-Scale, Accelerator-Rich Systems SC'22
- Prediction of the Resource Consumption of Distributed Deep Learning Systems SIGMETRICS'22
We encourage all contributions to this repository. Open an issue or send a pull request.