Cunxiao2002

University of Science and Technology Beijing in Automation

University of Science and Technology Beijing, Beijing

Cunxiao2002's Stars

hzwer/WritingAIPaper
Writing AI Conference Papers: A Handbook for Beginners
57515
KnowingNothing/compiler-and-arch
A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture
37931
xdit-project/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters
Language:Python47540
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language: Python · 9.9k · 2.2k
kvcache-ai/ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Language:Python64131
byungsoo-oh/ml-systems-papers
Curated collection of papers in machine learning systems
1237
NAOSI-DLUT/Campus2025
Summary of internet-industry campus recruitment information for the class of 2025
72448
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
Language: C++ · 5.8k · 882
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language: C++ · 8.2k · 906
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
Language: Python · 18.6k · 1.5k
karpathy/build-nanogpt
Video+code lecture on building nanoGPT from scratch
Language: Python · 3.4k · 463
microsoft/AI-System
System for AI Education Resource.
Language: Python · 3.4k · 427
RussWong/LLM-engineering
Language:Cuda91
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language: Python · 36.2k · 5.7k
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Language: Python · 2.3k · 190
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language: Python · 4.2k · 376
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python · 26.7k · 3.9k
RussWong/CUDATutorial
A CUDA tutorial to make people learn CUDA program from 0
Language:Cuda17743
jefferyZhan/Griffon
[ECCV 2024] The official repo of the Griffon series
Language:Python935
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
2.5k · 162
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language: Jupyter Notebook · 37.1k · 3.9k
cuda-mode/awesomeMLSys
An ML Systems Onboarding list
49019
conanhujinming/tips_for_interview
Some of my interview takeaways; notes on my self-taught CS journey; job-hunting experience
3.7k · 345
gzc/CLRS
Solutions to Introduction to Algorithms
Language: C++ · 9.4k · 2.8k
liguodongiot/llm-action
This project aims to share the technical principles behind large models along with hands-on experience.
Language: HTML · 9.1k · 889
AmberLJC/LLMSys-PaperList
Large Language Model (LLM) Systems Paper List
57224
kebijuelun/Awesome-LLM-Learning
Learning Large Language Model (LLM)
Language:Python26333
triton-lang/triton
Development repository for the Triton language and compiler
Language: C++ · 12.7k · 1.5k
l0ngc/hpc-learning
hpc-learning
55437
microsoft/BitBLAS
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
Language:Python34129
