Muqi1029's Stars
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
yihedeng9/rlhf-summary-notes
A brief and partial summary of RLHF algorithms.
zhaochenyang20/Awesome-ML-SYS-Tutorial
My learning notes/codes for ML SYS.
guanyingc/latex_paper_writing_tips
Tips for Writing a Research Paper using LaTeX
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
genmoai/models
The best OSS video generation models
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
HuaizhengZhang/AI-System-School
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑💻 Video Tutorials.
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
ggerganov/llama.cpp
LLM inference in C/C++
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
SuperBruceJia/Awesome-LLM-Self-Consistency
Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models
yaoching0/GaC
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
punica-ai/punica
Serving multiple LoRA finetuned LLM as one
hao-ai-lab/Consistency_LLM
[ICML 2024] CLLMs: Consistency Large Language Models
microsoft/DeepSpeedExamples
Example models using DeepSpeed
microsoft/ParrotServe
[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
test-time-training/ttt-lm-jax
Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
DAMO-DI-ML/KDD2023-DCdetector
66RING/tiny-flash-attention
flash attention tutorial written in python, triton, cuda, cutlass
ssbuild/chatglm_finetuning
chatglm 6b finetuning and alpaca finetuning
chaoyanghe/Awesome-Federated-Learning
FedML - The Research and Production Integrated Federated Learning Library: https://fedml.ai
colinmarc/hdfs
A native go client for HDFS
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.