YongGuCheng/llm-resource

LLM全栈优质资源汇总

ShellApache-2.0

llm-resource

LLM全栈优质资源汇总

非常欢迎大家也参与进来，收集更多优质大模型相关资源。

目录

LLM算法
LLM推理
LLM压缩
LLM测评
LLM应用开发
AI基础设施

LLM算法

Transformer

原理：

源码：

GPT

GLM

原理：

预训练语言模型：GLM

LLaMA

LLM 训练

学习率(warmup, decay)：

LLM 微调方法

Adapting P-Tuning to Solve Non-English Downstream Tasks

RLHF

MOSS-RLHF

LLM推理

使用HuggingFace的Accelerate库加载和运行超大模型 : device_map、no_split_module_classes、 offload_folder、 offload_state_dict
使用 DeepSpeed 和 Accelerate 进行超快 BLOOM 模型推理
LLM七种推理服务框架总结
LLM投机采样（Speculative Sampling）为何能加速模型推理
大模型推理妙招—投机采样（Speculative Decoding）
https://github.com/flexflow/FlexFlow/tree/inference
TensorRT-LLM(3)--架构
NLP（十八）：LLM 的推理优化技术纵览：https://zhuanlan.zhihu.com/p/642412124

LLM压缩

LLM量化

LLM 剪枝

LLM 蒸馏

知识蒸馏(Knowledge Distillation) 经典之作：https://zhuanlan.zhihu.com/p/102038521

LLM 稀疏化

NLP（八）：大语言模型的稀疏化技术

AI框架

PyTorch

PyTorch 源码解读系列 @ OpenMMLab 团队
[源码解析] PyTorch 分布式 @ 罗西的思考
PyTorch 分布式(18) --- 使用 RPC 的分布式流水线并行 @ 罗西的思考
【Pytorch】model.train() 和 model.eval() 原理与用法

DeepSpeed

Megatron-LM

Megatron-DeepSpeed

LLM 测评

CLiB中文大模型能力评测榜单
huggingface Open LLM Leaderboard
HELM：https://github.com/stanford-crfm/helm
HELM：https://crfm.stanford.edu/helm/latest/
lm-evaluation-harness：https://github.com/EleutherAI/lm-evaluation-harness/
CLEVA：http://www.lavicleva.com/#/homepage/overview
CLEVA：https://github.com/LaVi-Lab/CLEVA/blob/main/README_zh-CN.md

综合

safetensors

AI编译器

TVM资料
AI编译器原理 @ZIMO酱

AI 基础

AI基础设施

AI 芯片

业界AI加速芯片浅析（一）百度昆仑芯
NVIDIA CUDA-X AI：https://www.nvidia.cn/technologies/cuda-x/
Intel，Nvidia，AMD三大巨头火拼GPU与CPU
处理器与AI芯片-Google-TPU：https://zhuanlan.zhihu.com/p/646793355

CUDA

LLMOps

MLOps Landscape in 2023: Top Tools and Platforms
What Constitutes A Large Language Model Application? ：LLM Functionality Landscape

LLM应用开发

动手学大模型应用开发：https://github.com/datawhalechina/llm-universe
langchain java