llm-training
There are 380 public repositories under the llm-training topic.
gitleaks/gitleaks
Find secrets with Gitleaks 🔑
liguodongiot/llm-action
This project aims to share technical principles and hands-on experience with large language models (LLM engineering and real-world LLM application deployment)
ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
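Ludwig's low-code approach means a model is declared as a config rather than written as code. A minimal sketch of that style, assuming a local train.csv with text and label columns (the file and column names are illustrative):

```python
from ludwig.api import LudwigModel

# Declarative config: Ludwig assembles the model from feature specs.
config = {
    "input_features": [{"name": "text", "type": "text"}],
    "output_features": [{"name": "label", "type": "category"}],
}

model = LudwigModel(config)
# train() returns (training_statistics, preprocessed_data, output_directory)
train_stats, _, output_dir = model.train(dataset="train.csv")
```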
skypilot-org/skypilot
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
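SkyPilot exposes both a CLI and a Python API. A minimal sketch using the Python API, assuming cloud credentials are already configured; the training script, accelerator spec, and cluster name are illustrative:

```python
import sky

# Describe the job; SkyPilot provisions matching compute from its
# configured clouds/Kubernetes clusters and runs the command there.
task = sky.Task(run="python train.py").set_resources(
    sky.Resources(accelerators="A100:1")
)
sky.launch(task, cluster_name="llm-train")
```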
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
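Liger-Kernel works by patching Hugging Face model classes with fused Triton kernels (RMSNorm, RoPE, SwiGLU, fused linear cross-entropy). A minimal sketch using its documented Llama patching entry point; the checkpoint name is illustrative:

```python
from liger_kernel.transformers import apply_liger_kernel_to_llama
from transformers import AutoModelForCausalLM

# Patch Llama modules with Liger's Triton kernels *before* instantiation.
apply_liger_kernel_to_llama()
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.1-8B")
# Training then proceeds as usual; the fused ops reduce memory and raise throughput.
```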
InternLM/xtuner
A Next-Generation Training Engine Built for Ultra-Large MoE Models
h2oai/h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
databricks/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
MoonshotAI/MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs
dstackai/dstack
dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or on-prem.
intelligent-machine-learning/dlrover
DLRover: An Automatic Distributed Deep Learning System
utkuozdemir/nvidia_gpu_exporter
NVIDIA GPU exporter for Prometheus using the nvidia-smi binary
volcengine/veScale
A PyTorch Native LLM Training Framework
sail-sg/Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
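Adan is a drop-in torch.optim-style optimizer that tracks three momentum statistics, hence three betas. A minimal sketch, assuming the repo's adan module is installed; the hyperparameters shown follow the paper's defaults and may need tuning:

```python
import torch
from adan import Adan  # provided by the sail-sg/Adan repo

model = torch.nn.Linear(512, 512)
# Three betas for Adan's three moment estimates (assumed defaults).
optimizer = Adan(
    model.parameters(),
    lr=1e-3,
    betas=(0.98, 0.92, 0.99),
    weight_decay=0.02,
)
```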
ghimiresunil/LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.
feifeibear/long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long-Context Transformer Model Training and Inference
rohan-paul/LLM-FineTuning-Large-Language-Models
LLM (Large Language Model) fine-tuning
anarchy-ai/LLM-VM
Irresponsible innovation. Try now at https://chat.dev/
mallorbc/Finetune_LLMs
Repo for fine-tuning causal LLMs
FlagAI-Open/Aquila2
The official repo of the Aquila2 series from BAAI, including pretrained and chat large language models.
yinizhilian/ICLR2025-Papers-with-Code
A collection of ICLR papers and open-source projects from past years, covering ICLR 2021 through ICLR 2025.
InternLM/InternEvo
InternEvo is an open-source, lightweight training framework that aims to support model pre-training without extensive dependencies.
tigerlab-ai/tiger
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
aws-samples/awsome-distributed-training
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
openpsi-project/ReaLHF
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
armbues/SiLLM
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
MLSys-Learner-Resources/Awesome-MLSys-Blogger
A repository collecting noteworthy MLSys bloggers (algorithms/systems)
zhuhanqing/APOLLO
APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Outstanding Paper Honorable Mention
promptslab/LLMtuner
Fine-tune LLMs in a few lines of code (Text2Text, Text2Speech, Speech2Text)
neo4j-labs/text2cypher
A collection of text2cypher datasets, evaluations, and fine-tuning instructions
bd4sur/Nano
Electronic Parrot / Toy Language Model
Laz4rz/GPT-2
Following master Karpathy with a GPT-2 implementation and training, writing lots of comments because I have the memory of a goldfish
substratusai/runbooks
Fine-tune LLMs on K8s using Runbooks
MatX-inc/seqax
seqax = sequence modeling + JAX
shivendrra/SmallLanguageModel
An LLM cookbook for building your own from scratch, all the way from gathering data to training a model
fiddlecube/fiddlecube-sdk
Generate ideal question-answer pairs for testing RAG