llm-training
There are 185 repositories under the llm-training topic.
ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
liguodongiot/llm-action
This project aims to share the technical principles behind large models along with hands-on experience.
skypilot-org/skypilot
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
h2oai/h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/
InternLM/xtuner
An efficient, flexible, and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
databricks/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
intelligent-machine-learning/dlrover
DLRover: An Automatic Distributed Deep Learning System
utkuozdemir/nvidia_gpu_exporter
NVIDIA GPU exporter for Prometheus, using the nvidia-smi binary
ghimiresunil/LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.
volcengine/veScale
A PyTorch Native LLM Training Framework
anarchy-ai/LLM-VM
irresponsible innovation. Try now at https://chat.dev/
mallorbc/Finetune_LLMs
Repo for fine-tuning Causal LLMs
FlagAI-Open/Aquila2
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
rohan-paul/LLM-FineTuning-Large-Language-Models
LLM (Large Language Model) FineTuning
tigerlab-ai/tiger
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
promptslab/LLMtuner
Tune LLM in few lines of code
feifeibear/long-context-attention
Sequence Parallel Attention for Long Context LLM Model Training and Inference
armbues/SiLLM
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
substratusai/runbooks
Finetune LLMs on K8s by using Runbooks
aws-samples/awsome-distributed-training
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
MatX-inc/seqax
seqax = sequence modeling + JAX
shivendrra/SmallLanguageModel-project
An LLM cookbook for building your own model from scratch, all the way from gathering data to training
slai-labs/get-beam
Run GPU inference and training jobs on serverless infrastructure that scales with you.
vihangd/alpaca-qlora
Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA
dsdanielpark/open-llm-datasets
Repository for organizing datasets and papers used in Open LLM.
neo4j-labs/text2cypher
Collection of text2cypher datasets, evaluations, and fine-tuning instructions
Itachi-Uchiha581/Auto-Data
Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).
SmerkyG/gptcore
Fast modular code to create and train cutting edge LLMs
discus-labs/discus
A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ
muyu42/DataS
This project aims to build on representative prior research to evaluate SFT data along multiple dimensions and filter it automatically.
sugarcane-ai/sugarcane-ai
npm-like package ecosystem for Prompts 🤖
sec3-service/Owl-LM
Large Language Model for Blockchain
sotopia-lab/sotopia-pi
Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)
microsoft/LLF-Bench
A benchmark for evaluating learning agents based on just language feedback
lenguajenatural-ai/autotransformers
A Python package for automatically training and comparing language models.
TatevKaren/BabyGPT-Build_GPT_From_Scratch
BabyGPT: Build Your Own GPT Large Language Model from Scratch. Pre-training generative transformer models: building GPT from scratch with a step-by-step guide to generative AI in PyTorch and Python.