deepspeed

There are 90 repositories under deepspeed topic.

InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language:Python7.1k 54 2k606
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language:Python1.5k 16 89123
zjunlp/KnowLM
An Open-sourced Knowledgable Large Language Model Framework.
Language:Python1.3k 11 136131
alibaba/Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
Language:Python661 7 6457
antgroup/glake
GLake: optimizing GPU memory management and IO transmission.
Language:Python480 7 2243
LambdaLabsML/distributed-training-guide
Best practices & guides on how to write distributed pytorch training code
Language:Python478 7 4343
Coobiw/MPP-LLaVA
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.
Language:Jupyter Notebook472 6 3823
shm007g/LLaMA-Cult-and-More
Large Language Models for All, 🦙 Cult and More, Stay in touch !
Language:HTML444 33 825
Xirider/finetune-gpt2xl
Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed
Language:Python437 5 2274
OpenMOSS/CoLLiE
Collaborative Training of Large Language Models in an Efficient Way
Language:Python416 11 6958
openpsi-project/ReaLHF
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
Language:Python315 4 2320
sunzeyeah/RLHF
Implementation of Chinese ChatGPT
Language:Python287 7 2535
stanleylsx/llms_tool
一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测，低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。
Language:Python220 4 1821
bobo0810/LearnDeepSpeed
DeepSpeed教程 & 示例注释 & 学习笔记（大模型高效训练）
Language:Python177 1 14
git-cloner/llama2-lora-fine-tuning
llama2 finetuning with deepspeed and lora
Language:Python176 3 1814
jackaduma/ChatGLM-LoRA-RLHF-PyTorch
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM
Language:Python138 6 210
HomebrewML/revlib
Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
Language:Python129 4 56
CoinCheung/gdGPT
Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.
Language:Python98 1 810
OpenCSGs/llm-inference
llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deployment, such as UI, RESTful API, auto-scaling, computing resource management, monitoring, and more.
Language:Python86 12 3416
xyjigsaw/LLM-Pretrain-SFT
Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)
Language:Python84 4 315
billvsme/train_law_llm
✏️0成本LLM微调上手项目，⚡️一步一步使用colab训练法律LLM，基于microsoft/phi-1_5、chatglm3，包含lora微调，全参微调
Language:Jupyter Notebook77 2 211
saforem2/l2hmc-qcd
Application of the L2HMC algorithm to simulations in lattice QCD.
Language:Jupyter Notebook67 5 29
glb400/Toy-RecLM
A toy large model for recommender system based on LLaMA2/SASRec/Meta's generative recommenders. Besides, note and experiments of official implementation for Meta's generative recommenders.
Language:Python62 1 14
jackaduma/Alpaca-LoRA-RLHF-PyTorch
A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca
Language:Python60 4 16
argonne-lcf/LLM-Inference-Bench
LLM-Inference-Bench
Language:Jupyter Notebook51 9 14
l294265421/my-llm
All about large language models
51 5 07
pszemraj/ai-msgbot
Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.
Language:Jupyter Notebook48 3 010
liangyuwang/Tiny-DeepSpeed
Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library
Language:Python45 1 01
5663015/LLMs_train
一套代码指令微调大模型
Language:Python39 1 13
nawnoes/pytorch-gpt-x
Implementation of autoregressive language model using improved Transformer and DeepSpeed pipeline parallelism.
Language:Python32 2 03
saforem2/ezpz
Train across all your devices, ezpz 🍋
Language:Python24 1 57
Beomi/transformers-language-modeling
Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3
Language:Python23 2 04
VodLM/vod
End-to-end training of Retrieval-Augmented LMs (REALM, RAG)
Language:Python22 3 43
wangclnlp/DeepSpeed-Chat-Extension
This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
Language:Python19 1 01
dyedd/deepspeed-diffusers
🚀 原生使用 Deepspeed 训练 Diffusers | Native Training of Diffusers with Deepspeed
Language:Python18 1 22
Raumberg/myllm
Multi-node distributed LLM training framework
Language:Python18

deepspeed

InternLM/lmdeploy

PKU-Alignment/safe-rlhf

zjunlp/KnowLM

alibaba/Megatron-LLaMA

antgroup/glake

LambdaLabsML/distributed-training-guide

Coobiw/MPP-LLaVA

shm007g/LLaMA-Cult-and-More

Xirider/finetune-gpt2xl

OpenMOSS/CoLLiE

openpsi-project/ReaLHF

sunzeyeah/RLHF

stanleylsx/llms_tool

bobo0810/LearnDeepSpeed

git-cloner/llama2-lora-fine-tuning

jackaduma/ChatGLM-LoRA-RLHF-PyTorch

HomebrewML/revlib

CoinCheung/gdGPT

OpenCSGs/llm-inference

xyjigsaw/LLM-Pretrain-SFT

billvsme/train_law_llm

saforem2/l2hmc-qcd

glb400/Toy-RecLM

jackaduma/Alpaca-LoRA-RLHF-PyTorch

argonne-lcf/LLM-Inference-Bench

l294265421/my-llm

pszemraj/ai-msgbot

liangyuwang/Tiny-DeepSpeed

5663015/LLMs_train

nawnoes/pytorch-gpt-x

saforem2/ezpz

Beomi/transformers-language-modeling

VodLM/vod

wangclnlp/DeepSpeed-Chat-Extension

dyedd/deepspeed-diffusers

Raumberg/myllm