instruction-tuning

There are 192 repositories under instruction-tuning topic.

hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Language:Python58.3k 290 7.4k7.2k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python23.5k 160 1.6k2.6k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
16.3k 282 1481.1k
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Language:Python11.8k 166 67923
modelscope/data-juicer
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Language:Python5.2k 20 292268
yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
Language:Python4.5k 58 19520
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
Language:HTML4.3k 43 34306
NExT-GPT/NExT-GPT
Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model
Language:Python3.6k 61 112360
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Language:Python3.4k 30 209240
EvolvingLMMs-Lab/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Language:Python3.3k 80 165208
DSXiangLi/DecryptPrompt
总结Prompt&LLM论文，开源数据&模型，AIGC应用
3.2k 64 2315
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Language:Python2.9k 43 425177
PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台，我们欢迎开源爱好者发起任何有意义的pr！
Language:Jupyter Notebook2.8k 34 101253
X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Language:Python2.5k 29 238187
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Language:Python2k 27 277126
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Language:Python2k 21 83129
bespokelabsai/curator
Synthetic data curation for post-training and structured data extraction
Language:Python1.5k 10 267120
zjunlp/KnowLM
An Open-sourced Knowledgable Large Language Model Framework.
Language:Python1.3k 11 136131
yaodongC/awesome-instruction-dataset
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
1.1k 16 857
datadreamer-dev/DataDreamer
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
Language:Python1.1k 8 2854
NVlabs/DoRA
[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation
Language:Python852 10 2760
yaotingwangofficial/Awesome-MCoT
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
807 13 1124
HKUDS/GraphGPT
[SIGIR'2024] "GraphGPT: Graph Instruction Tuning for Large Language Models"
Language:Python777 8 9277
FudanDISC/DISC-FinLLM
DISC-FinLLM，中文金融大语言模型（LLM），旨在为用户提供金融场景下专业、智能、全面的金融咨询服务。DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide users with professional, intelligent, and comprehensive financial consulting services in financial scenarios.
Language:Python769 6 2879
ContextualAI/gritlm
Generative Representational Instruction Tuning
Language:Jupyter Notebook672 9 5649
hkust-nlp/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
Language:Python567 6 2729
bigscience-workshop/xmtf
Crosslingual Generalization through Multitask Finetuning
Language:Jupyter Notebook535 6 2241
salesforce/DialogStudio
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI
Language:Python516 12 535
RenzeLou/awesome-instruction-learning
Papers and Datasets on Instruction Tuning and Following. ✨✨✨
Language:Python499 7 025
princeton-nlp/LESS
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
Language:Jupyter Notebook494 4 3947
mindspore-courses/step_into_llm
MindSpore online courses: Step into LLM
Language:Jupyter Notebook477 8 41122
yuanze-lin/Olympus
[CVPR 2025 Highlight] Official code for "Olympus: A Universal Task Router for Computer Vision Tasks"
Language:Python425 4 028
ZebangCheng/Emotion-LLaMA
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
Language:Python403 6 6226
HugAILab/HugNLP
CIKM2023 Best Demo Paper Award. HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!😊
Language:Python389 6 1348
HKUDS/UrbanGPT
[KDD'2024] "UrbanGPT: Spatio-Temporal Large Language Models"
Language:Python387 12 3546
HenryHZY/Awesome-Multimodal-LLM
Research Trends in LLM-guided Multimodal Learning.
355 17 416

instruction-tuning

hiyouga/LLaMA-Factory

haotian-liu/LLaVA

BradyFU/Awesome-Multimodal-Large-Language-Models

RUCAIBox/LLMSurvey

modelscope/data-juicer

yizhongw/self-instruct

Instruction-Tuning-with-GPT-4/GPT-4-LLM

NExT-GPT/NExT-GPT

PKU-YuanGroup/Video-LLaVA

EvolvingLMMs-Lab/Otter

DSXiangLi/DecryptPrompt

InternLM/InternLM-XComposer

PhoebusSi/Alpaca-CoT

X-PLUG/mPLUG-Owl

OpenGVLab/InternVideo

cambrian-mllm/cambrian

bespokelabsai/curator

zjunlp/KnowLM

yaodongC/awesome-instruction-dataset

datadreamer-dev/DataDreamer

NVlabs/DoRA

yaotingwangofficial/Awesome-MCoT

HKUDS/GraphGPT

FudanDISC/DISC-FinLLM

ContextualAI/gritlm

hkust-nlp/deita

bigscience-workshop/xmtf

salesforce/DialogStudio

RenzeLou/awesome-instruction-learning

princeton-nlp/LESS

mindspore-courses/step_into_llm

yuanze-lin/Olympus

ZebangCheng/Emotion-LLaMA

HugAILab/HugNLP

HKUDS/UrbanGPT

HenryHZY/Awesome-Multimodal-LLM