instruction-tuning

There are 144 repositories under instruction-tuning topic.

  • LLaMA-Factory

    hiyouga/LLaMA-Factory

    Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

    Language:Python37.5k2205.7k4.6k
  • haotian-liu/LLaVA

    [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

    Language:Python21k1581.6k2.3k
  • BradyFU/Awesome-Multimodal-Large-Language-Models

    :sparkles::sparkles:Latest Advances on Multimodal Large Language Models

  • RUCAIBox/LLMSurvey

    The official GitHub page for the survey paper "A Survey of Large Language Models".

    Language:Python10.7k15965836
  • Instruction-Tuning-with-GPT-4/GPT-4-LLM

    Instruction Tuning with GPT-4

    Language:HTML4.3k4334301
  • yizhongw/self-instruct

    Aligning pretrained language models with instruction data generated by themselves.

    Language:Python4.2k5719493
  • Otter

    Luodian/Otter

    🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

    Language:Python3.6k100163242
  • NExT-GPT/NExT-GPT

    Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

    Language:Python3.4k60109344
  • modelscope/data-juicer

    Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

    Language:Python3.3k19212195
  • PKU-YuanGroup/Video-LLaVA

    【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

    Language:Python3.1k29201219
  • DSXiangLi/DecryptPrompt

    总结Prompt&LLM论文,开源数据&模型,AIGC应用

  • InternLM/InternLM-XComposer

    InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

    Language:Python2.7k44406163
  • PhoebusSi/Alpaca-CoT

    We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!

    Language:Jupyter Notebook2.7k35100249
  • X-PLUG/mPLUG-Owl

    mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

    Language:Python2.4k30233177
  • cambrian-mllm/cambrian

    Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

    Language:Python1.8k2371123
  • OpenGVLab/InternVideo

    [ECCV2024] Video Foundation Models & Data for Multimodal Understanding

    Language:Python1.5k2720194
  • zjunlp/KnowLM

    An Open-sourced Knowledgable Large Language Model Framework.

    Language:Python1.3k11136128
  • yaodongC/awesome-instruction-dataset

    A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

  • DataDreamer

    datadreamer-dev/DataDreamer

    DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

    Language:Python88982646
  • NVlabs/DoRA

    [ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

    Language:Python676112045
  • HKUDS/GraphGPT

    [SIGIR'2024] "GraphGPT: Graph Instruction Tuning for Large Language Models"

    Language:Python66988663
  • FudanDISC/DISC-FinLLM

    DISC-FinLLM,中文金融大语言模型(LLM),旨在为用户提供金融场景下专业、智能、全面的金融咨询服务。DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide users with professional, intelligent, and comprehensive financial consulting services in financial scenarios.

    Language:Python63462874
  • ContextualAI/gritlm

    Generative Representational Instruction Tuning

    Language:Jupyter Notebook58495442
  • hkust-nlp/deita

    Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

    Language:Python52362729
  • bigscience-workshop/xmtf

    Crosslingual Generalization through Multitask Finetuning

    Language:Jupyter Notebook52062238
  • salesforce/DialogStudio

    DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI

    Language:Python48513534
  • RenzeLou/awesome-instruction-learning

    Papers and Datasets on Instruction Tuning and Following. ✨✨✨

    Language:Python4737023
  • mindspore-courses/step_into_llm

    MindSpore online courses: Step into LLM

    Language:Jupyter Notebook443930104
  • princeton-nlp/LESS

    [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

    Language:Jupyter Notebook39753739
  • HugAILab/HugNLP

    CIKM2023 Best Demo Paper Award. HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!😊

    Language:Python38281247
  • HenryHZY/Awesome-Multimodal-LLM

    Research Trends in LLM-guided Multimodal Learning.

  • HKUDS/UrbanGPT

    [KDD'2024] "UrbanGPT: Spatio-Temporal Large Language Models"

    Language:Python324173144
  • zhilizju/Awesome-instruction-tuning

    A curated list of awesome instruction tuning datasets, models, papers and repositories.

    Language:Python3213014
  • ictnlp/BayLing

    “百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型,具有优越的英语/中文能力,在多语言和通用任务等多项测试中取得ChatGPT 90%的性能。BayLing is an English/Chinese LLM equipped with advanced language alignment, showing superior capability in English/Chinese generation, instruction following and multi-turn interaction.

    Language:Python30681219
  • ZigeW/data_management_LLM

    Collection of training data management explorations for large language models

  • mlpc-ucsd/BLIVA

    (AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions

    Language:Python273122728