supervised-finetuning

There are 45 repositories under supervised-finetuning topic.

InternLM/xtuner
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Language:Python4.8k 38 577366
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Language:Python2.9k 43 436177
Tebmer/Awesome-Knowledge-Distillation-of-LLMs
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
1.2k 15 656
magpie-align/magpie
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
Language:Python768 5 3569
GaryYufei/AlignLLMHumanSurvey
Aligning Large Language Models with Human: A Survey
731 30 131
chaoswork/sft_datasets
开源SFT数据集整理,随时补充
542 2 343
KodCode-AI/kodcode
✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork
Language:Python267 1 510
LIN-SHANG/InstructERC
The offical realization of InstructERC
Language:Python142 2 209
NVlabs/catk
Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models. CVPR Oral 2025.
Language:Python141 5 125
codelion/ellora
Enhancing LLMs with LoRA
Language:Jupyter Notebook135 1 0
sail-sg/sdft
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
Language:Shell129 6 167
guanwei49/LogLLM
LogLLM: Log-based Anomaly Detection Using Large Language Models (system log anomaly detection)
Language:Python10820
yyDing1/ScaleQuest
[ACL-25] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.
Language:Python67 2 27
BUAADreamer/MLLM-Finetuning-Demo
使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory
Language:Python50 1 22
liziniu/GEM
Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)
Language:Python40 1 00
ShiZhengyan/InstructionModelling
[NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"
Language:Python36
inst-it/inst-it
Official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning"
Language:Python29 3 40
AstraZeneca/vlm
Official implementation for "Diffusion Instruction Tuning"
Language:Python28 2 23
fanqiwan/KCA
EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud
Language:Python22 0 00
quanshr/AugCon
[AAAI 2025]Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
Language:Python20 1 22
BUAADreamer/Qwen2-VL-History
Qwen2-VL在文旅领域的LLaMA-Factory微调案例 The case for fine-tuning Qwen2-VL in the field of historical literature and museums
12 2 02
bhattbhavesh91/google-gemma-finetuning-n2sql
Finetuning Google's Gemma Model for Translating Natural Language into SQL
Language:Jupyter Notebook11 2 04
KwokHing/AI-Planet-LLM-Bootcamp-Challenge
An LLM challenge to (i) fine-tune pre-trained HuggingFace transformer model to build a Code Generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain
Language:Jupyter Notebook5 2 00
mirabdullahyaser/LLaMA3-Financial-Analyst
LLM-powered financial analyst using LoRA-tuned Llama-3 and RAG pipeline to answer complex queries over SEC 10-K filings with contextual accuracy.
Language:Jupyter Notebook40
ranzeet013/RLHF-CustomData
Building an LLM with RLHF involves fine-tuning using human-labeled preferences. Based on Learning to Summarize from Human Feedback, it uses supervised learning, reward modeling, and PPO to improve response quality and alignment.
Language:Jupyter Notebook4
sovit-123/lm_sft
Various LMs/LLMs below 3B parameters (for now) trained using SFT (Supervised Fine Tuning) for several downstream tasks
Language:Jupyter Notebook4 1 01
asifhaider/LLM-Finetuning-Prompting-Project
Python Project Sample for Demonstration
Language:Jupyter Notebook3 2 00
ChryssaNab/ECG-Heartbeat-Classification
Binary classification of pathological heartbeats from ECG signals using 1D CNNs in PyTorch
Language:Python3 1 01
nsrinidhibhat/fine-tune-llama-2
This project streamlines the fine-tuning process, enabling you to leverage Llama-2's capabilities for your own projects.
Language:Python3 1 10
tien02/llm-math
Fine tune Large Language Model on Mathematic dataset
Language:Python3 1 00
rasyosef/phi-2-sft-and-dpo
Notebooks to create an instruction following version of Microsoft's Phi 2 LLM with Supervised Fine Tuning and Direct Preference Optimization (DPO)
Language:Jupyter Notebook2 1 00
renaldiangsar/Medical-LLM-Fine-Tuning
Fine-tuning Large Language Models (LLMs) for medical reasoning to enhances LLMs ability to understand, analyze, and generate accurate medical information.
Language:Jupyter Notebook2
artaasd95/rap-music-generator
The Rap Music Generator project is an innovative LLM-based tool designed to create rap lyrics. It offers multiple fine-tuning approaches to accommodate diverse rap generation techniques, providing users with a versatile platform for generating unique and stylistically varied content.
Language:Jupyter Notebook1 1 00
KindYAK/kaggle_20q
Solution for Kaggle 20 Questions competetion
Language:Jupyter Notebook1 1 00
rasyosef/phi-1_5-instruct
Notebooks to create an instruction following version of Microsoft's Phi 1.5 LLM with Supervised Fine Tuning and Direct Preference Optimization (DPO)
1 1 00
sunnynevarekar/LLM_Mistral_7b_SFT
Finetune Mistral 7b v1.0 on custom dataset
Language:Jupyter Notebook1 1 00

supervised-finetuning

InternLM/xtuner

InternLM/InternLM-XComposer

Tebmer/Awesome-Knowledge-Distillation-of-LLMs

magpie-align/magpie

GaryYufei/AlignLLMHumanSurvey

chaoswork/sft_datasets

KodCode-AI/kodcode

LIN-SHANG/InstructERC

NVlabs/catk

codelion/ellora

sail-sg/sdft

guanwei49/LogLLM

yyDing1/ScaleQuest

BUAADreamer/MLLM-Finetuning-Demo

liziniu/GEM

ShiZhengyan/InstructionModelling

inst-it/inst-it

AstraZeneca/vlm

fanqiwan/KCA

quanshr/AugCon

BUAADreamer/Qwen2-VL-History

bhattbhavesh91/google-gemma-finetuning-n2sql

KwokHing/AI-Planet-LLM-Bootcamp-Challenge

mirabdullahyaser/LLaMA3-Financial-Analyst

ranzeet013/RLHF-CustomData

sovit-123/lm_sft

asifhaider/LLM-Finetuning-Prompting-Project

ChryssaNab/ECG-Heartbeat-Classification

nsrinidhibhat/fine-tune-llama-2

tien02/llm-math

rasyosef/phi-2-sft-and-dpo

renaldiangsar/Medical-LLM-Fine-Tuning

artaasd95/rap-music-generator

KindYAK/kaggle_20q

rasyosef/phi-1_5-instruct

sunnynevarekar/LLM_Mistral_7b_SFT