supervised-finetuning
There are 45 repositories under supervised-finetuning topic.
InternLM/xtuner
A Next-Generation Training Engine Built for Ultra-Large MoE Models
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Tebmer/Awesome-Knowledge-Distillation-of-LLMs
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
magpie-align/magpie
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
GaryYufei/AlignLLMHumanSurvey
Aligning Large Language Models with Human: A Survey
chaoswork/sft_datasets
开源SFT数据集整理,随时补充
KodCode-AI/kodcode
✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork
LIN-SHANG/InstructERC
The offical realization of InstructERC
NVlabs/catk
Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models. CVPR Oral 2025.
codelion/ellora
Enhancing LLMs with LoRA
sail-sg/sdft
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
guanwei49/LogLLM
LogLLM: Log-based Anomaly Detection Using Large Language Models (system log anomaly detection)
yyDing1/ScaleQuest
[ACL-25] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.
BUAADreamer/MLLM-Finetuning-Demo
使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory
liziniu/GEM
Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)
ShiZhengyan/InstructionModelling
[NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"
inst-it/inst-it
Official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning"
AstraZeneca/vlm
Official implementation for "Diffusion Instruction Tuning"
fanqiwan/KCA
EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud
quanshr/AugCon
[AAAI 2025]Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
BUAADreamer/Qwen2-VL-History
Qwen2-VL在文旅领域的LLaMA-Factory微调案例 The case for fine-tuning Qwen2-VL in the field of historical literature and museums
bhattbhavesh91/google-gemma-finetuning-n2sql
Finetuning Google's Gemma Model for Translating Natural Language into SQL
KwokHing/AI-Planet-LLM-Bootcamp-Challenge
An LLM challenge to (i) fine-tune pre-trained HuggingFace transformer model to build a Code Generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain
mirabdullahyaser/LLaMA3-Financial-Analyst
LLM-powered financial analyst using LoRA-tuned Llama-3 and RAG pipeline to answer complex queries over SEC 10-K filings with contextual accuracy.
ranzeet013/RLHF-CustomData
Building an LLM with RLHF involves fine-tuning using human-labeled preferences. Based on Learning to Summarize from Human Feedback, it uses supervised learning, reward modeling, and PPO to improve response quality and alignment.
sovit-123/lm_sft
Various LMs/LLMs below 3B parameters (for now) trained using SFT (Supervised Fine Tuning) for several downstream tasks
asifhaider/LLM-Finetuning-Prompting-Project
Python Project Sample for Demonstration
ChryssaNab/ECG-Heartbeat-Classification
Binary classification of pathological heartbeats from ECG signals using 1D CNNs in PyTorch
nsrinidhibhat/fine-tune-llama-2
This project streamlines the fine-tuning process, enabling you to leverage Llama-2's capabilities for your own projects.
tien02/llm-math
Fine tune Large Language Model on Mathematic dataset
rasyosef/phi-2-sft-and-dpo
Notebooks to create an instruction following version of Microsoft's Phi 2 LLM with Supervised Fine Tuning and Direct Preference Optimization (DPO)
renaldiangsar/Medical-LLM-Fine-Tuning
Fine-tuning Large Language Models (LLMs) for medical reasoning to enhances LLMs ability to understand, analyze, and generate accurate medical information.
artaasd95/rap-music-generator
The Rap Music Generator project is an innovative LLM-based tool designed to create rap lyrics. It offers multiple fine-tuning approaches to accommodate diverse rap generation techniques, providing users with a versatile platform for generating unique and stylistically varied content.
KindYAK/kaggle_20q
Solution for Kaggle 20 Questions competetion
rasyosef/phi-1_5-instruct
Notebooks to create an instruction following version of Microsoft's Phi 1.5 LLM with Supervised Fine Tuning and Direct Preference Optimization (DPO)
sunnynevarekar/LLM_Mistral_7b_SFT
Finetune Mistral 7b v1.0 on custom dataset