sft

There are 41 repositories under sft topic.

dataelement/bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
Language:Python8.9k 908 1601.6k
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
Language:Python4.3k 23 1.3k382
ssbuild/chatglm_finetuning
chatglm 6b finetuning and alpaca finetuning
Language:Python1.5k 20 246176
jerry1993-tech/Cornucopia-LLaMA-Fin-Chinese
聚宝盆(Cornucopia): 中文金融系列开源可商用大模型，并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)
Language:Python589 5 2061
ukairia777/tensorflow-nlp-tutorial
tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.
Language:Jupyter Notebook530 5 5268
choosewhatulike/trainable-agents
Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"
Language:Python459 17 1232
0xsequence/erc-1155
Ethereum Semi Fungible Standard (ERC-1155)
Language:TypeScript322 36 44118
solv-finance/erc-3525
ERC-3525 Reference Implementation
Language:Solidity109 4 947
NiuTrans/Vision-LLM-Alignment
This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
Language:Python83 3 95
muyu42/DataS
本项目旨在结合以往研究人员的代表性工作，从多个维度评估sft数据，并自动化过滤sft数据。
Language:Python55 11 512
ecnu-sea/SEA
SEA is an automated paper review framework capable of generating comprehensive and high-quality review feedback with high consistency for papers, thereby assisting researchers in improving the quality of their work.
Language:Python50 0 37
ssbuild/moss_finetuning
moss chat finetuning
Language:Python50 2 164
movescriptions/movescriptions
https://twitter.com/MoveScriptions
Language:Move45 4 716
ElvenTools/elven-tools-cli
Elven Tools CLI - command line tool for launching NFTs collections on the MultiversX blockchain (Plus other tools).
Language:TypeScript24 3 2613
rbga/Low_Density_Parity_Check_LDPC_Codes_-_MATLAB_Simulation
LDPC MATLAB simulation using BPSK + AWGN modulation decoded using Sum Product and Min Sum Algorithm
Language:MATLAB16 1 15
wangclnlp/DeepSpeed-Chat-Extension
This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
Language:Python16 2 01
Macielyoung/Baichuan-QLora
Finetune baichuan pretrained model with QLora method
Language:Python15 1 41
taishan1994/chinese_llm_sft
使用指令微调对大模型进行微调。
Language:Python8 1 02
AlekseyKorshuk/gai-project
Train expert conversational role-play LLMs with synthetic data
Language:Python6 1 02
DaehanKim/EasyRLHF
EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets
Language:Python6 1 00
hlp-ai/miniChatGPT
Mini ChatGPT
Language:Python6 1 11
THU-KEG/DICE
DICE: Detecting In-distribution Data Contamination with LLM's Internal State
Language:Python5 5 10
ldclabs/ic-sft
A SFT (Semi-Fungible Token, implemented ICRC-7 and ICRC-37) canister smart contract on the Internet Computer.
Language:Rust4 2 01
ElvenTools/elven-tools-sft-minter-sc
Elven Tools SFT Minter Smart Contract - launching SFTs collections on the MultiversX blockchain
Language:Rust3 1 01
SharathHebbar/sft_mathgpt2
Supervised Fine tuning using TRL library
Language:Jupyter Notebook2 1 0
Sophietje/SFTLearning
Testing the security of sanitizers by learning symbolic finite transducers
Language:Java2 3 00
dgomezde83/Multifungible-library
MultiversX library for interacting with the MultiversX blockchain's Non-fungible tokens and Semi-fungible tokens.
Language:C++1 0 00
Lamsoda1123/GPT2_medium_finetune-lora-sft
It's a GPT2 finetune project based on peft and transformers. Although can provide quite a imporvement, however, the illusion and inteligent is terrible.
Language:Python1 2 00
Lizhecheng02/Kaggle-LMSYS
Analyze a dataset of conversations from the Chatbot Arena, where various LLMs provide responses to user prompts. The goal is to develop a model that enhances chatbot interactions, ensuring they align more closely with human preferences.
Language:Jupyter Notebook1 1 00
sftchance/sftchance
⚪ CHANCE IS A STUDY IN DECENTERED IDENTITY TOURISM AND THE A(E)FFECTS OF PRIVILEGE, ENTITLEMENT, AND CAPITAL, WITH BOUNDLESS MOBILITY ENABLED BY THE INTERNET.
Language:TypeScript1 1 41
SharathHebbar/Coding-Templates
Coding Templates
Language:Jupyter Notebook1 1 0
sunnynevarekar/LLM_Mistral_7b_SFT
Finetune Mistral 7b v1.0 on custom dataset
Language:Jupyter Notebook1 1 00
tonyskapunk/sft-aur
Scripts to keep up with latest scaleft packages to build them for AUR
Language:Shell1 3 01
data-dream-gdsp/Hello-Happy-World
AI-powered automatic dataset creation from the web, Support for LoRA and SFT question generation!
Language:Python00
XpastaX/Instruction-Fusion
Advancing Prompt Evolution through Hybridization
Language:Python0 1 00
web-seven/w7
Control Planes Blockchain Network
Language:Go0 0

sft

dataelement/bisheng

modelscope/ms-swift

ssbuild/chatglm_finetuning

jerry1993-tech/Cornucopia-LLaMA-Fin-Chinese

ukairia777/tensorflow-nlp-tutorial

choosewhatulike/trainable-agents

0xsequence/erc-1155

solv-finance/erc-3525

NiuTrans/Vision-LLM-Alignment

muyu42/DataS

ecnu-sea/SEA

ssbuild/moss_finetuning

movescriptions/movescriptions

ElvenTools/elven-tools-cli

rbga/Low_Density_Parity_Check_LDPC_Codes_-_MATLAB_Simulation

wangclnlp/DeepSpeed-Chat-Extension

Macielyoung/Baichuan-QLora

taishan1994/chinese_llm_sft

AlekseyKorshuk/gai-project

DaehanKim/EasyRLHF

hlp-ai/miniChatGPT

THU-KEG/DICE

ldclabs/ic-sft

ElvenTools/elven-tools-sft-minter-sc

SharathHebbar/sft_mathgpt2

Sophietje/SFTLearning

dgomezde83/Multifungible-library

Lamsoda1123/GPT2_medium_finetune-lora-sft

Lizhecheng02/Kaggle-LMSYS

sftchance/sftchance

SharathHebbar/Coding-Templates

sunnynevarekar/LLM_Mistral_7b_SFT

tonyskapunk/sft-aur

data-dream-gdsp/Hello-Happy-World

XpastaX/Instruction-Fusion

web-seven/w7