sft
There are 41 repositories under the sft topic.
dataelement/bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
ssbuild/chatglm_finetuning
chatglm 6b finetuning and alpaca finetuning
jerry1993-tech/Cornucopia-LLaMA-Fin-Chinese
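Cornucopia (聚宝盆): a series of open-source, commercially usable Chinese financial LLMs, along with an efficient, lightweight training framework for vertical-domain LLMs (pretraining, SFT, RLHF, quantization, etc.)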
ukairia777/tensorflow-nlp-tutorial
A deep learning NLP repository that uses TensorFlow to cover everything from text preprocessing to downstream tasks for recent models such as topic models, BERT, GPT, and LLMs.
choosewhatulike/trainable-agents
Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"
0xsequence/erc-1155
Ethereum Semi Fungible Standard (ERC-1155)
solv-finance/erc-3525
ERC-3525 Reference Implementation
NiuTrans/Vision-LLM-Alignment
This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
muyu42/DataS
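This project aims to build on representative work by earlier researchers to evaluate SFT data along multiple dimensions and to filter SFT data automatically.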
ecnu-sea/SEA
SEA is an automated paper review framework capable of generating comprehensive and high-quality review feedback with high consistency for papers, thereby assisting researchers in improving the quality of their work.
ssbuild/moss_finetuning
moss chat finetuning
movescriptions/movescriptions
https://twitter.com/MoveScriptions
ElvenTools/elven-tools-cli
Elven Tools CLI - command line tool for launching NFT collections on the MultiversX blockchain (plus other tools).
rbga/Low_Density_Parity_Check_LDPC_Codes_-_MATLAB_Simulation
LDPC MATLAB simulation using BPSK + AWGN modulation decoded using Sum Product and Min Sum Algorithm
wangclnlp/DeepSpeed-Chat-Extension
This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
Macielyoung/Baichuan-QLora
Finetune baichuan pretrained model with QLora method
taishan1994/chinese_llm_sft
Fine-tune large models using instruction tuning.
AlekseyKorshuk/gai-project
Train expert conversational role-play LLMs with synthetic data
DaehanKim/EasyRLHF
EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets
hlp-ai/miniChatGPT
Mini ChatGPT
THU-KEG/DICE
DICE: Detecting In-distribution Data Contamination with LLM's Internal State
ldclabs/ic-sft
An SFT (Semi-Fungible Token, implementing ICRC-7 and ICRC-37) canister smart contract on the Internet Computer.
ElvenTools/elven-tools-sft-minter-sc
Elven Tools SFT Minter Smart Contract - launching SFT collections on the MultiversX blockchain
SharathHebbar/sft_mathgpt2
Supervised Fine tuning using TRL library
Sophietje/SFTLearning
Testing the security of sanitizers by learning symbolic finite transducers
dgomezde83/Multifungible-library
MultiversX library for interacting with the MultiversX blockchain's Non-fungible tokens and Semi-fungible tokens.
Lamsoda1123/GPT2_medium_finetune-lora-sft
It's a GPT2 finetune project based on peft and transformers. Although finetuning provides quite an improvement, the model's hallucination and intelligence are still terrible.
Lizhecheng02/Kaggle-LMSYS
Analyze a dataset of conversations from the Chatbot Arena, where various LLMs provide responses to user prompts. The goal is to develop a model that enhances chatbot interactions, ensuring they align more closely with human preferences.
sftchance/sftchance
⚪ CHANCE IS A STUDY IN DECENTERED IDENTITY TOURISM AND THE A(E)FFECTS OF PRIVILEGE, ENTITLEMENT, AND CAPITAL, WITH BOUNDLESS MOBILITY ENABLED BY THE INTERNET.
SharathHebbar/Coding-Templates
Coding Templates
sunnynevarekar/LLM_Mistral_7b_SFT
Finetune Mistral 7b v1.0 on custom dataset
tonyskapunk/sft-aur
Scripts to keep up with latest scaleft packages to build them for AUR
data-dream-gdsp/Hello-Happy-World
AI-powered automatic dataset creation from the web, with support for LoRA and SFT question generation!
XpastaX/Instruction-Fusion
Advancing Prompt Evolution through Hybridization
web-seven/w7
Control Planes Blockchain Network