fine-tuning
There are 1501 repositories under fine-tuning topic.
ort
Fast ML inference & training for ONNX models in Rust
LibFewShot
LibFewShot: A Comprehensive Library for Few-shot Learning. TPAMI 2023.
Bert-Multi-Label-Text-Classification
This repo contains a PyTorch implementation of a pretrained BERT model for multi-label text classification.
DataDreamer
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
LLM-Finetuning-Toolkit
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
AI-Engineering.academy
Mastering Applied AI, One Concept at a Time
Lora-for-Diffusers
The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥
Magick
Magick is a cutting-edge toolkit for a new kind of AI builder. Make Magick with us!
ModelsGenesis
[MICCAI 2019 Young Scientist Award] [MEDIA 2020 Best Paper Award] Models Genesis
NeMo-Curator
Scalable data pre processing and curation toolkit for LLMs
cerebellum
Browser automation system that uses AI-driven planning to navigate web pages and perform goals.
start-llms
A complete guide to start and improve your LLM skills in 2025 with little background in the field and stay up-to-date with the latest news and state-of-the-art techniques!
vectordb-recipes
High quality resources & applications for LLMs, multi-modal models and VectorDBs
beta9
Run serverless GPU workloads with fast cold starts on bare-metal servers, anywhere in the world
RAG-FiT
Framework for enhancing LLMs for RAG tasks using fine-tuning.
MARS
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
LLM-Kit
🚀WebUI integrated platform for latest LLMs | 各大语言模型的全流程工具 WebUI 整合包。支持主流大模型API接口和开源模型。支持知识库,数据库,角色扮演,mj文生图,LoRA和全参数微调,数据集制作,live2d等全流程应用工具
slowllama
Finetune llama2-70b and codellama on MacBook Air without quantization
LongCite
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
mergoo
A library for easily merging multiple LLM experts, and efficiently train the merged LLM.
aikit
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
MPP-LLaVA
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.
simpleT5
simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.
tiger
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
embedding_studio
Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.
LLM-RLHF-Tuning
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
huozi
活字通用大模型
fondant
Production-ready data processing made easy and shareable
BentoDiffusion
BentoDiffusion: A collection of diffusion models served with BentoML
MedQA-ChatGLM
🛰️ 基于真实医疗对话数据在ChatGLM上进行LoRA、P-Tuning V2、Freeze、RLHF等微调,我们的眼光不止于医疗问答
RAG-Driven-Generative-AI
This repository provides programs to build Retrieval Augmented Generation (RAG) code for Generative AI with LlamaIndex, Deep Lake, and Pinecone leveraging the power of OpenAI and Hugging Face models for generation and evaluation.
BOND
BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision
Odyssey
Odyssey: Empowering Minecraft Agents with Open-World Skills
shared_colab_notebooks
A Repo to store the Google Colaboratory Notebooks that I have created and shared
AutoAudit
AutoAudit—— the LLM for Cyber Security 网络安全大语言模型