finetuning-llms

There are 80 repositories under the finetuning-llms topic.

adithya-s-k/AI-Engineering.academy
Mastering Applied AI, One Concept at a Time
Language: Jupyter Notebook · 778 9 292
GURPREETKAURJETHRA/END-TO-END-GENERATIVE-AI-PROJECTS
End to End Generative AI Industry Projects on LLM Models with Deployment_Awesome LLM Projects
129 1 044
Itachi-Uchiha581/Auto-Data
Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).
Language: Python · 90 3 08
simplifine-llm/Simplifine
🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨
Language: Python · 86 2 13
neuralwork/instruct-finetune-mistral
Fine-tune Mistral 7B to generate fashion style suggestions
Language: Python · 33 0 16
wangermeng2021/llm-webui
A Gradio web UI for Large Language Models. Supports LoRA/QLoRA finetuning, RAG (retrieval-augmented generation), and chat.
Language: Python · 33 3 16
BaohaoLiao/mefts
[NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning
Language: Python · 29 3 01
Prismadic/magnet
the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly
Language: Python · 27 4 192
samadon1/LLM-From-Scratch
Medical Language Model fine-tuned using pretraining, instruction tuning, and Direct Preference Optimization (DPO). Progresses from general medical knowledge to specific instruction following, with experiments in preference alignment for improved medical text generation and understanding.
Language: Jupyter Notebook · 25 2 08
garyfanhku/Galore-pytorch
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Language: Python · 22 3 01
AlphaPav/mem-kk-logic
On Memorization of Large Language Models in Logical Reasoning
Language: Python · 18 1 04
zhaoyl18/SEIKO
SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all baselines (PPO, classifier-based guidance, direct reward backpropagation) for fine-tuning Stable Diffusion.
Language: Python · 17 3 30
adithya-s-k/Indic-llm
An open-source framework designed to adapt pre-trained Language Models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of domains and languages.
Language: Python · 12 2 02
BaohaoLiao/ApiQ
[EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs
Language: Python · 12 2 02
bhattbhavesh91/google-gemma-finetuning-n2sql
Finetuning Google's Gemma Model for Translating Natural Language into SQL
Language: Jupyter Notebook · 11 2 03
SaltyGod/Qwen-Qlora-ACSA
Qwen-1.5-1.8B sentiment analysis with prompt optimization and QLoRA fine-tuning
Language: Python · 9 1 00
louisc-s/QLoRA-Fine-tuning-for-Film-Character-Styled-Responses-from-LLM
Code for fine-tuning the Llama 2 LLM with a custom text dataset to produce film-character-styled responses
Language: Python · 8 1 01
zelaki/awesome-LoRA
A curated list of Parameter Efficient Fine-tuning papers with a TL;DR
6 1 00
PromptEngineer48/Fine_tuning_1
Finetuning LLMs + Private Data (Video 1/10) Basic
Language: Jupyter Notebook · 4 2 02
Rahul-AkaVector/java-code-generator
This repository contains code for fine-tuning the Llama 3 8B model using Alpaca prompts to generate Java code. The code is based on a Google Colab notebook.
Language: Jupyter Notebook · 4 1 04
ShashankGupta10/Code-Wizard
Code Wizard is a coding companion and code generation tool powered by CodeLLama-v2-34B that automatically generates and enhances code based on best practices found in your GitHub repository.
Language: Python · 4 2 00
alaradirik/finetune-phi-2
Fine-tune Phi-2 for persona-grounded chat
Language: Python · 3 2 03
harshpreet14/journalling
An audio journaling app that provides AI analysis for your journal entries
Language: JavaScript · 3 1 00
inuwamobarak/Meta-Llama-3-8B
Experiments with the Meta-Llama-3-8B
Language: Jupyter Notebook · 3 1 02
LikithMeruvu/Gemma2B_Finetuning_Medium
This repo shows how to fine-tune Google's new Gemma LLM using your own custom instruction dataset. I fine-tuned the Gemma 2B Instruct model on data from 20k Medium articles for 5 hours on a Kaggle P100 GPU.
Language: Jupyter Notebook · 3 1 11
roy-sub/LLM-FineTuning
Fine-Tuned Language Models Exploration using LoRA and Hugging Face's Transformers Library
Language: Jupyter Notebook · 3 1 01
arunpshankar/VAI-FineTuning-LLMs
"Clean and comprehensive examples for fine-tuning LLMs supported by Vertex AI, including proprietary, open-source, and third-party models."
Language: Python · 2 2 00
Gaurav-Van/Fine-Tuning-LLMs
Introductory Guide where we will talk about Different Techniques of Fine Tuning LLMs
Language: Jupyter Notebook · 2 1 01
GURPREETKAURJETHRA/Multi-GPU-Fine-Training-LLMs
Multi GPU Fine Training LLMs using DeepSpeed and Accelerate.
Language: Jupyter Notebook · 2 1 01
harshit433/ResurrectAI
ResurrectAI is an AI-driven chat application designed to bring the wisdom and knowledge of great historical personalities to life. Leveraging advanced language models and fine-tuning techniques, ResurrectAI enables users to interact with AI avatars of iconic figures, gaining access to their insights, guidance, and philosophical teachings in real time.
Language: Dart · 2 1 00
SuryaKrishna02/sft-llm-news-articles-telugu
The repository contains the code used to create the instruct-style dataset of Telugu news articles.
Language: Jupyter Notebook · 2 1 00
YashBhalekar07/java-code-generator
This repository contains code for fine-tuning the Llama 3 8B model using Alpaca prompts to generate Java code. The code is based on a Google Colab notebook.
Language: Jupyter Notebook · 2 1 0
zeyadusf/Finetuning-LLMs
Finetuning Large Language Models
2 1 0
andrew264/modelex
Doing devious stuff with AI
Language: Python · 1 2 0
codelion/ellora
Enhancing LLMs with LoRA
1 1 0
nootums/lora-from-scratch
Building LoRA adapters from scratch
Language: Jupyter Notebook · 1
