finetuning-llms
There are 80 repositories under the finetuning-llms topic.
adithya-s-k/AI-Engineering.academy
Mastering Applied AI, One Concept at a Time
GURPREETKAURJETHRA/END-TO-END-GENERATIVE-AI-PROJECTS
End-to-end generative AI industry projects on LLMs with deployment (Awesome LLM Projects)
Itachi-Uchiha581/Auto-Data
Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).
simplifine-llm/Simplifine
🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨
neuralwork/instruct-finetune-mistral
Fine-tune Mistral 7B to generate fashion style suggestions
wangermeng2021/llm-webui
A Gradio web UI for Large Language Models. Supports LoRA/QLoRA finetuning, RAG (retrieval-augmented generation), and chat.
BaohaoLiao/mefts
[NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning
Prismadic/magnet
The small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly
samadon1/LLM-From-Scratch
Medical Language Model fine-tuned using pretraining, instruction tuning, and Direct Preference Optimization (DPO). Progresses from general medical knowledge to specific instruction following, with experiments in preference alignment for improved medical text generation and understanding.
garyfanhku/Galore-pytorch
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
AlphaPav/mem-kk-logic
On Memorization of Large Language Models in Logical Reasoning
zhaoyl18/SEIKO
SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our method outperforms all baselines (PPO, classifier-based guidance, direct reward backpropagation) for fine-tuning Stable Diffusion.
adithya-s-k/Indic-llm
An open-source framework designed to adapt pre-trained Large Language Models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of domains and languages.
BaohaoLiao/ApiQ
[EMNLP 2024] Quantize LLMs to extremely low bit widths and finetune the quantized LLMs
bhattbhavesh91/google-gemma-finetuning-n2sql
Finetuning Google's Gemma Model for Translating Natural Language into SQL
SaltyGod/Qwen-Qlora-ACSA
Qwen-1.5-1.8B sentiment analysis with prompt optimization and QLoRA fine-tuning
louisc-s/QLoRA-Fine-tuning-for-Film-Character-Styled-Responses-from-LLM
Code for fine-tuning the Llama 2 LLM on a custom text dataset to produce film-character-styled responses
zelaki/awesome-LoRA
A curated list of Parameter Efficient Fine-tuning papers with a TL;DR
PromptEngineer48/Fine_tuning_1
Finetuning LLMs + Private Data (Video 1/10) Basic
Rahul-AkaVector/java-code-generator
This repository contains code for fine-tuning the Llama 3 8B model using Alpaca prompts to generate Java code. The code is based on a Google Colab notebook.
ShashankGupta10/Code-Wizard
Code Wizard is a coding companion and code generation tool powered by CodeLLama-v2-34B that automatically generates and enhances code based on best practices found in your GitHub repository.
alaradirik/finetune-phi-2
Fine-tune Phi-2 for persona-grounded chat
harshpreet14/journalling
An audio journaling app that provides AI analysis for your journal entries
inuwamobarak/Meta-Llama-3-8B
Experiments with Meta-Llama-3-8B
LikithMeruvu/Gemma2B_Finetuning_Medium
This repo shows how to finetune Google's new Gemma LLM on your custom instruction dataset. I finetuned the Gemma 2B Instruct model on 20k Medium articles for 5 hours on a Kaggle P100 GPU.
roy-sub/LLM-FineTuning
Fine-Tuned Language Models Exploration using LoRA and Hugging Face's Transformers Library
arunpshankar/VAI-FineTuning-LLMs
"Clean and comprehensive examples for fine-tuning LLMs supported by Vertex AI, including proprietary, open-source, and third-party models."
Gaurav-Van/Fine-Tuning-LLMs
Introductory guide to different techniques for fine-tuning LLMs
GURPREETKAURJETHRA/Multi-GPU-Fine-Training-LLMs
Multi-GPU fine-tuning of LLMs using DeepSpeed and Accelerate.
harshit433/ResurrectAI
ResurrectAI is an AI-driven chat application designed to bring the wisdom and knowledge of great historical personalities to life. Leveraging advanced language models and fine-tuning techniques, ResurrectAI enables users to interact with AI avatars of iconic figures, gaining access to their insights, guidance, and philosophical teachings in real time.
SuryaKrishna02/sft-llm-news-articles-telugu
This repository contains the code used to create an instruct-style dataset of Telugu news articles.
YashBhalekar07/java-code-generator
This repository contains code for fine-tuning the Llama 3 8B model using Alpaca prompts to generate Java code. The code is based on a Google Colab notebook.
zeyadusf/Finetuning-LLMs
Finetuning Large Language Models
andrew264/modelex
Doing devious stuff with AI
codelion/ellora
Enhancing LLMs with LoRA
nootums/lora-from-scratch
Building LoRA adapters from scratch