Pinned Repositories
qlora
QLoRA: Efficient Finetuning of Quantized LLMs
FLAN
t5x
colabtools
Python libraries for Google Colaboratory
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
trl
Train transformer language models with reinforcement learning.
scaling_sentemb
Scaling Sentence Embeddings with Large Language Models
hellaswag
HellaSwag: Can a Machine _Really_ Finish Your Sentence?
unsloth
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
TextRL
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
JhonDan1999's Repositories
JhonDan1999 doesn’t have any repository yet.