trjo1/genaiwithllms
Fine-tuned FLAN T-5 using Instruction Fine-Tuning (Full), LoRA-based PEFT, and RLHF with PPO
Jupyter NotebookMIT
Stargazers
No one’s star this repository yet.
Fine-tuned FLAN T-5 using Instruction Fine-Tuning (Full), LoRA-based PEFT, and RLHF with PPO
Jupyter NotebookMIT
No one’s star this repository yet.