Pinned Repositories
lm-evaluation-harness
A framework for few-shot evaluation of language models.
llm_qlora
Fine-tuning LLMs using QLoRA
langchain
🦜🔗 Build context-aware reasoning applications
ms-implement
study-Channel-LM-Prompting
study-p-tuning
pyreft
ReFT: Representation Finetuning for Language Models
pyvene
Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions
SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
orpo
Official repository for ORPO