Pinned Repositories
trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
glerzing
Config files for my GitHub profile.
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
TransformerLens
TransformerLens
trl
Train transformer language models with reinforcement learning.
trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
trl
Train transformer language models with reinforcement learning.
tournesol
Free and open source code of the https://tournesol.app platform. Meet the community on Discord https://discord.gg/WvcSG55Bf3
glerzing's Repositories
glerzing/glerzing
Config files for my GitHub profile.
glerzing/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
glerzing/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
glerzing/TransformerLens
TransformerLens
glerzing/trl
Train transformer language models with reinforcement learning.
glerzing/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)