victor-iyi/rlhf-trl

Reinforcement Learning from Human Feedback with 🤗 TRL

Python

Readme
0 Issues
9 Stargazers
3 Watchers

Stargazers

bdx0
bdx0.io.vn
daniuxiaochun
emphasis10
Samsung Research
Eruly
Sionic AI
Natyren
samanehheidari48
Sandalots
Volcanak
younesbelkada
@huggingface
