openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
Python · MIT
Stargazers
- Ankur3107 (JPMorgan Chase)
- bittdy (Beijing-IT)
- BusbyActual (Los Angeles)
- caoxu915683474 (@Lenovo Reasearch@BIT)
- Celeste-cj
- charles9n (Pasadena, CA.)
- chrisdonahue (Stanford)
- DeepAndy
- denisfitz57
- errolyan (TianAnMen)
- fanglinchen (Carnegie Mellon University)
- fly51fly (PRIS)
- GitHub30 (Osaka, Japan)
- j-min (UNC Chapel Hill)
- JeffCarpenter (Canada)
- jon-chun (Kenyon College)
- lmmx (@beatchain)
- mbyase (future technology)
- naranbat
- neeksor (Phoenix, Arizona)
- nghuyong (@Tencent)
- orhmeh09 (Istanbul)
- Oskop
- outformatics (outformatics)
- qywu (Columbia University)
- raruidol (ARG-tech | Centre for Argument Technology | University of Dundee)
- raz0rknaif
- sdan
- soneo1127
- srgraham
- weekstudy (Shanghai)
- wojiaopanhaoran
- WuTheFWasThat
- xuanhan863 (Los Angeles, USA)
- ZeroLeon (Shanghai)
- zhiyue (Guangzhou)