lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
Python · MIT
Watchers
- AlbertBJ · Beijing, China
- AlexiaJM · Samsung SAIT
- Ayush-a3h · @Focus-Bear
- CamaradaLares
- duyvuleo · @oracle
- eemailme
- hexadecible · Telios
- iceychris · Augsburg University of Applied Sciences
- jabogithub
- jbdatascience · Netherlands
- karlotimmerman · @sky-dust-intelligence
- lucidrains · San Francisco
- madalincostea · MadeNN
- mansoor-s · ¯\_(ツ)_/¯
- mbofb
- michael-erasmus · @DonorsChoose
- pczzy · Sina.com
- physicsru
- richstav
- runrunliuliu
- suchith720 · Indian Institute of Technology, Delhi
- voxmenthe
- vulcangz