ahmed-alllam/Direct-Preference-Optimization
Direct Preference Optimization from scratch in PyTorch
Python
Stargazers
- ahmed-alllamCairo, Egypt
- ambyerhantencent
- d-isasterhub
- dadamaowangShanghai, China
- dihuangdhZhejiang University
- FredKhayat
- gclomax
- imr555Neovotech
- Jasonxu1225The Chinese University of Hong Kong, Shenzhen
- JEONGSEJINRepublic of Korea
- keven980716Peking University
- konan009Ateneo de Manila University
- lemo2012
- liuchaohu
- mtxing
- NeXX-N
- NMS05Nanyang Technological University
- nurgumus
- pie33000Pernod RIcard USA
- quockhangdevFuixlabs
- saadettinBerber
- sangminwooKAIST
- shaojh1
- shikhrLucknow, India
- shuishen112@TJUIRLAB
- tathagatoroy
- tonytu16Berkeley, California
- UniBody
- wangxd15
- zhangfaenGoogle