HumanSignal/RLHF
Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI models
Jupyter Notebook
Stargazers
- AbubakarSaad
- AndrejOros
- bmartelCanada
- borisheartex
- Brishen
- carly-bartel
- deppp@humansignal
- eliekawerkUAE
- emptymalei@spikinglabs
- erinmikailstapleslaunchdarkly
- Gondragos
- hakan458
- hgasconEurope
- hlomzikHeartex
- hogepodgePortland, OR
- joeisastaple
- juliosgarbi@HumanSignal
- KonstantinKorotaev
- krlngscieneers
- lsell@heartexlabs/label-studio
- makseq
- mauryalandParis
- MBrede
- nehaleckyHumanSignal
- niklub
- pakelley
- Pent
- radao@scythe-robotics
- SandalotsVolcanak
- ThanThoaiAI Engineer, Software Engineer
- tsterbakBerlin, Germany
- ufwt
- vagechirkovBerlin
- vladimirheartex
- Yuan-ManXShanghai, China
- yyassi-heartexHumanSignal