facebookresearch/RLCD
Reproduction of "RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment"
Python · MIT
Issues
Argument meaning.
#4 opened by LanShanPi - 2
How to find file?
#3 opened by LanShanPi