liujch1998/l-mcts_alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
PythonApache-2.0
Watchers
No one’s watching this repository yet.
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
PythonApache-2.0
No one’s watching this repository yet.