minyang-chen/RLHF_example
Reinforcement learning from human feedback (RLHF) Movie Reviews Example
Jupyter NotebookApache-2.0
No issues in this repository yet.
Reinforcement learning from human feedback (RLHF) Movie Reviews Example
Jupyter NotebookApache-2.0
No issues in this repository yet.