pengqianhan/safety-rbr-code-and-data-pq
Code and example data for the paper: Rule Based Rewards for Language Model Safety
Jupyter NotebookMIT
No issues in this repository yet.
Code and example data for the paper: Rule Based Rewards for Language Model Safety
Jupyter NotebookMIT
No issues in this repository yet.