AlignmentResearch/epic
Implements the Equivalent-Policy Invariant Comparison (EPIC) distance for reward functions.
PythonMIT
Issues
- 3
- 0
Add continuous integration
#4 opened by AdamGleave
Implements the Equivalent-Policy Invariant Comparison (EPIC) distance for reward functions.
PythonMIT