/RLHI

Reinforcement Learning with Heuristic Imperatives - Finetuning LLMs for Post-Conventional Moral Intuition

Primary LanguagePythonMIT LicenseMIT

No issues in this repository yet.