/RLHI

Reinforcement Learning with Heuristic Imperatives - Finetuning LLMs for Post-Conventional Moral Intuition

Primary LanguagePythonMIT LicenseMIT

Stargazers

No one’s star this repository yet.