/RLHI

Reinforcement Learning with Heuristic Imperatives - Finetuning LLMs for Post-Conventional Moral Intuition

Primary LanguagePythonMIT LicenseMIT

Stargazers