/ShieldLM

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors

Primary LanguagePythonMIT LicenseMIT

Stargazers