/VLMs-Behavior-Critic

Task Success is not Enough: Investigating the Use of Video-Language Models as Behavior Critics for Catching Undesirable Agent Behaviors

Primary Language: Python
