ilkaza

Pinned Repositories

JPAL-HA
Justified Human Preferences for Active Learning with Hypothetical Actions (JPAL-HA) is an human-in-the-loop algorithm for safe agent learning in safety-critical environments. It builds on the Parenting algorithm, augmenting it with two novel and generalisable ideas: Justifications and Hypothetical Actions.
Language:Python10
swc-rf4
Language:Python00

ilkaza's Repositories

ilkaza/JPAL-HA
Justified Human Preferences for Active Learning with Hypothetical Actions (JPAL-HA) is an human-in-the-loop algorithm for safe agent learning in safety-critical environments. It builds on the Parenting algorithm, augmenting it with two novel and generalisable ideas: Justifications and Hypothetical Actions.
Language:Python1
ilkaza/swc-rf4