/MIL-for-Non-Markovian-Reward-Modelling

Code for the paper "Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning".

Primary LanguagePython

Watchers