JAEarly/MIL-for-Non-Markovian-Reward-Modelling

Code for the paper "Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning".

Python

Watchers