qzed/irl-maxent

Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python

Jupyter NotebookMIT

Issues

if the trajectory stays n the terminal state (for a limited number of times)
#6 opened a year ago by ArezooAalipanah
3
How do I cite this?
#5 opened 9 months ago by catubc
1
Supporting MDPs with negative reward states?
#4 opened 2 years ago by kierad
1
Multiple terminal states
#3 opened 2 years ago by siddhya
5
Presentation template
#2 opened 2 years ago by rosewang2008
1