qzed/irl-maxent
Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python
Jupyter NotebookMIT
Issues
- 3
if the trajectory stays n the terminal state (for a limited number of times)
#6 opened by ArezooAalipanah - 1
How do I cite this?
#5 opened by catubc - 1
Supporting MDPs with negative reward states?
#4 opened by kierad - 5
Multiple terminal states
#3 opened by siddhya - 1
Presentation template
#2 opened by rosewang2008