/max-causal-ent-irl

Maximum Causal Entropy Inverse Reinforcement Learning

Primary LanguagePythonMIT LicenseMIT

Maximum Causal Entropy Inverse Reinforcement Learning

Python implementation of Maximum Causal Entropy Inverse Reinforcement Learning from Brian Ziebart's PhD thesis (2010): http://www.cs.cmu.edu/~bziebart/publications/thesis-bziebart.pdf

Main file: max_causal_ent_irl.py.

Work in progress. Current TODOs:

  1. Compute P_0 for the occupancy measure for the gradient of the infinite horizon discounted version of the algorithm as in section A of Levine's GPIRL supplement http://graphics.stanford.edu/projects/gpirl/gpirl_supplement.pdf.
  2. Documentation.