/off_policy_trpo

Simple implementation of off policy TRPO.

Primary LanguagePython

Off-Policy TRPO

This is a simple implemenation of off-policy TRPO (link).

requirement

results

HalfCheetah-v2

img

  • obtained by training with three seeds.
  • {algo_name}-Norm: training with state normalization.