antoinedang/AmateurPolicyImitation
Implementation for research project on domain knowledge injection through pre-training. Specifically, an amateur policy is mimicked to initialize the policy network.
PythonMIT
Implementation for research project on domain knowledge injection through pre-training. Specifically, an amateur policy is mimicked to initialize the policy network.
PythonMIT