Implementation for a paper on domain knowledge injection through pre-training. Specifically, the policy network is initialized by training it to mimic an amateur policy.
WORK IN PROGRESS
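The pre-training idea described above can be illustrated with a minimal sketch. It is not the project's actual code: it assumes the mimicking step is plain behavior cloning (cross-entropy against the amateur's actions), uses a tiny linear softmax policy instead of a real network, and the names `amateur_policy`, `LinearPolicy`, and `pretrain` are hypothetical.

```python
import math
import random


def amateur_policy(state):
    """Hypothetical hand-written heuristic standing in for the amateur policy."""
    return 0 if state[0] > 0 else 1


def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


class LinearPolicy:
    """Tiny softmax policy (assumption: stands in for the policy network)."""

    def __init__(self, state_dim, n_actions):
        self.W = [[0.0] * state_dim for _ in range(n_actions)]
        self.b = [0.0] * n_actions

    def probs(self, state):
        logits = [
            sum(w * x for w, x in zip(row, state)) + bias
            for row, bias in zip(self.W, self.b)
        ]
        return softmax(logits)

    def act(self, state):
        p = self.probs(state)
        return max(range(len(p)), key=p.__getitem__)


def pretrain(policy, states, epochs=50, lr=0.5):
    """Behavior cloning: SGD on cross-entropy against the amateur's actions."""
    for _ in range(epochs):
        for s in states:
            target = amateur_policy(s)
            p = policy.probs(s)
            for a in range(len(p)):
                # Gradient of cross-entropy w.r.t. logit a is p[a] - onehot[a].
                grad = p[a] - (1.0 if a == target else 0.0)
                for i, x in enumerate(s):
                    policy.W[a][i] -= lr * grad * x
                policy.b[a] -= lr * grad
    return policy


rng = random.Random(0)
states = [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(200)]
policy = pretrain(LinearPolicy(state_dim=2, n_actions=2), states)
agreement = sum(policy.act(s) == amateur_policy(s) for s in states) / len(states)
print(f"agreement with amateur policy: {agreement:.2f}")
```

After this step, the pre-trained weights would serve as the starting point for whatever training follows, rather than a random initialization.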
Language: Python. License: MIT.