This is the code for the paper "Episodic Return Decomposition by Difference of Implicitly Assigned Sub-trajectory Reward".
Install Python environment with:
conda create -n diaster python=3.9 -y
conda activate diaster
conda install pytorch cudatoolkit=11.3 -c pytorch -y
pip install -r ./requirements.txt
python3 main.py --env-name=[Env name]
The config files act as defaults for a task.
They are all located in config
.
--env-name
refers to the config files in config/
including Hopper-v3, Walker2d-v3, Swimmer-v3, Humanoid-v3, HumanoidStandup-v2.
All results will be stored in the result
folder.
For example, run Diaster on Hopper:
python main.py --env-name=Hopper-v3