This repository is the implementation version using Newest MineRL dataset. The original repo and info please see below.
Dependencies:
Please download MineRL dataset through:
====================================== Original Version ======================================
This repository is the TF2.0 implementation of Forgetful Replay Buffer for Reinforcement Learning from Demonstrations by Alexey Skrynnik, Aleksey Staroverov, Ermek Aitygulov, Kirill Aksenov, Vasilii Davydov, Aleksandr I. Panov.
If you use this repo in your research, please consider citing the paper as follows
@misc{skrynnik2020forgetful,
title={Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations},
author={Alexey Skrynnik and Aleksey Staroverov and Ermek Aitygulov and Kirill Aksenov and Vasilii Davydov and Aleksandr I. Panov},
year={2020},
eprint={2006.09939},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
To install requirements:
pip install -r requirements.txt
For a set of simple environments, most of the expert data is already in the demonstrations folder.
To train ForgER on MineRL domain you need to put expert data in the demonstrations
folder.
To train ForgER on Simple set, run this command:
python train_simple_set.py --config simple_set_config.yaml
You can change the environment and the path to expert data in the config.
The simple_set_config.yaml
file provides an example config for the Acrobot-v1 environment.
To train ForgER on Treechop, run this command:
python train_treechop.py --config treechop_config.yaml
To train ForgER on MineRL, run this command:
python train_minerl.py --config minerl_config.yaml
Item | MineRL2019 | ForgER | ForgER++ |
---|---|---|---|
log | 859 | 882 | 867 |
planks | 805 | 806 | 792 |
stick | 718 | 747 | 790 |
crafting table | 716 | 744 | 790 |
wooden pickaxe | 713 | 744 | 789 |
cobblestone | 687 | 730 | 779 |
stone pickaxe | 642 | 698 | 751 |
furnace | 19 | 48 | 98 |
iron ore | 96 | 109 | 231 |
iron ingot | 19 | 48 | 98 |
iron pickaxe | 12 | 43 | 83 |
diamond | 0 | 0 | 1 |
mean reward | 57.701 | 74.09 | 104.315 |