This repository contains an implementation of Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment [Paper].
Authors: Tung M. Luu*, Chang D. Yoo.
Acknowledgement: This work was supported in part by the Institute for Information & Communications Technology Planning & Evaluation (IITP) grant funded by the Korea Government (MSIT) (No. 2019-0-01396, Development of Framework for Analyzing, Detecting, Mitigating of Bias in AI Model and Training Data) and in part by the BK21 FOUR program.
Guide: To run experiments, please follow the instructions in baselines/baselines/her/README.md
If you use this repo in your research, please consider citing the paper as follows:
@ARTICLE{9391700,
  author={Luu, Tung M. and Yoo, Chang D.},
  journal={IEEE Access},
  title={Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment},
  year={2021},
  volume={9},
  number={},
  pages={51996-52007},
  doi={10.1109/ACCESS.2021.3069975}
}
This code is based on OpenAI Baselines.