Solving a Real-world optimization problem using Proximal Policy Optimization combined with Curriculum Learning and Reward Engineering ♻️
- Python >=3.8.0,<3.10
- Conda
- Follow instructions : Installer link
Create a conda virtual environment and run the following commands
conda create -n myenv python=3.8.8
conda activate myenv
pip install -r requirements.txt
python reproduce_results_paper.py
Feel free to ask questions. Posting in Github Issues and PRs are also welcome.