Solving a Real-world optimization problem using Proximal Policy Optimization combined with Curriculum Learning and Reward Engineering ♻️

🖊 Info

Create a conda virtual environment and run the following commands

conda create -n myenv python=3.8.8
conda activate myenv
pip install -r requirements.txt

python reproduce_results_paper.py

Feel free to ask questions. Posting in Github Issues and PRs are also welcome.