multi-echelon-drl

Solving Multi-Echelon Inventory Management Problems with Heuristic-Guided Deep Reinforcement Learning

Recommended Python version: 3.9

Installation

git clone https://github.com/qihuazhong/multi-echelon-drl.git

To install the required packages:

pip install --upgrade -r requirements.txt

Training DRL agents

The experiments setup and hyperparameters of the DRL algorithm should be defined in a YAML file. Examples of experiments setup files can be found at setup_exmaple_centralized.yaml and setup_exmaple_decentralized.yaml.

Usage

Commandline arguments

usage: train_a2c.py [-h] -g GLOBAL_INFO -p HYPERPARAMETERS [--name NAME] --ordering-rule ORDERING_RULE --role {Retailer,Wholesaler,Distributor,Manufacturer,MultiFacility} --scenario SCENARIO
                                                                                                                                                                                              
options:                                                                                                                                                                                      
  -h, --help            show this help message and exit                                                                                                                                       
  -g GLOBAL_INFO, --global-info GLOBAL_INFO                                                                                                                                                   
                        Whether to return global info of the entire supply chain in the decentralized setting. This argument is ignored in the centralized setting                            
  -p HYPERPARAMETERS, --hyperparameters HYPERPARAMETERS                                                                                                                                       
                        Path to the experiment setup file (.yaml)                                                                                                                             
  --name NAME           Name of the experiment. Used as a prefix for saving log files and models to avoid overwriting previous experiment outputs                                             
  --ordering-rule ORDERING_RULE                                                                                                                                                               
                        'a' or 'd+a'
  --role {Retailer,Wholesaler,Distributor,Manufacturer,MultiFacility}
                        Should be one of 'Retailer', 'Wholesaler', 'Distributor', 'Manufacturer' or 'MultiFacility' (Centralized control)
  --scenario SCENARIO   complex or basic

Examples

To train a TD3 agent with heuristic guided exploration (HGE), simply run the train script with the path to a YAML file as the first argument:

python train_td3.py -e setup_exmaple_centralized.yaml

Or train a TD3 / DQN / A2C without HGE in the decentralized setting: