/RLCO-Papers

Paper collection of reinforcement learning based combinatorial optimization

MIT LicenseMIT

RLCO-Papers

Reinforcement Learning based combinatorial optimization (RLCO) is a very interesting research area. Combinatorial Optimization Problems include: Travelling Salesman Problem (TSP), Single-Source Shortest Paths (SSP), Minimum Spanning Tree (MST), Vehicle Routing Problem (VRP), Orienteering Problem, Knapsack Problem, Maximal Independent Set (MIS), Maximum Cut (MC), Minimum Vertex Cover (MVC), Maximal Clique (MC), Integer Linear Programming (ILP), Network Optimization (Routing), Graph Coloring Problem (GCP), Bin Packing Problem, Graph Partitioning, EDA problems. Most of them are NP-hard or NP-complete. Combinatorial Problems can traditionally be solved by: exact method, heuristic method (genetic algorithm, simulated annealing), etc. Recently, better learning-based solvers are coming out.

This is a collection of resaerch & application papers of RLCO. Papers are sorted by time and categories. Some related supervised learning papers are also listed as a reference.

The sharing principle of these references here is for research. If any authors do not want their paper to be listed here, please feel free to contact [Haiguang Liao] (Email: haiguanl [AT] andrew.cmu.edu). Feedbacks on any mistakes on the repo are also welcomed

Review Papers:

Research Papers:

Papers are catgorized based on the solution approahces and ordered in time sequence:

1. Policy RL + (GNN)

2. Value RL + (GNN)

3. Supervised Learning + Tree Search + Graph Embedding

Application Papers:

Papers are catgorized based on the application domains and ordered in time sequence:

1. Electronic Design Automation (EDA)

EDA is not easy. Some problems in EDA such as physical design (floorplan, placement, routing, etc.) can be formulated as combinatorial optimization problems. Thus, some of them are solved with RLCO.

2. Path Planning (Routing)

RL has been extensively applied to robotic path planning, the papers listed here are only a tiny subset of those we think relevent to RLCO.

3. Resource Management

Relevant Papers:

1. MLCO (Machine Learning for Combinatorial Optimization)

2. ILCO (Imitation for Combinatorial Optimization)

3. GNN and CO

Relevant Resources:

Frameworks:

  1. PyTorch Geometric (TU Dortmund University)
  2. Deep Graph Library (Amazon Web Services, AWS Shanghai AI Lab, New York University, NYU Shanghai)

Libraries:

  1. Facilate the learning of heuristics for CO problems similar to OpenAI Gym.
  1. Facilitate the learning of configuration parameters for CO solvers.
  1. Offering a general, extensible framework for implementing and evaluating machine learning-enhanced CO, also based on OpenAI Gym.
  • Ecole (Mila, Polytechnique Montréal)

Environments/Problems: