Wheatley

A Job-Shop Scheduling Problem (JSSP) solver based on Reinforcement Learning, targeted at solving real-world industrial problems, and more.

Features

Trains a scheduler for fixed problems or for problems with uncertainty
Supports training over random problems and generalizing to new ones
Support for training over problems with bounded but uncertain durations
Reads JSSP in Taillard format, extended for uncertain durations
Live training metrics reported in a web interface with Visdom
Includes schedule visualization as Gantt charts
Compares its results to OR-Tools
Relies on state-of-the-art Deep Learning libraries: written with PyTorch, and DGL for graph neural networks
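
To illustrate the input format mentioned in the feature list, here is a minimal sketch of a reader for the plain (deterministic) Taillard layout. It assumes a header line with the number of jobs and machines, followed by one block of durations and one block of machine orders, each with one row per job. The function name `parse_taillard` is hypothetical and is not part of Wheatley's API. The extension for uncertain durations is not covered here, and machine numbers are kept exactly as written in the file.

```python
def parse_taillard(text):
    """Parse a deterministic Taillard-style instance from a string.

    Assumed layout (hypothetical helper, not Wheatley's API):
      line 1:            n_jobs n_machines
      next n_jobs lines: durations, one row per job
      next n_jobs lines: machine used by each operation, one row per job
    """
    # Keep only non-empty lines and split them into rows of integers.
    rows = [[int(tok) for tok in line.split()]
            for line in text.splitlines() if line.strip()]
    n_jobs, n_machines = rows[0]
    durations = rows[1:1 + n_jobs]
    machines = rows[1 + n_jobs:1 + 2 * n_jobs]
    if len(durations) != n_jobs or len(machines) != n_jobs:
        raise ValueError("expected 2 * n_jobs data rows after the header")
    for row in durations + machines:
        if len(row) != n_machines:
            raise ValueError("each row must have n_machines entries")
    return n_jobs, n_machines, durations, machines


# Tiny made-up instance: 2 jobs, 3 machines.
example = """2 3
5 3 4
2 6 1
1 2 3
2 1 3
"""
n_jobs, n_machines, durations, machines = parse_taillard(example)
```

Here `durations[j][k]` is the duration of the k-th operation of job j, and `machines[j][k]` is the machine that operation runs on.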

Note: for Windows users, we strongly recommend using Anaconda.

See JSSP, PSP and ADVICE for more information.

If you want to contribute to Wheatley, make sure to install the pre-commit hooks:

pre-commit install

Wheatley learns how to schedule well and generalize over problems and/or uncertainty. It works from a representation of the schedule state-space directly, as opposed to the state-space of jobs and machines.
Uses PPO as the main RL algorithm
Captures schedules in the form of graphs and trains with an underlying Graph Neural Network
Exposes a large number of hyper-parameters, whose default values are set to the best currently known values
Implements a small selection of different rewards
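
To make the idea of a schedule captured as a graph more concrete, the sketch below builds the edges of a classic disjunctive-style schedule graph: one node per operation, precedence edges between consecutive operations of a job, and machine edges between operations placed one after the other on the same machine. This is an illustration under simplifying assumptions, not Wheatley's actual graph encoding. The helper names and the data layout are hypothetical.

```python
def precedence_edges(machines):
    """Edges between consecutive operations of each job.

    Nodes are (job, position) pairs; `machines[j][k]` is the machine used by
    the k-th operation of job j (illustrative data layout, an assumption).
    """
    edges = []
    for j, row in enumerate(machines):
        for k in range(len(row) - 1):
            edges.append(((j, k), (j, k + 1)))
    return edges


def machine_edges(machine_sequences):
    """Edges between operations processed consecutively on the same machine.

    `machine_sequences` maps a machine id to the ordered list of nodes that
    the current (partial) schedule places on it.
    """
    edges = []
    for sequence in machine_sequences.values():
        edges.extend(zip(sequence, sequence[1:]))
    return edges


machines = [[1, 2], [2, 1]]  # two jobs, two machines
# Partial schedule: machine 1 runs (0, 0) then (1, 1); machine 2 runs (1, 0).
partial_schedule = {1: [(0, 0), (1, 1)], 2: [(1, 0)]}
graph_edges = precedence_edges(machines) + machine_edges(partial_schedule)
```

An edge list of this kind is the sort of structure a graph neural network library can consume once nodes are mapped to integer ids.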

Rewards are normalized
Wheatley uses proper batching and parallel environments
Wheatley uses advanced GNNs, such as GATv2 (with edge information), thanks to DGL.
Wheatley embeds more information into every node of the schedule graph (like propagated time bounds), yielding more informed policies
Wheatley has support for bounded uncertain durations, including at node and reward levels.
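
As an illustration of propagated time bounds under bounded uncertain durations, the sketch below computes, for every node of an acyclic schedule graph, a lower and an upper bound on its earliest start time. It assumes each duration is given as a (min, max) pair and that an operation starts as soon as all its predecessors have finished. The function name `propagate_start_bounds` is hypothetical, and this is not necessarily how Wheatley computes its node features.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def propagate_start_bounds(durations, edges):
    """Return {node: (earliest_start_min, earliest_start_max)}.

    `durations[node]` is a (min, max) pair bounding the uncertain duration;
    `edges` are (before, after) pairs of a directed acyclic schedule graph.
    Each bound is a longest-path computation, done once with the minimum
    durations and once with the maximum durations.
    """
    predecessors = {node: [] for node in durations}
    for before, after in edges:
        predecessors[after].append(before)

    bounds = {}
    # static_order() yields every node after all of its predecessors.
    for node in TopologicalSorter(predecessors).static_order():
        preds = predecessors[node]
        lo = max((bounds[p][0] + durations[p][0] for p in preds), default=0)
        hi = max((bounds[p][1] + durations[p][1] for p in preds), default=0)
        bounds[node] = (lo, hi)
    return bounds


# Operation "c" must wait for both "a" and "b".
durations = {"a": (2, 3), "b": (1, 4), "c": (5, 5)}
edges = [("a", "c"), ("b", "c")]
bounds = propagate_start_bounds(durations, edges)
```

In this example `bounds["c"]` is `(2, 4)`: with the shortest durations, "a" is the last predecessor to finish (at time 2), and with the longest durations, "b" is (at time 4).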