IQCbased_ImitationLearning

This code is to accompany the paper Imitation Learning with Stability and Safety Guarantees. It learns a Neural Network controller with stability and safety guarantees through imitation learning process.

Authors:

  • He Yin (he_yin at berkeley.edu)
  • Peter Seiler (pseiler at umich.edu)
  • Ming Jin (jinming at vt.edu)
  • Murat Arcak (arcak at berkeley.edu)

Getting Started

The code is written in Python3 and MATLAB.

Prerequisites

There are several packages required:

  • MOSEK: Commercial semidefinite programming solver
  • CVX: MATLAB Software for Convex Programming
  • Tensorflow: Open source machine learning platform

To plot the computed ROA, two more packages are required:

  • SOSOPT: General SOS optimization utility
  • Multipoly: Package used to represent multivariate polynomials

Way of Using the Code

  • To start the safe imitation learing process, go to each folder, run NN_policy.py. The computation results are stored in the folder data.
  • To visualize the results for the inverted pendulum example, run result_analysis.m. For the GTM and vehicle lateral control examples, run plot_generation.m.

ROAs of the Learned NN Controllers and Explicit MPC Controller for the Vehicle Lateral Control Example

vehicle