The Auto-Multilift is a novel learning framework for cooperative load transportation with quadrotors. It can automatically tune various MPC hyperparameters, which are modeled by DNNs and difficult to tune manually, via reinforcement learning in a distributed and closed-loop manner.
A Diagram of a multilift system and Auto-Multilift Learning Piplines |
---|
Please find out more details in our paper: "Auto-Multilift: Distributed Learning and Control for Cooperative Load Transportation With Quadrotors"
- Project Overview
- Dependency Packages
- How to Use
- [A: Distributed Learning of Adaptive Weightings](#A: Distributed Learning of Adaptive Weightings)
- [B: Distributed Learning of Adaptive References](#B: Distributed Learning of Adaptive References)
- Contact Us
The project consists of two folders, which correspond to the two experiments in the paper that show the following three advantages of our method.
- Auto-Multilift enjoys fast convergence in just a few iterations, with its convergence speed unaffected by the number of quadrotors.
- Auto-Multilift is able to learn adaptive MPC weightings directly from trajectory tracking errors. Additionally, it significantly improves training stability and tracking performance over a state-of-the-art open-loop learning method [1].
- Beyond its improved training ability to learn adaptive MPC weightings, our method can effectively learn an adaptive tension reference, enabling the multilift system to reconfigure itself when traversing through obstacles.
Please make sure that the following packages have already been installed before running the source code.
- CasADi: version 3.5.5 Info: https://web.casadi.org/
- Numpy: version 1.23.0 Info: https://numpy.org/
- Pytorch: version 1.12.0+cu116 Info: https://pytorch.org/
- Matplotlib: version 3.3.0 Info: https://matplotlib.org/
- Python: version 3.9.12 Info: https://www.python.org/
- Scipy: version 1.8.1 Info: https://scipy.org/
- Pandas: version 1.4.2 Info: https://pandas.pydata.org/
- scikit-learn: version 1.0.2 Info: https://scikit-learn.org/stable/whats_new/v1.0.html
First and foremost, the implementation for Auto-Multilift is straightforward to setup. The source code has been comprehensively annotated to facilitate ease of use. To reproduce the simulation results presented in the paper, simply follow the steps outlined below, sequentially, after downloading and decompressing all the necessary folders.
Learning Process of a Large Multilift System With Six Quadrotors and Non-uniform Loas Mass Distribution |
---|
- Open the Python file 'main_distributed_autotuning.py' in the folder 'Source code A'
- Before running, please do the following settings:
- Set the number of quadrotors on line 42 (i.e., the fifth number in the 'uav_para' array).
- Set the load mass value on line 43 (i.e., the first number in the 'load_para' array).
- Set the MPC horizon on line 52 (the default value is 10).
- Set the higher-level loss horizon on line 53 (the default value is 20).
- After completing the above settings, run the file 'main_distributed_autotuning.py'. In the prompted terminal interface, you will be asked to select the control and sensitivity propagation modes.
- In our settings, 's' and 'p' denote 'sequential' and 'parallel' computing, respectively.
- 'c' and 'o' represent 'closed-loop' (our method) and 'open-loop' (the Safe-PDP method [1]) training modes.
- To evaluate the trained model, run the Python file 'main_distributed_autotuning_evaluation.py'
- You can skip Step 4 and evaluate the saved models that were previously trained and employed in the paper. To do so, copy the files that end with '.pt' from the folder 'Previously trained models' (within the folder 'Source code A') to the folder 'trained data' (where the retained models via Step 4 will be saved).
Learning Process of the Same Large Multilift System |
---|
- Open the Python file 'main_distributed_learn_tensionref.py' in the folder 'Source code B'
- Before running, please do the following settings:
- Set the number of quadrotors on line 40 to the same as that used in Section A (i.e., learning of adaptive weightings).
- Set the load mass value on line 43 (i.e., the first number in the 'load_para' array).
- Set the MPC horizon on line 52 (the default value is 10).
- Set the higher-level loss horizon on line 53 (the default value is 20).
- After completing the above settings, run the file 'main_distributed_learn_tensionref.py'. In the prompted terminal interface, you will be asked to select the control and sensitivity propagation modes.
- In our settings, 's' and 'p' denote 'sequential' and 'parallel' computing, respectively.
- To evaluate the trained model, run the Python file 'main_distributed_learn_tensionref_evaluation.py'
- You can skip Step 4 and evaluate the saved models that were previously trained and employed in the paper. To do so, copy the files that end with '.pt' from the folder 'Previously trained models' (within the folder 'Source code B') to the folder 'trained data' (where the retained models via Step 4 will be saved).
If you encounter a bug in your implementation of the code, please do not hesitate to inform me.
- Name: Mr. Bingheng Wang
- Email: wangbingheng@u.nus.edu
[1] Jin, Wanxin and Mou, Shaoshuai and Pappas, George J, "Safe pontryagin differentiable programming", Advances in Neural Information Processing Systems, 34, 16034--16050, 2021