A Quick Tutorial on Bi-level Optimization

Introduction

Bi-Level Optimization (BLO) is originated from the area of economic game theory and then introduced into the optimization community. BLO is able to handle problems with a hierarchical structure, involving two levels of optimization tasks, where one task is nested inside the other. The standard BLO problem can be formally expressed as

$BLO$

In machine learning and computer vision fields, despite the different motivations and mechanisms, a lot of complex problems, such as hyper-parameter optimization, multi-task and meta learning, neural architecture search, adversarial learning and deep reinforcement learning, actually all contain a series of closely related subproblms. In our recent survey published in TPAMI, named "Investigating Bi-Level Optimization for Learning and Vision from a Unified Perspective: A Survey and Beyond", we uniformly express these complex learning and vision problems from the perspective of BLO. Also we construct a best-response-based single-level reformulation and establish a unified algorithmic framework to understand and formulate mainstream gradient-based BLO methodologies, covering aspects ranging from fundamental automatic differentiation schemes to various accelerations, simplifications, extensions and their convergence and complexity properties. We summarize mainstream gradient-based BLOs and illustrate their intrinsic relationships within our general algorithmic platform. We also discuss the potentials of our unified BLO framework for designing new algorithms and point out some promising directions for future research.

$BLO$

In this website, we first summarize our related progress and references of existing works for a quick look at the current progress. Futhermore, we provide a list of important papers discussed in this survey, corresponding codes, and additional resources on BLOs. We will continuously maintain this website to promote the research in BLO fields.

Our Related Work

Papers

Risheng Liu, Jiaxin Gao, Jin Zhang, Deyu Meng, Zhouchen Lin. Investigating Bi-Level Optimization for Learning and Vision from a Unified Perspective: A Survey and Beyond. IEEE TPAMI 2021. [Paper] [Project Page]
Risheng Liu, Zi Li, Xin Fan, Chenying Zhao, Hao Huang, Zhongxuan Luo. Learning Deformable Image Registration from Optimization: Perspective, Modules, Bilevel Training and Beyond. IEEE TPAMI 2021. [Paper]
Risheng Liu, Long Ma, Jiaao Zhang, Xin Fan, Zhongxuan Luo. Retinex-Inspired Unrolling With Cooperative Prior Architecture Search for Low-Light Image Enhancement. CVPR 2021. [Paper] [Project Page]
Risheng Liu, Yaohua Liu, Shangzhi Zeng, Jin Zhang. Towards Gradient-based Bilevel Optimization with Non-convex Followers and Beyond. NeurIPS 2021 (Spotlight, Acceptance Rate ≤ 3%). [Paper] [Code]
Pan Mu, Zhu Liu, Yaohua Liu, Risheng Liu, Xin Fan. Triple-level Model Inferred Collaborative Network Architecture for Video Deraining. IEEE TIP 2021. [Paper] [Code]
Risheng Liu, Zhu Liu, Jinyuan Liu, Xin Fan. Searching a Hierarchically Aggregated Fusion Architecture for Fast Multi-Modality Image Fusion. ACM MM 2021. [Paper] [Code].
Dian Jin, Long Ma, Risheng Liu, Xin Fan. Bridging the Gap between Low-Light Scenes: Bilevel Learning for Fast Adaptation. ACM MM 2021. [Paper]

Risheng Liu, Xuan Liu, Xiaoming Yuan, Shangzhi Zeng, Jin Zhang. A Value Function-based Interior-point Method for Non-convex Bilevel Optimization. ICML 2021.[Paper][Code]

Yaohua Liu, Risheng Liu. BOML: A Modularized Bilevel Optimization Library in Python for Meta-learning. ICME 2021.[Paper][Code]

Risheng Liu, Pan Mu, Xiaoming Yuan, Shangzhi Zeng, Jin Zhang. A Generic First-Order Algorithmic Framework for Bi-Level Programming Beyond Lower-Level Singleton. ICML 2020. [Paper][Code]

Risheng Liu, Pan Mu, Jian Chen, Xin Fan, Zhongxuan Luo. Investigating Task-driven Latent Feasibility for Nonconvex Image Modeling. IEEE TIP 2020.[Code]

Risheng Liu, Zi Li, Yuxi Zhang, Xin Fan, Zhongxuan Luo. Bi-level Probabilistic Feature Learning for Deformable Image Registration. IJCAI 2020.[Paper] [code]

Bi-level Optimization Methods Toolkits

We have published BOML previously, a modularized Tensorflow-based optimization library that unifies several ML algorithms into a common bilevel optimization framework. Now we integrate more recently proposed algorithms and more compatible applications and release the Pytorch version.

Integrated Algoithms

Parts of Existing Work in Learning and Vision Fields

Gradient-based Optimization

Luca Franceschi, Michele Donini, Paolo Frasconi, Massimiliano Pontil. Forward and Reverse Gradient-Based Hyperparameter Optimization. ICML 2017.
Amirreza Shaban, Ching-An Cheng, Nathan Hatch, Byron Boots. Truncated Back-propagation for Bilevel Optimization. AISTATS 2019.
Hanxiao Liu, Karen Simonyan, Yiming Yang. DARTS: Differentiable Architecture Search. ICLR 2019.
Chelsea Finn, Pieter Abbeel, Sergey Levine. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. ICML 2017.
Alex Nichol, Joshua Achiam, John Schulman. On First-Order Meta-Learning Algorithms. arXiv 2018.
Sebastian Flennerhag, Andrei A. Rusu, Razvan Pascanu, Francesco Visin, Hujun Yin, Raia Hadsell. Meta-Learning with Warped Gradient Descent. ICLR 2020.
Eunbyung Park, Junier B. Oliva. Meta-Curvature. NeurIPS 2019.
Yoonho Lee, Seungjin Choi. Meta-Learning with Adaptive Layerwise Metric and Subspace. arXiv 2018.
Matthew MacKay, Paul Vicol, Jon Lorraine, David Duvenaud, Roger Grosse. Self-tuning Networks: Bilevel Optimization of Hyperparameters Using Structured Best-response Functions. ICLR 2019.
Jonathan Lorraine, David Duvenaud. Stochastic Hyperparameter Optimization through Hypernetworks. arXiv 2018.
Fabian Pedregosa. Hyperparameter Optimization with Approximate Gradient. ICML 2016.
Aravind Rajeswaran, Chelsea Finn, Sham Kakade, Sergey Levine. Meta-Learning with Implicit Gradients. NeurIPS 2019.
Jonathan Lorraine, Paul Vicol, David Duvenaud. Optimizing Millions of Hyperparameters by Implicit Differentiation. AISTATS 2020.

Hyper-parameter Optimization

Luca Franceschi, Michele Donini, Paolo Frasconi, Massimiliano Pontil. Forward and Reverse Gradient-Based Hyperparameter Optimization. ICML 2017.
Amirreza Shaban, Ching-An Cheng, Nathan Hatch, Byron Boots. Truncated Back-propagation for Bilevel Optimization. AISTATS 2019.
Matthew MacKay, Paul Vicol, Jon Lorraine, David Duvenaud, Roger Grosse. Self-tuning networks: Bilevel Optimization of Hyperparameters Using Structured Best-response Functions. ICLR 2019.
Fabian Pedregosa. Hyperparameter Optimization with Approximate Gradient. ICML 2016.
D. Maclaurin, D. Duvenaud, and R. Adams. Gradient-Based Hyperparameter Optimization Through Reversible Learning. PMLR 2015.
Takayuki Okuno, Akiko Takeda, Akihiro Kawana. Hyperparameter Learning via Bilevel Nonsmooth Optimization. arXiv 2018
Ankur Sinha, Tanmay Khandait, Raja Mohanty. A Gradient-based Bilevel Optimization Approach for Tuning Hyperparameters in Machine Learning. arXiv 2020.

Multi-task and Meta-learning

Alex Nichol, Joshua Achiam, John Schulman. On First-Order Meta-Learning Algorithms. arXiv 2018.
Sebastian Flennerhag, Andrei A. Rusu, Razvan Pascanu, Francesco Visin, Hujun Yin, Raia Hadsell. Meta-Learning with Warped Gradient Descent. ICLR 2020.
Aravind Rajeswaran, Chelsea Finn, Sham Kakade, Sergey Levine. Meta-Learning with Implicit Gradients. NIPS 2019.
Luca Franceschi, Paolo Frasconi, Michele Donini, Massimiliano Pontil. A Bridge Between Hyperparameter Optimization and Learning-to-learn, NeurIPS 2017.
Luca Franceschi, Paolo Frasconi, Saverio Salzo, Riccardo Grazzi, Massimilano Pontil. Bilevel Programming for Hyperparameter Optimization and Meta-learning. ICML 2018.
Luca Bertinetto, João F. Henriques, Philip H.S. Torr, Andrea Vedaldi. Meta-learning with Differentiable Closed-form Solvers, ICLR 2019.
Alex Nichol, John Schulman. Reptile: A Scalable Metalearning Algorithm.
Alesiani Francesco, Shujian Yu, Ammar Shaker, and Wenzhe Yin. Towards Interpretable Multi-Task Learning Using Bilevel Programming. ECML PKDD 2020

Citation

If this paper is helpful for your research, please cite our paper:
@article{liu2021investigating,
title={Investigating bi-level optimization for learning and vision from a unified perspective: A survey and beyond},   
author={Liu, Risheng and Gao, Jiaxin and Zhang, Jin and Meng, Deyu and Lin, Zhouchen},   
journal={arXiv preprint arXiv:2101.11517},   
year={2021}
}

LOOP-MATH/BLO