This is the official implementation of FlowNAS. The training and searching code is coming soon.
Existing optical flow estimators usually employ network architectures originally designed for image classification as the encoder to extract per-pixel features. However, because the two tasks differ in nature, architectures designed for image classification may be sub-optimal for flow estimation. To address this issue, we propose a neural architecture search method named FlowNAS that automatically finds a better encoder architecture for the flow estimation task. We first design a suitable search space that includes various convolutional operators and construct a weight-sharing super-network to evaluate candidate architectures efficiently. Then, to train the super-network better, we propose Feature Alignment Distillation, which uses a well-trained flow estimator to guide the training of the super-network. Finally, a resource-constrained evolutionary algorithm is used to find an optimal architecture (i.e., a sub-network). Experimental results show that the discovered architecture, with weights inherited from the super-network, achieves 4.67% F1-all error on KITTI, an 8.4% relative reduction from the RAFT baseline, surpassing the state-of-the-art handcrafted models GMA and AGFlow while reducing model complexity and latency.
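To make the last step of the pipeline more concrete, here is a toy sketch of a resource-constrained evolutionary search. It is not the FlowNAS search code (which has not been released): the search space, the `cost` and `score` functions, and all hyperparameters are hypothetical stand-ins. In a real search, `score` would evaluate a sub-network with weights inherited from the super-network, and `cost` would measure something like FLOPs or latency.

```python
import random

# Hypothetical toy search space: each of NUM_LAYERS slots picks one operator index.
NUM_LAYERS = 6
NUM_CHOICES = 4


def cost(arch):
    """Stand-in resource cost (e.g. FLOPs or latency); here just the sum of choices."""
    return sum(arch)


def score(arch):
    """Stand-in quality score; a real search would evaluate the sub-network instead."""
    return sum((i + 1) * c for i, c in enumerate(arch))


def random_arch(rng, budget):
    """Sample random architectures until one fits within the resource budget."""
    while True:
        arch = [rng.randrange(NUM_CHOICES) for _ in range(NUM_LAYERS)]
        if cost(arch) <= budget:
            return arch


def mutate(arch, rng, prob=0.3):
    """Resample each slot independently with probability `prob`."""
    return [rng.randrange(NUM_CHOICES) if rng.random() < prob else c for c in arch]


def crossover(a, b, rng):
    """Pick each slot from one of the two parents at random."""
    return [rng.choice(pair) for pair in zip(a, b)]


def evolve(budget, population=20, parents=5, generations=10, seed=0):
    rng = random.Random(seed)
    pop = [random_arch(rng, budget) for _ in range(population)]
    for _ in range(generations):
        # Keep the best candidates as parents for the next generation.
        pop.sort(key=score, reverse=True)
        top = pop[:parents]
        children = []
        while len(children) < population - parents:
            if rng.random() < 0.5:
                child = mutate(rng.choice(top), rng)
            else:
                child = crossover(rng.choice(top), rng.choice(top), rng)
            # Resource constraint: discard children that exceed the budget.
            if cost(child) <= budget:
                children.append(child)
        pop = top + children
    return max(pop, key=score)


best = evolve(budget=9)
print(best, "cost:", cost(best), "score:", score(best))
```

Because over-budget children are discarded before they enter the population, every candidate the search returns satisfies the constraint by construction.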
Please follow the official PyTorch instructions to install torch and torchvision.
Other requirements:
matplotlib tensorboard scipy opencv-python tqdm imageio einops
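As a quick sanity check after installation, a small script along these lines can report which of the packages above are still missing. This is only a sketch: the helper name `missing_modules` is our own, and the list uses import names rather than package names (`opencv-python` is imported as `cv2`).

```python
import importlib.util

# Import names for torch, torchvision, and the other requirements listed above.
# Note: the opencv-python package is imported as cv2.
REQUIRED_MODULES = [
    "torch", "torchvision", "matplotlib", "tensorboard",
    "scipy", "cv2", "tqdm", "imageio", "einops",
]


def missing_modules(names):
    """Return the module names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    print("Missing:", ", ".join(missing) if missing else "none")
```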
Please follow the RAFT instructions to download and prepare the datasets.
The final dataset folder should look like this:
├── datasets
    ├── Sintel
        ├── test
        ├── training
    ├── KITTI
        ├── testing
        ├── training
        ├── devkit
    ├── FlyingChairs_release
        ├── data
    ├── FlyingThings3D
        ├── frames_cleanpass
        ├── frames_finalpass
        ├── optical_flow
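Before running anything, it can help to confirm that the folder layout above is in place. The following is a minimal sketch, assuming the datasets live under a `datasets` directory relative to the working directory; the helper name `missing_dirs` is hypothetical and not part of this repository.

```python
from pathlib import Path

# Expected sub-directories for each dataset, taken from the layout above.
EXPECTED_LAYOUT = {
    "Sintel": ["test", "training"],
    "KITTI": ["testing", "training", "devkit"],
    "FlyingChairs_release": ["data"],
    "FlyingThings3D": ["frames_cleanpass", "frames_finalpass", "optical_flow"],
}


def missing_dirs(root="datasets"):
    """Return the expected directories (relative to `root`) that do not exist."""
    root = Path(root)
    missing = []
    for dataset, subdirs in EXPECTED_LAYOUT.items():
        for sub in subdirs:
            path = root / dataset / sub
            if not path.is_dir():
                missing.append(path.relative_to(root).as_posix())
    return missing


if __name__ == "__main__":
    for entry in missing_dirs():
        print("missing:", entry)
```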
You can evaluate a trained model using evaluate_supernet.py:
python evaluate_supernet.py --model <model_path> --dataset <sintel or kitti>
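If you want to script the evaluation, a thin wrapper like the one below can build and run the command above. It is a sketch that uses only the `--model` and `--dataset` flags shown in this README; the function names and the example checkpoint path are hypothetical, and it assumes it is run from the repository root.

```python
import subprocess
import sys

VALID_DATASETS = ("sintel", "kitti")


def build_eval_command(model_path, dataset):
    """Build the evaluate_supernet.py command line for one dataset."""
    if dataset not in VALID_DATASETS:
        raise ValueError(f"dataset must be one of {VALID_DATASETS}, got {dataset!r}")
    return [
        sys.executable, "evaluate_supernet.py",
        "--model", model_path,
        "--dataset", dataset,
    ]


def evaluate(model_path, dataset):
    """Run the evaluation script and raise if it exits with an error."""
    subprocess.run(build_eval_command(model_path, dataset), check=True)


if __name__ == "__main__":
    # Hypothetical usage: python run_eval.py <model_path> <sintel or kitti>
    evaluate(sys.argv[1], sys.argv[2])
```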
| Architecture | Sintel clean (EPE) | Sintel final (EPE) | KITTI (F1-all, %) | Weights |
|---|---|---|---|---|
| RAFT | 1.94 | 3.18 | 5.10 | - |
| FlowNAS-RAFT-S | 1.93 | 3.38 | - | Google_drive |
| FlowNAS-RAFT-K | - | - | 4.67 | Google_drive |
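The 8.4% reduction quoted for KITTI is a relative figure that can be recomputed from the table. The snippet below is a small worked check using the two KITTI values listed above; the function name is our own.

```python
def relative_reduction(baseline, new):
    """Fractional reduction of `new` relative to `baseline` (both are error rates)."""
    return (baseline - new) / baseline


raft_kitti = 5.10      # RAFT F1-all (%) from the table
flownas_kitti = 4.67   # FlowNAS-RAFT-K F1-all (%) from the table

print(f"{relative_reduction(raft_kitti, flownas_kitti):.1%}")  # prints 8.4%
```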
- SuperNet training code (coming soon)
- Searching code (coming soon)
The overall code is based on RAFT and AttentiveNAS. We sincerely thank the authors for open-sourcing their methods.
This project is free for academic research purposes only; commercial use requires authorization. For commercial licensing, please contact wyt@pku.edu.cn.
If FlowNAS is useful or relevant to your research, please cite our paper:
@article{Lin2023,
title={FlowNAS: Neural Architecture Search for Optical Flow Estimation},
author={Lin, Zhiwei and Liang, Tingting and Xiao, Taihong and Wang, Yongtao and Yang, Ming-Hsuan},
journal={International Journal of Computer Vision},
year={2023},
month={Oct},
day={25},
issn={1573-1405},
doi={10.1007/s11263-023-01920-9},
url={https://doi.org/10.1007/s11263-023-01920-9}
}