This repository is the official implementation of Pose Recognition with Cascade Transformers. The work proposes two types of cascade Transformers for pose recognition: a two-stage model and a sequential (end-to-end) model.
- Please refer to README.md for detailed usage of the two-stage model variant.
- Please refer to README.md for detailed usage of the sequential (end-to-end) model variant.
For more details, please see Pose Recognition with Cascade Transformers by Ke Li*, Shijie Wang*, Xiang Zhang*, Yifan Xu, Weijian Xu, and Zhuowen Tu.
Code and pretrained models will be released soon.
@misc{li2021pose,
title={Pose Recognition with Cascade Transformers},
author={Ke Li and Shijie Wang and Xiang Zhang and Yifan Xu and Weijian Xu and Zhuowen Tu},
year={2021},
eprint={2104.06976},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
This repository is released under the Apache License 2.0. The full license text is in the LICENSE file.
This project is based on the following open-source repositories, which greatly facilitated our research.
- Thanks to DETR for the implementation of the Detection Transformer
- Thanks to HRNet-Human-Pose-Estimation for the training and evaluation pipeline
- Thanks to HRNet-Image-Classification for the HRNet backbone implementation