[ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
Primary language: Python
License: Apache License 2.0 (Apache-2.0)