About the provided pretrained models

Question

About the provided pretrained models

Opened this issue 7 months ago · 1 comments

ColinZhou1993 commented 7 months ago

Great work!
Want to know if you could release the pretrained models with different backbones( WHAM (Res)/(HR)) in the paper. I want to run the demo in my own pc in real-time(>30fps) way, but the WHAM(ViT) is time costly.
Thanks a lot!

Answer 1 · 2024-04-18T20:25:20.000Z

Hi Colin, yes I will release the other two backbones as well. But as you can see in our supplementary materials, backbone is not the bottleneck in WHAM. You can consider using lightweighted keypoints detection model instead of ViTPose. The performance will slightly degenerate btw.