yohanshin/WHAM

About the provided pretrained models

Opened this issue · 1 comments

Great work!
Want to know if you could release the pretrained models with different backbones( WHAM (Res)/(HR)) in the paper. I want to run the demo in my own pc in real-time(>30fps) way, but the WHAM(ViT) is time costly.
Thanks a lot!

Hi Colin, yes I will release the other two backbones as well. But as you can see in our supplementary materials, backbone is not the bottleneck in WHAM. You can consider using lightweighted keypoints detection model instead of ViTPose. The performance will slightly degenerate btw.