Roadmap of MMPose

Question

Roadmap of MMPose

hellock opened this issue 4 years ago · 76 comments

We keep this issue open to collect feature requests from users and hear your voice. Our monthly release plan is also available here.

You can either:

Suggest a new feature by leaving a comment.
Vote for a feature request with 👍 or be against with 👎. (Remember that developers are busy and cannot respond to all feature requests, so vote for your most favorable one!)
Tell us that you would like to help implement one of the features in the list or review the PRs. (This is the greatest things to hear about!)

Answer 1 · 2020-07-15T12:07:48.000Z

i hope that MMPose can support 21 hand landmark detetion, thanks

Answer 2 · 2020-07-22T09:00:56.000Z

i hope that MMPose can support 21 hand landmark detetion, thanks

Good suggestions! We will add this feature in our TODO list. Thank you.

Answer 3 · 2020-07-23T06:15:07.000Z

TODO List (continuously updated... [last edit: 2023.1.14]) :
Here is a collection of feature requests.
Items that have already been implemented in MMPose will be removed from the list.

More popular backbones

ConvNeXt

Add more popular datasets:

More 2d human pose estimation method.

More 2d face alignment algorithms.
More 3d human pose algorithms.

Support 2d video pose estimation and tracking

Support Vehicle pose estimation

Add 3D Pose Consistency Benchmark #828
Mano based hand keypoints detection
Depth-based 3d hand pose estimation

A2J

Multi-view 3d pose estimation

Support memonger
Support Pytorch AMP training #339
Hyperparameter tuner Optuna
Support Unity plugin
print loss during evaluation. #333
Quantization Aware Training #359
Easier Usage (API)
Export to Torchscript #576

Answer 4 · 2020-07-26T05:51:51.000Z

Would you mind add #31 (comment) to the TODO list.

Answer 5 · 2020-07-26T12:56:43.000Z

Would you mind add #31 (comment) to the TODO list.

Sure.

Answer 6 · 2020-07-30T02:50:16.000Z

Speed up inference #40

Answer 7 · 2020-08-16T06:20:16.000Z

Support video pose estimation #67

Answer 8 · 2020-08-16T21:10:12.000Z

Would be great to add support for whole body pose estimation dataset (body+face+hands) via COCO-WholeBody

Answer 9 · 2020-08-26T08:46:31.000Z

Also add support for MPII in mmdetection.

Answer 10 · 2020-08-26T21:26:06.000Z

It would be great to add support for pose tracking dataset i.e. posetrack2017/2018.

Answer 11 · 2020-08-28T07:13:06.000Z

Support to convert pytorch model to onnx by the way.Thx！

Answer 12 · 2020-08-31T13:16:36.000Z

@OasisYang could you elaborate? is loading the data and processing on a frame basis is enough, or you want the tracking part also?

@flynnamy sounds like a request for a general tool. maybe we can provide such tools for the whole mm-series (just saying, not a confirmation).

Answer 13 · 2020-09-02T06:20:53.000Z

@innerlee If possible, adding both data loading and tracking part would be great. However, the tracking part seems a little bit complicate and always comes with some extra modules. Maybe, the first step is basically to support the data loading and processing. Thanks

@OasisYang could you elaborate? is loading the data and processing on a frame basis is enough, or you want the tracking part also?

@flynnamy sounds like a request for a general tool. maybe we can provide such tools for the whole mm-series (just saying, not a confirmation).

Answer 14 · 2020-09-02T10:03:14.000Z

Support ShuffleNet V2 & MobileNet V3 backbones. #94

Answer 15 · 2020-09-02T13:49:13.000Z

Support for InterHand2.6

Answer 16 · 2020-09-23T20:10:35.000Z

Add Yolov4 and OpenPose

Answer 17 · 2020-10-03T21:45:29.000Z

Please make it possible to obtain estimated heatmaps from methods

Answer 18 · 2020-10-05T02:48:06.000Z

@hamedcan could you explain more about the usage? do you want a visualizer of heatmaps during training, or a visualization tool for demo, or anything else?

Answer 19 · 2020-10-05T13:50:16.000Z

Bottom up for MPII dataset?

Answer 20 · 2020-10-05T14:44:10.000Z

@innerlee First, I really want to thank you for the MMPose. It really helped me. I want to compare different models' performance on hard poses. So I need to be able to observe generated heatmaps. I want a visualization tool for demo.

Answer 21 · 2020-10-23T08:46:18.000Z

support memonger : https://github.com/Lyken17/pytorch-memonger

Answer 22 · 2020-10-30T06:56:06.000Z

Support multi-head networks #219

Answer 23 · 2020-11-03T03:47:53.000Z

Please support mpii_trb demo and mpi_inf_3dhp datasets!

Answer 24 · 2020-11-27T08:39:33.000Z

Support 3d hand keypoint estimation!!!!!!

Answer 25 · 2020-12-04T08:57:52.000Z

Support log info when dataset is tinty, #333

Answer 26 · 2020-12-09T04:23:46.000Z

Support PyTorch AMP training, thanks. #339

Answer 27 · 2020-12-09T06:59:04.000Z

Support GCN-based methods for refining top-down results.
https://arxiv.org/pdf/2003.10506v3.pdf https://github.com/lingtengqiu/OPEC-Net
https://arxiv.org/abs/2007.10599

Answer 28 · 2020-12-10T17:50:19.000Z

Would be great to see the integration of a hyperparameter tuner like Optuna

Answer 29 · 2020-12-14T09:56:25.000Z

A Unity plugin would be amazing to have, using json input data and/or real-time pose estimation with a webcam and seeing it reflected on a 3D model.

Answer 30 · 2020-12-14T09:58:05.000Z

@MaxGodTier do you have experience in developing unity plugin? contributions are welcome :D

Answer 31 · 2020-12-15T04:31:16.000Z

I don't, but a dirty implementation may be possible using an existing repo , it reads pose data from simple text files each representing a single frame , I see two solutions: (1) If pose_results from mmpose were translated into the same format expected from that repo, it will work out of the box without needing to change a single line of code or (2) edit the repo code (C#) to use mmpose rules instead of theirs.

Answer 32 · 2020-12-18T02:58:08.000Z

Quantization Aware Training for models to get the int8 models ,int8 models will greatly improve inference speed #359,thanks

Answer 33 · 2020-12-28T08:20:55.000Z

Support DetTrack and KeyTrack.
http://arxiv.org/abs/2003.13743 & https://arxiv.org/abs/1912.02323

Answer 34 · 2020-12-30T19:14:37.000Z

Albumentations augmetnations similar to mmclassification

Answer 35 · 2021-01-20T07:32:11.000Z

i hope that MMPose can support 3D hand landmarks detetion, thanks

Answer 36 · 2021-01-22T08:23:51.000Z

Does MMPose support Single Person Pose Estimation?
Currently I found only multi-person versions are supported.

Answer 37 · 2021-01-22T09:07:48.000Z

@rhiver single is a case of multi

Answer 38 · 2021-01-22T09:52:54.000Z

@rhiver single is a case of multi

Sort of. But multi-person version has two stages, person detection and pose estimation, which have to infer on two models.
So this method doesn't work for realtime pose estimation in mobile devices since it takes too long on the inference.
MobileNetV2 is good enough for simple pose estimation. But for best FPS, it's better to let it do both single person detection and pose estimation.

Answer 39 · 2021-01-26T04:05:12.000Z

I see it has supported heatmap method of face datasets now. Please support regression method of face dataset!

Answer 40 · 2021-01-26T04:12:53.000Z

I see it has supported heatmap method of face datasets now. Please support regression method of face dataset!

Do you have any recommended papers/codes ?

Answer 41 · 2021-01-26T06:10:04.000Z

I see it has supported heatmap method of face datasets now. Please support regression method of face dataset!

Do you have any recommended papers/codes ?

Yes,wingloss:https://arxiv.org/pdf/1711.06753.pdf, and GCN+softwing loss: https://arxiv.org/pdf/2006.11697.pdf

Answer 42 · 2021-01-28T03:08:48.000Z

Support FashionAI https://tianchi.aliyun.com/competition/entrance/231648/introduction

Answer 43 · 2021-02-06T07:36:02.000Z

I want to use mmpose with the pypi package much more easily than now; such as:

from mmpose import top_down

top_down("darkpose", "COCO_wholebody", video_path="hoge.mp4", output_json_title="hoge") # Analyze hoge.mp4 with COCO wholebody on darkpose and output the result as hoge/hoge000000000000.json, hoge/hoge000000000001.json, hoge/hoge000000000002.json, ....

Answer 44 · 2021-02-10T13:35:08.000Z

MPII multi-person dataset for bottom-up methods is really needed!

Answer 45 · 2021-03-22T06:38:52.000Z

hi ,can you add handlandmark filtering algorithm for eliminating handlandmark jittering in videos? thanks

Answer 46 · 2021-04-15T16:03:43.000Z

Adding Vehicle pose estimation to the pipe line using CarFusion dataset. Similar to Occlusion-net, and Apollocar3D.

Answer 47 · 2021-04-16T01:18:16.000Z

Export to Torchscript !

Answer 48 · 2021-04-17T12:14:28.000Z

Lite-HRNet, its already built with mmpose, so including into the main repo should be super simple. Would be amazing if it could work with the pytorch2onnx tool for deployment

Answer 49 · 2021-04-21T18:34:13.000Z

Please support Halpe data set: https://github.com/Fang-Haoshu/Halpe-FullBody

It has 3 useful points in addition to the COCO-WholeBody.

Answer 50 · 2021-04-24T16:53:39.000Z

Hi everyone,

I intend to create my own keypoints dataset with 3 points of interest (two endpoints and one center point). Can anyone kindly help me on how I can create annotations to be loaded into mmpose? Because I believe that the repo is based on mmcv, how can I get my own dataloader?
Any help in this regard will be highly appreciated.
Thank you

Answer 51 · 2021-05-29T05:38:41.000Z

Support 3dpw dataset #682

Answer 52 · 2021-07-24T00:26:08.000Z

Do you have any plans for the mano based hand keypoints detection? Also optimization with the IK loss

Answer 53 · 2021-07-31T07:19:26.000Z

Add 3D Pose Consistency Benchmark - #828

Answer 54 · 2021-10-13T00:27:43.000Z

Add https://github.com/mks0601/3DMPPE_POSENET_RELEASE into MMPose

Answer 55 · 2021-11-06T11:13:27.000Z

It would be nice to add "PoseFormer". It based on VideoPose3D, which already supported.

Answer 56 · 2021-11-24T13:04:46.000Z

It would be nice to add "CenterNet". it is a bottom up based 2d human pose estimation method and it groups keypoints of one person by combine regression and heatmap of keypoints which is quite different from associated embedding and affinity fields

Answer 57 · 2021-12-18T06:09:57.000Z

Blog:Next-Generation Pose Detection with MoveNet and TensorFlow.js 这里有movenet的简单介绍，https://storage.googleapis.com/movenet/MoveNet.SinglePose%20Model%20Card.pdf

Answer 58 · 2021-12-24T09:01:41.000Z

Add RLE into MMPose

Answer 59 · 2022-01-20T09:35:05.000Z

Background : 3d pose estimation (with video generation) with a high number of people (es: official video, minute 00:19 sec, but with a lot of people

Result video: the original video is put on the top-left, with the subsuquent 3d pose of the people on the right. If there are a lot of people, the final video has strange resolution (i.e 6000x400) because every people detected is on put on the same row.

What could be improve: split the people 3d pose visualization into multiple row

Answer 60 · 2022-02-04T23:59:00.000Z

It would be great to have a 'score_per_joint' option in test_cfg in order to output one score per joint, instead of having only a global score for the pose, my use case is related to associative embedding

Answer 61 · 2022-02-17T02:09:41.000Z

update Interhand2.6M dataset which contains MANO hand mesh parameters.......

Answer 62 · 2022-04-11T11:24:15.000Z

It would be nice to have Depth-Based 3D Hand Pose Estimation methods like A2J.

Answer 63 · 2022-04-21T09:50:57.000Z

It would be great to have SmoothNet trained on 3DPW and AIST++ :)

Answer 64 · 2022-04-24T05:58:01.000Z

It would be nice to add SmoothNet training code about pose estimation, hoping it could easily retrain on my own dataset.

Answer 65 · 2022-05-16T07:54:34.000Z

3D Human Mesh
frankmocap

Answer 66 · 2022-05-16T12:33:18.000Z

3D Human Mesh frankmocap

Thanks for your feedback. 3D human mesh recovery is no longer supported in MMPose. We have MMHuman3D for this task and you are welcome to submit an issue there about your request.

Answer 67 · 2022-05-21T14:50:48.000Z

It would be so helpful for better analysis if AP for each type of body joints are printed, for example 17 AP value for 17 kinds of body joints are given when inferencing a model in MS COCO body-keypoint dataset.

Answer 68 · 2022-05-25T09:22:00.000Z

Will be really helpful to implement MIPNet into mmpose:

It is particularly useful to tackle data where there are crowded/highly occluded humans. Was previously the SOTA on OCHuman before ViTPose came along. Within the realms of convnets, it should still be the SOTA, and it seems like the idea is general enough to be applied to different types of backbones.

Answer 69 · 2022-05-25T09:24:19.000Z

Also similar to #1389 request, will be nice to integrate ViTPose into mmpose. ViTPose is already implemented in mmpose, so I expect integration to be much easier 😄

Answer 70 · 2022-05-25T15:02:54.000Z

It would be so helpful for better analysis if AP for each type of body joints are printed, for example 17 AP value for 17 kinds of body joints are given when inferencing a model in MS COCO body-keypoint dataset.

@yshMars
Already supported in #1170

Answer 71 · 2022-06-07T08:38:04.000Z

It would be nice to support ConvNeXt backbones. It is a very simple model that is purely convolutional. They can serve as a drop-in replacement for ResNet or Swin Transformer architectures. ImageNet-22k pretrained ConvNeXt variants are considered state-of-the-art in this regime.

Official code: https://github.com/facebookresearch/ConvNeXt
ConvNeXt was also implemented in the mmsegmentation and mmdetection libraries.

Thanks!

Answer 72 · 2022-09-09T02:40:40.000Z

It would be nice to have Poseaug augmentation pipeline for 3d pose estimation.
official code: [(https://github.com/jfzhang95/PoseAug)]

Thanks!

Answer 73 · 2022-11-04T16:26:27.000Z

Would like to have Tracing support for mmPose models.
I have been able to successfully use 'torch2torchscript' under mmdeploy.apis to trace mmSegmentation Models. However, using the same on mmPose (triedconfigs under dekr and associative_embedding) with the following mmdeploy config: '\mmdeploy\configs\mmpose\pose-detection_torchscript.py' would throw the following error:

File "mmdetection\mmpose\mmpose\datasets\pipelines\shared_transform.py", line 176, in call
meta[key_tgt] = results[key_src]

KeyError: 'flip_index'

Would really appreciate having this feature! Thanks

Answer 74 · 2022-12-13T08:18:28.000Z

Can SCRFD be added to mmpose?
https://github.com/deepinsight/insightface/tree/master/detection/scrfd

Answer 75 · 2022-12-13T09:20:15.000Z

Can SCRFD be added to mmpose? https://github.com/deepinsight/insightface/tree/master/detection/scrfd

It seems that SCRFD is for face detection. MMPose will focus on pose estimation/keypoint detection. Maybe it is more appropriate to support it in mmdet.

Answer 76 · 2023-02-06T04:40:51.000Z

Are you still woking on the openpose which has list on the ROADMAP for years?

Thanks