/Human-Pose-Estimation-from-RGB

Human Pose Estimation from RGB Camera - The repo

Primary LanguagePython

Human Pose Estimation from RGB Camera - The repo

In recent years, tremendous amount of progress is being made in the field of Human Pose Estimation from RGB Camera, which is an interdisciplinary field that fuses computer vision, deep/machine learning and anatomy. This repo is for my study notes and will be used as a place for triaging new research papers.

Get Involved

To make it a collaborative project, you may add content through pull requests or open an issue to let me know.

How this is sorted

The projects and papers are sorted by season, then by month, then by alphabet. I do this because newer papers are usually more sophisticated than older ones and to make removing duplicates easier. Below the publications are short, memorable bite sized summaries. These summaries should be understandable to someone who is four, and very short.

  • Time Dimension
    • πŸ“· Single-Shot
    • πŸŽ₯ Video/Real-Time
  • Spatial Dimensions
    • πŸšͺ 2D Models
    • πŸ“¦ 3D Models

Projects and papers

Fall 2018

πŸ“·πŸ“¦Deep 3D Human Pose Estimation Under Partial Body Presence (Oct 2018)

My legs have been chopped off, and my head, but tis but a scratch.

πŸ“·πŸ“¦3D Human Pose Estimation Using Stochastic Optimization In Real Time (Oct 2018)

Try again and again, till you get it right. Uses depth based sensors. RTW + Particle Swarms

πŸ“·πŸ“¦Adversarial 3D Human Pose Estimation via Multimodal Depth Supervision (Sep 2018)

Continuation of FBI work, also got multimodal network now. IDK What that means

πŸ“·πŸšͺDeeply Learned Compositional Models for Human Pose Estimation (Sep 2018)

Take the human body, and shove it into code blocks

πŸšͺDense Pose Transfer (Sep 2018)

color in a mannequin, a machine imagines the details, then animates it's paper statue

πŸ“·πŸ“¦3D Ego-Pose Estimation via Imitation Learning (Sep 2018)

Headcam, they use a very complicated ragdoll, also, just walking and running

πŸ“·πŸ“¦3D Human Pose Estimation with Siamese Equivariant Embedding (Sep 2018)

Twins compare their answers after doing their math homework. (Homework refers to the 3d pose estimation)

πŸ“·πŸ“¦Synthetic Occlusion Augmentation with Volumetric Heatmaps for the 2018 ECCV PoseTrack Challenge on 3D Human Pose Estimation (Sep 2018)

They block their face and body with cheap photoshop techniques, then the machine has to "x-ray" through all that.

Summer 2018

πŸ“·πŸ“¦Neural Body Fitting: Unifying Deep Learning and Model-Based Human Pose and Shape Estimation [CODE] (Aug 2018)

The circle of 3D pose estimation. 2d Image -> 2d Color Me Rad guy -> 3d pudgy man -> photograph of said man -> 2d Image

πŸ“·πŸ“¦ Single-Shot Multi-Person 3D Body Pose Estimation From Monocular RGB Input (Aug 2018)

They use a ORPM, whatever that means. And they have some very obviouly green screened images.

πŸ“·πŸ“¦ Rethinking Pose in 3D: Multi-stage Refinement and Recovery for Markerless Motion Capture (Aug 2018)

use lots of cameras to make just one camera better, and do this over and over and over again

πŸ“·πŸ“¦3D Human Pose Estimation with Relational Networks (Jul 2018)

Back bone connected to the shoulder bone, shoulder bone connected to the neck bone...

πŸšͺHuman Pose Estimation with Parsing Induced Learner (Jun 2018)

πŸ“·πŸ“¦FBI-Pose: Towards Bridging the Gap between 2D Images and 3D Human Poses using Forward-or-Backward Information (Jun 2018)

Anderson Silva's broken bent leg

Spring 2018

πŸ“·πŸ“¦DRPose3D: Depth Ranking in 3D Human Pose Estimation (May 2018)

These guys do FBI but without the "crowd sourced" annotations

πŸ“·πŸ“¦It's all Relative: Monocular 3D Human Pose Estimation from Weakly Supervised Data (May 2018)

Crowd sourced relative depth annotations

πŸ“¦BodyNet: Volumetric Inference of 3D Human Body Shapes [CODE] (Apr 2018)

Make a statue of michealangelo in minecraft

πŸ“¦Unsupervised Geometry-Aware Representation for 3D Human Pose Estimation [CODE] (Apr 2018)

It rotates the person with it's eyes. (Insert reference to undressing with eyes here)

πŸŽ₯πŸ“¦ MonoPerfCap: Human Performance Capture from Monocular Video [Project] (Mar 2018)

makes a 3d replica of you like a fully featured action figure

πŸ“¦Learning to Estimate 3D Human Pose and Shape from a Single Color Image (May 2018)

SMPL brand Artist's Mannequin

πŸ“·πŸ“¦ 3D Human Pose Estimation in the Wild by Adversarial Learning (Mar 2018)

πŸŽ₯πŸ“¦ LCR-Net++: Multi-person 2D and 3D Pose Detection in Natural Images [Project] (Mar 2018)

πŸ“·πŸ“¦ Unsupervised Adversarial Learning of 3D Human Pose from 2D Joint Locations [Project page] (Mar 2018)

πŸŽ₯πŸ“¦ End-to-end Recovery of Human Shape and Pose [CODE] (Dec 2017)

πŸ“·πŸ“¦Exploiting temporal information for 3D human pose estimation (Nov 2017)

πŸ“·πŸ“¦ DensePose: Dense Human Pose Estimation In The Wild [CODE] [Project page] (Feb 2018)

Fall 2017

πŸŽ₯πŸšͺ Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [CODE] (Apr 2017)

πŸ“·πŸšͺ Adversarial PoseNet: A Structure-aware Convolutional Network for Human Pose Estimation (May 2017)

πŸ“·πŸ“¦ A simple yet effective baseline for 3d human pose estimation (Aug 2017) [CODE]

πŸŽ₯πŸ“¦ VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera [CODE] [Project] (Jul 2017)

πŸ“·πŸ“¦ Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image (Oct 2017)

πŸ“·πŸ“¦Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation [CODE] (Aug 2017)

2016

πŸ“·πŸ“¦ Learning to Fuse 2D and 3D Image Cues for Monocular Body Pose Estimation (Nov 2016)

πŸ“·πŸ“¦ Monocular 3D Human Pose Estimation In The Wild Using Improved CNN Supervision [Project] (Nov 2016)

πŸ“·πŸ“¦ MoCap-guided Data Augmentation for 3D Pose Estimation in the Wild (Oct 2016)

πŸ“·πŸ“¦3D Human Pose Estimation Using Convolutional Neural Networks with 2D Pose Information (Sep 2016)

πŸ“·πŸ“¦ Keep it SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image (Jul 2016)

πŸ“·πŸšͺ Stacked Hourglass Networks for Human Pose Estimation [CODE] (Mar 2016)

πŸ“·πŸšͺ Convolutional Pose Machines [CODE] (Jan 2016)

2014 & 2015

πŸŽ₯πŸ“¦ Spatio-temporal Matching for Human Pose Estimation [Project] (Dec 2015)

πŸŽ₯πŸ“¦ Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video [Project] (Nov 2015)

DataSets

MS COCO

MPII POSE

Human 3.6M

Human Eva

MPI INF 3DHP

Unite The People

Pose Guided Person Image Generation

A Generative Model of People in Clothing

Deformable GANs for Pose Based Human Image Generatoin

Dense Pose Transfer

Human3.6M

HumanEva

MPI-INF-3DHP

Unite The People

Pose Guided Person Image Generation - [CODE] - Ma, L., Jia, X., Sun, Q., Schiele, B., Tuytelaars, T., & Gool, L.V. (NIPS 2017)

A Generative Model of People in Clothing - Lassner, C., Pons-Moll, G., & Gehler, P.V. (ICCV 2017)

Deformable GANs for Pose-based Human Image Generation - [CODE] - Siarohin, A., Sangineto, E., Lathuilière, S., & Sebe, N. (CVPR 2018)

Dense Pose Transfer - Neverova, N., Guler, R.A., & Kokkinos, I. (ECCV 2018)

Guide

Gesture and Sign Language Recognition with Deep Learning

Human Pose Estimation 101

Bob

Jessie

Awesome

HoreFice

My personal goals:

  • I'd like to find a project I can clone.

  • I'd like to find a recent project.

  • I'd like to find a project with the 3d work done

  • I'd like to find a project that can integrate with SteamVR. (bone locations instead of blobs/meshes)

  1. state of the art 2d pose detector, this is crucial.

  2. This 2d pose detector can return colored limbs corresponding to each body part or heat maps corresponding to joint, or even forward/backwards information from annoation.

  3. Time dependence ideally the network, when doing pixel to 2d pose and 2d pose to 3d map and 3d map to skeleton should take into account the previous frame, and with an internal representation of the boundary conditions of the human body pose (Perhaps a GAN?)

  4. Additionally, there are other gimmicks that can be used, like Siamese network parallelism. Where two shots of the same pose are rewarded for giving the same output. Physics simulations of body mechanics can be used. Reprojection of 3d joints back to 2d geometery using meshes. The 3d pose can be iteratively refined over and over again. Sythetic data created by game data. Additionally there should be 3d pose standarization. Additionally 2d pose should be done well.

Domain Transfer for 3D Pose Estimation from Color Images without Manual Annotations (Oct 2018)

Not relevant, but hand posing is here, so I guess it kinda is

Context-Aware Deep Spatio-Temporal Network for Hand Pose Estimation from Depth Images (Oct 2018)

Not relvant, hand pose estimation

πŸ“·πŸ“¦Cascaded Pyramid Network for 3D Human Pose Estimation Challenge (Oct 2018)

Top down, seems pretty typical, nothing special goin on here

πŸ“·πŸ“¦ Deep Textured 3D Reconstruction of Human Bodies [Project] (Sep 2018) Not relevant. I'll make a replica out of you from soggy clay. Depth training, regular camera tests

**:camera:

Multiview 3D human pose estimation using improved least-squares and LSTM networks (Jul 2018) LSTM

Hierarchical Contextual Refinement Networks for Human Pose Estimation (Oct 2018)

You start from your center and work outwards

Fully Automatic Multi-person Human Motion Capture for VR Applications (Sep 2018)

$$$ Have a party, turn all your friends into skeletons then they leave

Propagating LSTM: 3D Pose Estimation based on Joint Interdependency (Sep 2018)

[Hockey Pose Estimation and Action Recognition using Convolutional Neural Networks to Ice Hockey][https://uwspace.uwaterloo.ca/handle/10012/13835] (Sep 2018)

Human pose estimation method based on single depth image (Sep 2018)

Learning Robust Features and Latent Representations for Single View 3D Pose Estimation of Humans and Objects (Sep 2018)

A Review of Human Pose Estimation from Single Image (Jul 2018)

3D Human pose estimation on Taiji sequence (Jul 2018)

MoCap and a new biomedical dataset!

Human Pose Estimation Based on Deep Neural Network (Jul 2018)

Multi-View CNNs for 3D Hand Pose Estimation (Jul 2018)

Lots a people lookin at your hand

Multiview 3D human pose estimation using improved least-squares and LSTM networks (Jul 2018) LSTM

3-D Reconstruction of Human Body Shape from a Single Commodity Depth Camera (Jun 2018)

Human Pose As Calibration Pattern; 3D Human Pose Estimation With Multiple Unsynchronized and Uncalibrated Cameras (Jun 2018)

Stacked dense-hourglass networks for human pose estimation (Apr 2018)

πŸšͺSimple Baselines for Human Pose Estimation and Tracking [CODE] (Apr 2018)

A generalizable approach for multi-view 3D human pose regression (Apr 2018)

A Deep Learning Based Method For 3D Human Pose Estimation From 2D Fisheye Images (Mar 2018)

A Unified Framework for Multi-View Multi-Class Object Pose Estimation (Mar 2018)

Learning Monocular 3D Human Pose Estimation from Multi-view Images (Mar 2018)

Multi-Scale Structure-Aware Network for Human Pose Estimation (Mar 2018)

Mo2Cap2: Real-time Mobile 3D Motion Capture with a Cap-mounted Fisheye Camera (Mar 2018)

Hierarchical graphical-based human pose estimation via local multi-resolution convolutional neural network (Feb 2018)

Image-based Synthesis for Deep 3D Human Pose Estimation (Feb 2018)

πŸšͺLSTM Pose Machines [CODE] (Dec 2017)

Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB (Dec 2017)

Using a single RGB frame for real time 3D hand pose estimation in the wild (Dec 2017)

πŸ“¦Learning 3D Human Pose from Structure and Motion (Nov 2017)

πŸ“¦Integral Human Pose Regression [CODE] (Nov 2017)

Human Pose Retrieval for Image and Video collections (Oct 2017) A search engine for dancers

πŸšͺHuman Pose Estimation Using Global and Local Normalization (Sep 2017)

πŸšͺLearning Feature Pyramids for Human Pose Estimation [CODE] (Aug 2017)

πŸ“¦Recurrent 3D Pose Sequence Machines (Jul 2017)

πŸšͺSelf Adversarial Training for Human Pose Estimation [CODE1][CODE2] (Jul 2017)

Learning Human Pose Models from Synthesized Data for Robust RGB-D Action Recognition (Jul 2017)

Faster Than Real-time Facial Alignment: A 3D Spatial Transformer Network Approach in Unconstrained Poses (Jul 2017)

[A Dual-Source Approach for 3D Human Pose Estimation from a Single Image] (https://arxiv.org/pdf/1705.02883.pdf) (May 2017)

πŸ“¦Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach [CODE] (Apr 2017)

Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation (Apr 2017)

Forecasting Human Dynamics from Static Images (Apr 2017)

πŸ“¦Compositional Human Pose Regression (Apr 2017)

2D-3D Pose Consistency-based Conditional Random Fields for 3D Human Pose Estimation (Apr 2017)

πŸšͺMulti-context Attention for Human Pose Estimation - [CODE] (Feb 2017)

πŸ“¦Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image (Jan 2017)

πŸšͺTowards Accurate Multi-person Pose Estimation in the Wild [CODE] (Jan 2017)

Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image (Jan 2017)

Learning from Synthetic Humans (Jan 2017)

MonoCap: Monocular Human Motion Capture using a CNN Coupled with a Geometric Prior (Jan 2017)

πŸšͺRMPE: Regional Multi-person Pose Estimation [CODE1][CODE2] (Dec 2016)

πŸ“¦Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose [CODE] (Nov 2016)

πŸšͺRealtime Multi-person 2D Pose Estimation Using Part Affinity Fields [CODE] (Nov 2016)

3D Human Pose Estimation from a Single Image via Distance Matrix Regression (Nov 2016)

Learning camera viewpoint using CNN to improve 3D body pose estimation (Sep 2016)

EgoCap: Egocentric Marker-less Motion Capture with Two Fisheye Cameras (Sep 2016)

πŸ“¦Structured Prediction of 3D Human Pose with Deep Neural Networks (May 2016)

πŸšͺDeeperCut: A Deeper, Stronger, and Faster Multi-Person Pose Estimation Model [CODE1][CODE2] (May 2016)

πŸšͺRecurrent Human Pose Estimation [CODE] (May 2016)

Synthesizing Training Images for Boosting Human 3D Pose Estimation (Apr 2016)

Seeing Invisible Poses: Estimating 3D Body Pose from Egocentric Video - Completely insane and above the scope of science (Mar 2016)

πŸšͺDeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation [CODE] (Nov 2015)

A Dual-Source Approach for 3D Pose Estimation from a Single Image (Sep 2015)

πŸšͺHuman Pose Estimation with Iterative Error Feedback [CODE] (Jul 2015)

πŸšͺFlowing ConvNets for Human Pose Estimation in Videos [CODE] (Jun 2015)

πŸ“¦3D Human Pose Estimation from Monocular Images with Deep Convolutional Neural Network (Nov 2014)

πŸšͺEfficient Object Localization Using Convolutional Networks (Nov 2014)

πŸšͺMoDeep: A Deep Learning Framework Using Motion Features for Human Pose Estimation (Sep 2014)

πŸšͺJoint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation [CODE] (Jun 2014)

πŸšͺLearning Human Pose Estimation Features with Convolutional Networks (Dec 2013)

πŸšͺDeepPose: Human Pose Estimation via Deep Neural Networks (Dec 2013)

Deep 3D Pose Dictionary: 3D Human Pose Estimation from Single RGB Image Using Deep Convolutional Neural Network 3D Hand Pose Tracking from Depth Images using Deep Reinforcement Learning

Human 3D Reconstruction and Identification Using Kinect Sensor (Aug 2018) Low-fi body ID

3D Head Pose Estimation Using Tensor Decomposition and Non-linear Manifold Modeling (Sep 2018) `

` A Data-Driven Approach for 3D Human Body Pose Reconstruction from a Kinect Sensor Accidental Fall Detection Based on Pose Analysis and SVDD