The official implementation of ECCV 2022 paper Structural Triangulation: A Closed-Form Solution to Constrained 3D Human Pose Estiation.
All source files are listed below:
root
|- README.md
|- requirements.txt
|- get_bone_lengths.py
|- config.py
|- structural_triangulation.py
|- test.py
|- utils.py
|- virtual_test.py
`- configs
|- h36m_config.yaml
`- virtual_config.yaml
Functions of the source code are:
get_bone_lengths.py
: The implementation of a simple way to estimate bone lengths with given frame indices: Taking the average over all symmetric bones.config.py
: The interface to get configurationsstructural_triangulation.py
: The main implementation file of Structural Triangulation. Inside there is a tree structure class (DictTree
) and an estimation function (Pose3D_estimate
).test.py
: The test file for public datasets.utils.py
: Implementations of some basic functions.virtual_test.py
: The test file for virtual tests.- Files under
configs
dir are configurations files used in experiments. You may modify specific terms according to your purpose.
The solution is of closed form, so only some basic scientific calculation packages (numpy
, scipy
), and visualization packages (matplotlib
, tqdm
) are needed for the basic implementation. Note that Python 3.6+ is needed to make formatted strings work. But now the cuda version is implemented, so PyTorch is needed if you want it to work out-of-the-box. Besides the following command, you will need to follow the official guide if you want GPU support.
pip install -r requirements.txt
Pose3D_inference(...)
in structural_triangulation.py
is the key function which implements the main method in our work. This function takes one frame of 2D poses, along with camera matrices, bone lengths, etc., as input, and produces the optimal 3D pose of the current frame. Besides the closed-form Structural Triangulation combining with Step Constraint Method, implementation of Lagrangian Algorithm is also provided as a baseline. The methods are selected by a string parameter in Pose3D_inference(...)
.
- Note:
structural_triangulation_torch
is the parallelized implementation using PyTorch, and only support Structural Triangulation.
Actually, Structural Triangulation is as simple as just a triangulation method, these test files are more likely to be sample code than official ones, since it requires 2D estimations to be ready. You may test this method however you like, as long as proper variables are passed in the functions.
If you're willing to use our code to reproduce the results in the paper, here are the instructions:
For tests on Human3.6M Dataset, pre-processing and 2D backbone are provided by this model. We made some modifications to dump 2D estimations and ground truth labels to a pickle file, you may download it from here.
After that, make a directory named data
and place the file in it, so that your local directory looks like this:
root
|- data
| `- detected_data.pkl
...
Then, run get_bone_lengths.py
to get bone lengths.
python get_bone_lengths.py
You will see npy files generated under data/bone_lengths/h36m
dir:
root
|- data
| |- detected_data.pkl
| `- bone_lengths/h36m
| |- S9_bl_estimated.npy
... ...
Here, *_estimated.npy
files contain results estimated from linear triangulation result of T-poses; *_gt.npy
files contain that from ground truth.
With data ready, running test is very simple:
python test.py --cfg configs/h36m_config.yaml
If you want to test the cuda implementation with particular batch size, like 4, just use:
python test.py --cfg configs/h36m_config.yaml --cuda --batch-size 4
The result will be dumped to corresponding directory under log
once the test is finished. You can modify configurations in configs/h36m_config.yaml
.
Virtual test needs only the 3D ground truth as 2D estimations are generated. Just prepare data according to the previous section. Then run
python virtual_test.py --cfg configs/virtual_config.yaml
You will see results in csv
format under vir_result
folder. To specify camera numbers and 2D estimation errors, modify configs/virtual_config.yaml
.
Implement the functions to
-
dump experimental results to local storage.
-
specify parameters in command arguments;
-
process data in batches using GPU.
If you use our code, please cite with:
@inproceedings{Chen2022ECCV,
title={Structural Triangulation: A Closed-Form Solution to Constrained 3D Human Pose Estiation},
author={Chen, Zhuo and Zhao, Xu and Wan, Xiaoyue},
booktitle = {European Conference on Computer Vision (ECCV)},
year={2022}
}