This is a unofficial implementation of PIFuhd
PIFuHD: Multi-Level Pixel-Aligned Implicit Function forHigh-Resolution 3D Human Digitization(CVPR2020)
- Training Coarse PIFuhd
- Training Fine PIFuhd
- Inference
- metrics(P2S, Normal, Chamfer)
- Gan generates front normal and back normal (Link)
- Unsigned distance field and signed distance filed
Note that the pipeline I design do not consider normal map generated by pix2pixHD because it is Not main difficulty we reimplement PIFuHD.
- PyTorch>=1.6
- json
- PIL
- skimage
- tqdm
- cv2
- trimesh with pyembree
- pyexr
- PyOpenGL
- freeglut (use
sudo apt-get install freeglut3-dev
for ubuntu users) - (optional) egl related packages for rendering with headless machines. (use
apt install libgl1-mesa-dri libegl1-mesa libgbm1
for ubuntu users) - face3d
We use Render People as our datasets but the data size is 296 (270 for training while 26 for testing) which is less than paper said 500.
Note that we are unable to release the full training data due to the restriction of commertial scans.
I modified part codes in PIFu (branch: PIFu-modify, and download it into your project) in order to could process dirs where your model save
bash ./scripts/process_obj.sh [--dir_models_path]
#e.g. bash ./scripts/process_obj.sh ../Garment/render_people_train/
I modified part codes in PIFu in order to could process dirs where your model save
python -m apps.render_data -i [--dir_models_path] -o [--save_processed_models_path] -s 1024 [Optional: -e]
#-e means use GPU rendering
#e.g.python -m apps.render_data -i ../Garment/render_people_train/ -o ../Garment/render_gen_1024_train/ -s 1024 -e
Rendering front and back normal map In Current Project
All config params is set in ./configs/PIFuhd_Render_People_HG_coarse.py, bash ./scripts/generate.sh
# the params you could modify from ./configs/PIFuhd_Render_People_HG_normal_map.py
# the import params here is
# e.g. input_dir = '../Garment/render_gen_1024_train/' and cache= "../Garment/cache/render_gen_1024/rp_train/"
# inpud_dir means output render_gen_1024_train
# cache means where save intermediate results like sample points from mesh
After processing all datasets, Tree-Structured Directory looks like following:
render_gen_1024_train/
├── rp_aaron_posed_004_BLD
│ ├── GEO
│ ├── MASK
│ ├── PARAM
│ ├── RENDER
│ ├── RENDER_NORMAL
│ ├── UV_MASK
│ ├── UV_NORMAL
│ ├── UV_POS
│ ├── UV_RENDER
│ └── val.txt
├── rp_aaron_posed_005_BLD
....
All config params is set in ./configs/PIFuhd_Render_People_HG_coarse.py, Where you could modify all you want.
Note that this project I designed is friend, which means you could easily replace origin backbone, head by yours :)
bash ./scripts/train_pfhd_coarse.sh
the same as coarse PIFuhd, all config params is set in ./configs/PIFuhd_Render_People_HG_fine.py,
bash ./scripts/train_pfhd_fine.sh
**If you meet memory problems about GPUs, pls reduce batch_size in ./config/*.py **
bash ./scripts/test_pfhd_coarse.sh
#or
bash ./scripts/test_pfhd_fine.sh
the results will be saved into checkpoints/PIFuhd_Render_People_HG_[coarse/fine]/gallery/test/model_name/*.obj, then you could use meshlab to view the generate models.
export MESA_GL_VERSION_OVERRIDE=3.3
# eval coarse-pifuhd
python ./tools/eval_pifu.py --config ./configs/PIFuhd_Render_People_HG_coarse.py
# eval fine-pifuhd
python ./tools/eval_pifu.py --config ./configs/PIFuhd_Render_People_HG_fine.py
We provide the pretrained models of PIFuhd(fine-pifuhd, coarse-pifuhd)
Note that the training models use front or back normal map rendered from mesh instead of being obtained by GANs. Therefore you need render the normal map of test obj
we provide rendering code using free models in RenderPeople. This tutorial uses rp_dennis_posed_004
model. Please download the model from this link and unzip the content. Use following command to reconstruct the model:
I provide bool params(debug in all of config files) to you to check whether your points sampled from mesh is right. There are examples:
As following show, left is input image, mid is the results of coarse-pifuhd, right is fine-pifuhd
Note that our training datasets are less than official one(270 for our while 450 for paper) resulting in the performance changes in some degree
IoU | ACC | recall | P2S | Normal | Chamfer | |
---|---|---|---|---|---|---|
PIFu | 0.748 | 0.880 | 0.856 | 1.801 | 0.1446 | 2.00 |
Coarse-PIFuhd(+Front and back normal) | 0.865(5cm) | 0.931(5cm) | 0.923(5cm) | 1.242 | 0.1205 | 1.4015 |
Fine-PIFuhd(+Front and back normal) | 0.813(3cm) | 0.896(3cm) | 0.904(3cm) | - | 0.1138 | - |
There is an issue why p2s of fine-pifuhd is bit large than coarse-pifuhd. This is because I do not add some post-processing to clean some chaos in reconstruction. However, the details of human mesh produced by fine-pifuhd are obviously better than coarse-pifuhd.
I hope that this project could provide some contributions to our communities, especially for implicit-field.
By the way, If you think the project is helpful to you, pls don’t forget to star this project : )
Monocular Real-Time Volumetric Performance Capture (ECCV 2020) Ruilong Li*, Yuliang Xiu*, Shunsuke Saito, Zeng Huang, Kyle Olszewski, Hao Li
PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization (CVPR 2020) Shunsuke Saito, Tomas Simon, Jason Saragih, Hanbyul Joo
ARCH: Animatable Reconstruction of Clothed Humans (CVPR 2020) Zeng Huang, Yuanlu Xu, Christoph Lassner, Hao Li, Tony Tung
Robust 3D Self-portraits in Seconds (CVPR 2020) Zhe Li, Tao Yu, Chuanyu Pan, Zerong Zheng, Yebin Liu
Learning to Infer Implicit Surfaces without 3d Supervision (NeurIPS 2019) Shichen Liu, Shunsuke Saito, Weikai Chen, Hao Li