Deform2NeRF: Non-Rigid Deformation and 2D–3D Feature Fusion with Cross-Attention for Dynamic Human Reconstruction
Project Page | Paper | Data
Reconstructing dynamic human body models from multi-view videos poses a substantial challenge in the field of 3D computer vision. Currently, the Animatable NeRF method addresses this challenge by mapping observed points from the viewing space to a canonical space. However, this mapping introduces positional shifts in predicted points, resulting in artifacts, particularly in intricate areas. In this paper, we propose an innovative approach called Deform2NeRF that incorporates non-rigid deformation correction and image feature fusion modules into the Animatable NeRF framework to enhance the reconstruction of animatable human models. Firstly, we introduce a non-rigid deformation field network to address the issue of point position shift effectively. This network adeptly corrects positional discrepancies caused by non-rigid deformations. Secondly, we introduce a 2D–3D feature fusion learning module with cross-attention and integrate it with the NeRF network to mitigate artifacts in specific detailed regions. Our experimental results demonstrate that our method significantly improves the PSNR index by approximately 5% compared to representative methods in the field. This remarkable advancement underscores the profound importance of our approach in the domains of new view synthesis and digital human reconstruction.
Our work is based on the improvement of AnimatableNeRF(https://arxiv.org/abs/2105.02872,ICCV). Any questions or discussions are welcomed!
Please see INSTALL.md for manual installation.
Since the license of Human3.6M dataset does not allow us to distribute its data, we cannot release the processed Human3.6M dataset publicly. If someone is interested at the processed data, please email me.
We provide the pretrained models at here.
The command lines for test are recorded in test.sh.
Take the test on S9
as an example.
-
Download the corresponding pretrained models, and put it to
$ROOT/data/trained_model/deform/aninerf_s9p/latest.pth
and$ROOT/data/trained_model/deform/aninerf_s9p_full/latest.pth
. -
Test on training human poses:
python run.py --type evaluate --cfg_file configs/aninerf_s9p.yaml exp_name aninerf_s9p resume True
-
Test on unseen human poses:
python run.py --type evaluate --cfg_file configs/aninerf_s9p.yaml exp_name aninerf_s9p_full resume True aninerf_animation True init_aninerf aninerf_s9p test_novel_pose True
Take the visualization on S9
as an example.
-
Download the corresponding pretrained models, and put it to
$ROOT/data/trained_model/deform/aninerf_s9p/latest.pth
and$ROOT/data/trained_model/deform/aninerf_s9p_full/latest.pth
. -
Visualization:
- Visualize novel views of the 0-th frame
python run.py --type visualize --cfg_file configs/aninerf_s9p.yaml exp_name aninerf_s9p resume True vis_novel_view True begin_ith_frame 0
- Visualize views of dynamic humans with 3-th camera
python run.py --type visualize --cfg_file configs/aninerf_s9p.yaml exp_name aninerf_s9p resume True vis_pose_sequence True test_view "3,"
- Visualize mesh
# generate meshes python run.py --type visualize --cfg_file configs/aninerf_s9p.yaml exp_name aninerf_s9p vis_posed_mesh True
-
The results of visualization are located at
$ROOT/data/novel_view/aninerf_s9p
and$ROOT/data/novel_pose/aninerf_s9p
.
Take the training on S9
as an example. The command lines for training are recorded in train.sh.
-
Train:
# training python train_net.py --cfg_file configs/aninerf_s9p.yaml exp_name aninerf_s9p resume False # training the blend weight fields of unseen human poses python train_net.py --cfg_file configs/aninerf_s9p.yaml exp_name aninerf_s9p_full resume False aninerf_animation True init_aninerf aninerf_s9p
-
Tensorboard:
tensorboard --logdir data/record/deform
If someone wants to download the ZJU-Mocap dataset, please fill in the agreement, and email me (pengsida@zju.edu.cn) and cc Xiaowei Zhou (xwzhou@zju.edu.cn) to request the download link.
The command lines for test are recorded in test.sh.
Take the test on 313
as an example.
-
Download the corresponding pretrained models, and put it to
$ROOT/data/trained_model/deform/aninerf_313/latest.pth
and$ROOT/data/trained_model/deform/aninerf_313_full/latest.pth
. -
Test on training human poses:
python run.py --type evaluate --cfg_file configs/aninerf_313.yaml exp_name aninerf_313 resume True
-
Test on unseen human poses:
python run.py --type evaluate --cfg_file configs/aninerf_313.yaml exp_name aninerf_313_full resume True aninerf_animation True init_aninerf aninerf_313 test_novel_pose True
Take the visualization on 313
as an example.
-
Download the corresponding pretrained models, and put it to
$ROOT/data/trained_model/deform/aninerf_313/latest.pth
and$ROOT/data/trained_model/deform/aninerf_313_full/latest.pth
. -
Visualization:
- Visualize novel views of the 0-th frame
python run.py --type visualize --cfg_file configs/aninerf_313.yaml exp_name aninerf_313 resume True vis_novel_view True begin_ith_frame 0
- Visualize views of dynamic humans with 0-th camera
python run.py --type visualize --cfg_file configs/aninerf_313.yaml exp_name aninerf_313 resume True vis_pose_sequence True test_view "0,"
- Visualize mesh
# generate meshes python run.py --type visualize --cfg_file configs/aninerf_313.yaml exp_name aninerf_313 vis_posed_mesh True
-
The results of visualization are located at
$ROOT/data/novel_view/aninerf_313
and$ROOT/data/novel_pose/aninerf_313
.
Take the training on 313
as an example. The command lines for training are recorded in train.sh.
-
Train:
# training python train_net.py --cfg_file configs/aninerf_313.yaml exp_name aninerf_313 resume False # training the blend weight fields of unseen human poses python train_net.py --cfg_file configs/aninerf_313.yaml exp_name aninerf_313_full resume False aninerf_animation True init_aninerf aninerf_313
-
Tensorboard:
tensorboard --logdir data/record/deform
If you find this code useful for your research, please use the following BibTeX entry.
@article{xie2023deform2nerf,
title={Deform2NeRF: Non-Rigid Deformation and 2D--3D Feature Fusion with Cross-Attention for Dynamic Human Reconstruction},
author={Xie, Xiaolong and Guo, Xusheng and Li, Wei and Liu, Jie and Xu, Jianfeng},
journal={Electronics},
volume={12},
number={21},
pages={4382},
year={2023},
publisher={MDPI}
}
@inproceedings{peng2021animatable,
title={Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies},
author={Peng, Sida and Dong, Junting and Wang, Qianqian and Zhang, Shangzhan and Shuai, Qing and Zhou, Xiaowei and Bao, Hujun},
booktitle={ICCV},
year={2021}
}