Haokai Pang† · Heming Zhu† · Adam Kortylewski · Christian Theobalt · Marc Habermann‡
†Joint first authors.
‡Corresponding author.
2024-6-14 The Training Code, and the Data Processing Code is available! 🎆🎆🎆
2024-3-29 The initial release, i.e., the Demo Code is available. The Training Code is on the way. For more details, pleaase check out the project page😃.
git clone git@github.com:kv2000/ASH.git --recursive
cd ./submodules/diff-gaussian-rasterization/
git submodule update --init --recursive
The code is tested on Python 3.9
, pytorch 1.12.1
, and cuda 11.3
.
Firstly, install the underlying clothed human body model, 🎆DeepCharacters Pytorch🎆, which also consists the dependencies that needed for this repo.
Then, setup the submodules for 3D Gaussian Splatting.
# the env with DeepCharacters Pytorch
conda activate mpiiddc
# 3DGS go
cd ./submodules/diff-gaussian-rasterization/
python setup.py install
cd ../simple-knn/
python setup.py install
You may find the metadata and the checkpoints from this link.
The extracted metadata and checkpoints follows folder structure below
# for the checkpoints
checkpoints
|--- Subject0001
|---deformable_character_checkpoint.pth # character checkpoints
|---gaussian_checkpoints.tar # gaussian checkpoints
# for the meta data
meta_data
|--- Subject0001
|---skeletoolToGTPose # training poses
| |--- ...
|
|---skeletoolToGTPoseTest # Testing poses
| |--- ...
|
|---skeletoolToGTPoseRetarget # Retartget another subject's pose
| |--- ...
|
|--- ... # Others
Run the following and the results will be stored in ./dump_results/
by default.
bash run_inference.sh
- Download the compressed raw data from from this link in to
./raw_data/
. - Decompress the data with
tar -xzvf Subject0022.tar.gz
- Run the (slurm) bash script
./process_video/bash_get_image.sh
that extracts the masked images from the raw RGB videoes and the foreground mask videoes . The provided script supports parallel the image extraction with slurm job arrays.
Run the following and the results will be stored in ./dump_results/
by default.
bash run_train.sh
The folder structure for the training is as follows:
# for the meta data
dump_results
|--- Subject0022
|---cached_files # The precomputed character related
| |--- cached_fin_rotation_quad.pkl
| |--- cached_fin_translation_quad.pkl
| |--- cached_joints.pkl
| |--- cached_ret_canonical_delta.pkl
| |--- cached_ret_posed_delta.pkl
| |--- cached_temp_vert_normal.pkl
|
|---checkpoints
| |--- ...
|
|---exp_stats # Tensorboard Logs
| |--- ...
|
|---validations_fine # Validationed images every X Frames
Note that at the first time that the training script runs, it will pre-compute and store the character related data, stored in ./dump_results/[Subject Name]/cached_files/
. Which will greatly speed up and reduce the gpu usage of the training process.
Plese check out this issue on some hints on training on your own data, discussion is welcomed :).
- Data processing for Training
- Training Code
If you find our work useful for your research, please, please, please consider citing our paper!
@InProceedings{Pang_2024_CVPR,
author = {Pang, Haokai and Zhu, Heming and Kortylewski, Adam and Theobalt, Christian and Habermann, Marc},
title = {ASH: Animatable Gaussian Splats for Efficient and Photoreal Human Rendering},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2024},
pages = {1165-1175}
}
For questions, clarifications, feel free to get in touch with:
Heming Zhu: hezhu@mpi-inf.mpg.de
Marc Habermann: mhaberma@mpi-inf.mpg.de
Deep Characters Pyotrch is under CC-BY-NC license. The license applies to the pre-trained models and the metadata as well.
Christian Theobalt was supported by ERC Consolidator Grant 4DReply (No.770784). Adam Kortylewski was supported by the German Science Foundation (No.468670075). This project was also supported by the Saarbrucken Research Center for Visual Computing, Interaction, and AI. We would also like to thank Andrea Boscolo Camiletto and Muhammad Hamza Mughal for the efforts/discussion on motion retargeting.
Below are some resources that we benefit from (keep updating):
- 3D gaussian-splatting, bravo for the brilliant representation for real-time and high-quality rendering.
- DeepCharacters Pytorch for the human character model.
- Pytorch3D and Kornia for the handy geometry library.
- NeuS for the project structure D:.