ProgMoGen

Programmable Motion Generation for Open-Set Motion Control Tasks (CVPR24)

This is the code for the paper "Programmable Motion Generation for Open-Set Motion Control Tasks" (CVPR24).

arXiv version, project page

Getting started

1. Setup environment

Follow the instructions of PriorMDM (or MDM) to create a conda environment and install the packages needed to run the MDM model.

2. Download necessary data files

The data files required to run the experiments are the same as for MDM:

  • pretrained MDM model: model000475000.pt and args.json, available from the HumanML3D humanml-encoder-512 (best model) checkpoint provided by MDM. Place the folder as progmogen/save/humanml_trans_enc_512.

  • glove: download the files with progmogen/prepare/download_glove.sh and place the directory as progmogen/glove.

cd progmogen
bash prepare/download_glove.sh

  • t2m evaluators: download them with progmogen/prepare/download_t2m_evaluators.sh and place the result as progmogen/t2m.

bash prepare/download_t2m_evaluators.sh

  • body_models: download the SMPL files with progmogen/prepare/download_smpl_files.sh (a folder named smpl) and place it under progmogen/body_models.

bash prepare/download_smpl_files.sh

You can also check the paths in progmogen/config_data.py to verify that all files are placed correctly.
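
As a quick sanity check, the placement can also be verified with a few lines of Python. This helper is hypothetical (not part of the repository); the paths are the ones listed above:

# check_data.py -- hypothetical helper, not part of the repository.
# Run from inside the progmogen directory.
import os

required = [
    "save/humanml_trans_enc_512/model000475000.pt",
    "save/humanml_trans_enc_512/args.json",
    "glove",
    "t2m",
    "body_models/smpl",
]

for path in required:
    print(("ok     " if os.path.exists(path) else "MISSING"), path)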

3. Get HumanML3D data

Place HumanML3D folder under progmogen/dataset as progmogen/dataset/HumanML3D.

a. The easy way (for demo)

HumanML3D - If you only wish to run the demos, motion data is not needed; just prepare Mean.npy and Std.npy:

cp -r my_data/HumanML3D dataset/HumanML3D

b. Full data (text + motion capture for evaluation)

HumanML3D - Follow the instructions in the HumanML3D repository to prepare the motion capture data, then copy the resulting dataset into this repository:

cd ..
git clone https://github.com/EricGuo5513/HumanML3D.git
unzip ./HumanML3D/HumanML3D/texts.zip -d ./HumanML3D/HumanML3D/
cp -r HumanML3D/HumanML3D progmogen/dataset/HumanML3D
cd progmogen

4. Our data for experiments (for evaluation only)

Link (or copy) test_all_id.txt and test_plane_v0_id.txt from progmogen/my_data into progmogen/dataset/HumanML3D. They are the subsets of the test split used in our evaluation.

cd dataset/HumanML3D
ln -s ../../my_data/test_all_id.txt test_all_id.txt
ln -s ../../my_data/test_plane_v0_id.txt test_plane_v0_id.txt
cd ../..

Other data files with texts and constraints for quantitative experiments are provided in progmogen/my_data.

After these steps, the data will be organized as follows:

progmogen
|-- save/humanml_trans_enc_512
|    |-- model000475000.pt
|    |-- args.json
|-- glove
|-- body_models
|    |-- smpl
|-- t2m
|-- dataset
|    |-- humanml_opt.txt
|    |-- t2m_mean.npy
|    |-- t2m_std.npy
|    |-- HumanML3D
|         |-- Mean.npy
|         |-- Std.npy
|         |-- test_all_id.txt
|         |-- test_plane_v0_id.txt
|         |-- new_joint_vecs

TEMOS-master
assets

5. Install blender for visualization (optional)

We use the blender code from the TEMOS project for visualization. Follow its instructions to install blender and the bpy dependencies. In the progmogen/script_demo/*.sh scripts, set blender_app to the path of your blender application and project_dir to the absolute path of this GitHub project.

Project structure

progmogen
  |--diffusion          # DDIM sampling
  |--atomic_lib         # atomic constraint library
  |--tasks              # main programs
  |--eval               # quantitative evaluation code
  |--script_demo        # scripts for the demos shown in the figures
  |--task_configs       # error functions and optimization parameters for the demos
  |--script_eval        # scripts for the evaluations reported in the tables
  |--task_configs_eval  # error functions and optimization parameters for evaluation
  |--script_eval_others # evaluation scripts for other baseline methods
  |--task_configs_eval_others  # error functions and optimization parameters for the baselines
  |--my_data            # texts and constraints for the quantitative experiments
  |--config_data.py     # expected data paths
  ...
TEMOS-master
  |--render_demo_*.py   # main rendering program for each task
  |--temos
       |--render
            |--blender
                 |--render_*.py  # draw scenes

Demo

We provide scripts in progmogen/script_demo for running the examples presented in the paper. Each script generates a motion file gen.npy and fitted SMPL body sequences gen_smpl/gen*_smpl_params.npy.

For visualization, we provide (1) a stick-figure animation gen*_video.gif rendered with matplotlib, and (2) image and video renderings gen_smpl/gen*_smpl_params.png/mp4 produced with blender (scenes and objects are drawn with blender only).

Results are saved to save_fig_dir (under progmogen/results/demo by default); you can change this path in each script.
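
If you want to inspect an output programmatically, the .npy files can be loaded with numpy. A minimal sketch; the directory and index below are illustrative, and the array layout is task-dependent:

# inspect_outputs.py -- hypothetical snippet for inspecting demo outputs.
import numpy as np

# Generated motion; the save_fig_dir used here is illustrative.
gen = np.load("results/demo/gen.npy", allow_pickle=True)
print("gen.npy:", getattr(gen, "shape", type(gen)))

# One of the fitted SMPL sequences (gen*_smpl_params.npy); index 0 is illustrative.
smpl = np.load("results/demo/gen_smpl/gen0_smpl_params.npy", allow_pickle=True)
print("smpl params:", getattr(smpl, "shape", type(smpl)))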

cd progmogen

Motion Control with High-order Dynamics

sh script_demo/run_demo_hod1.sh

Motion Control with Geometric Constraints

sh script_demo/run_demo_geo1_relax.sh
sh script_demo/run_demo_geo2_relax.sh

Human-Scene Interaction

sh script_demo/run_demo_hsi3.sh
sh script_demo/run_demo_hsi4.sh
sh script_demo/run_demo_hsi5.sh

Human-Object Interaction

sh script_demo/run_demo_hoi1.sh
sh script_demo/run_demo_hoi2.sh

Human Self-Contact

sh script_demo/run_demo_hsc1.sh

Physics-based Generation

sh script_demo/run_demo_pbg1.sh
sh script_demo/run_demo_pbg2.sh

Motion programming by GPT

sh script_demo/run_demo_hsi3_gpt.sh
sh script_demo/run_demo_hsi4_gpt.sh

Other examples

sh script_demo/run_demo_dir.sh
sh script_demo/run_demo_or.sh

For any other customized task, write a task_config.py that defines your own f_loss and f_eval, assign appropriate optimization parameters, and pass it via --task_config ${task_config}; a minimal sketch follows.
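
The exact interface is defined by the existing files in progmogen/task_configs; the signatures, joint indexing, and parameter names below are illustrative assumptions:

# my_task_config.py -- a hypothetical sketch, not a file from this repository.
import torch

def f_loss(joints):
    # Constraint error minimized during optimization. Toy example: keep the
    # pelvis (assumed joint 0) no higher than 0.5 m over the whole motion,
    # assuming joints has shape (nframes, njoints, 3) with y up.
    return torch.relu(joints[:, 0, 1] - 0.5).mean()

def f_eval(joints):
    # Evaluation error: report the worst violation of the same constraint.
    return torch.relu(joints[:, 0, 1] - 0.5).max()

# Illustrative optimization parameters.
lr = 0.05
num_iters = 100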

We also implement a simple framework for constraint relaxation; refer to run_demo_geo1_relax.sh and run_demo_geo2_relax.sh for details. A generic sketch of the idea is shown below.
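
In generic terms, relaxation replaces a hard constraint with a penalty that only activates beyond a tolerance, so nearly-feasible motions are not over-penalized. A hinge-style sketch of the idea, not the repository's exact implementation:

import torch

def f_loss_relaxed(joints, eps=0.05):
    # Relaxed version of the toy constraint above: violations smaller than
    # the tolerance eps incur no penalty.
    g = joints[:, 0, 1] - 0.5
    return torch.relu(g - eps).mean()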

Evaluation

We provide scripts in progmogen/script_eval for quantitative evaluation on the pre-defined tasks presented in the paper. Each script generates gen.npy and computes metrics using the evaluation code in progmogen/eval.

Results are saved to save_fig_dir (under progmogen/results/eval by default); you can change this path in each script.

cd progmogen
sh script_eval/eval_task_hsi1.sh
sh script_eval/eval_task_hsi3.sh
sh script_eval/eval_task_geo1_relax.sh
sh script_eval/eval_task_hoi1_relax.sh
sh script_eval/eval_task_hsi2.sh

(Since the optimization for each sample takes several minutes, we run generation only once per sample to reduce test time when calculating the evaluation metrics. A set of text prompts and corresponding constraints is pre-defined and provided in progmogen/my_data. Also, since the FID score is sensitive to which ground-truth samples are selected for computing its statistics, we provide code to calculate an average FID by sampling ground-truth motions multiple times.)

sh script_eval/eval_task_hsi1_fid_nruns.sh <gen_npy_file>
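
For reference, FID here is the standard Fréchet distance between Gaussian fits of the generated and ground-truth feature sets. The sketch below (illustrative only; the repository's metric code lives in progmogen/eval) shows the formula and the multiple-sampling average mentioned above:

# fid_sketch.py -- illustrative only; the actual metric code is in progmogen/eval.
# FID = ||mu1 - mu2||^2 + Tr(S1 + S2 - 2 (S1 S2)^(1/2)) for feature sets
# treated as Gaussians with means mu and covariances S.
import numpy as np
from scipy import linalg

def fid(feats_gen, feats_gt):
    mu1, mu2 = feats_gen.mean(axis=0), feats_gt.mean(axis=0)
    s1 = np.cov(feats_gen, rowvar=False)
    s2 = np.cov(feats_gt, rowvar=False)
    covmean = linalg.sqrtm(s1.dot(s2))
    if np.iscomplexobj(covmean):
        covmean = covmean.real  # discard tiny imaginary parts from sqrtm
    diff = mu1 - mu2
    return diff.dot(diff) + np.trace(s1 + s2 - 2.0 * covmean)

def fid_nruns(feats_gen, feats_gt, n_runs=10, subset=1000, seed=0):
    # Average FID over several random ground-truth subsets to reduce the
    # sensitivity to the selected ground-truth samples.
    rng = np.random.default_rng(seed)
    scores = []
    for _ in range(n_runs):
        idx = rng.choice(len(feats_gt), size=min(subset, len(feats_gt)), replace=False)
        scores.append(fid(feats_gen, feats_gt[idx]))
    return float(np.mean(scores))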

Scripts for some other baseline methods are also provided in progmogen/script_eval_others.

Results of LLM programming evaluated in the supplementary material are provided in assets/GPT_programming.

Acknowledgements

Our code builds heavily on PriorMDM, MDM, TEMOS, GMD, and HumanML3D. We thank the authors for kindly releasing their code.

Bibtex

If you find this code useful in your research, please consider citing:

@inproceedings{liu2024programmable,
  title={Programmable Motion Generation for Open-Set Motion Control Tasks},
  author={Liu, Hanchao and Zhan, Xiaohang and Huang, Shaoli and Mu, Tai-Jiang and Shan, Ying},
  booktitle={CVPR},
  year={2024}
}

License

This code is distributed under the MIT license.

Note that our code depends on other libraries (including CLIP, SMPL, SMPL-X, and PyTorch3D) and uses datasets, each of which has its own license that must also be followed.