/LaserHuman

Official implement for LaserHuman.

Primary LanguagePython

LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment.

Language-guided scene-aware human motion generation has great significance for entertainment and robotics. In response to the limitations of existing datasets, we introduce LaserHuman, which stands out with its inclusion of genuine human motions within 3D environments, unbounded free-form natural language descriptions, a blend of indoor and outdoor scenarios, and dynamic, ever-changing scenes. Diverse modalities of capture data and rich annotations present great opportunities for the research of Scene-Text-to-Motion generation, and can also facilitate the development of real-life applications. Moreover, to generate semantically consistent and physically plausible human motions, we propose a multi-conditional diffusion model, which is simple but effective, achieving state-of-the-art performance on existing datasets.

πŸ’» Train your own models

  1. Prepare the datasets

We release sample of our dataset on Google Drive.

The annotation is saved in pub_datas.pkl for each motion sequence.

Please prepare the dataset as following folder struction:

./LaserHuman_path
  └── data/
      β”œβ”€β”€process    # the dense point cloud 
          β”œβ”€β”€ spst.ply
          β”œβ”€β”€ sistout.ply
          └── ...
      β”œβ”€β”€ pub_data.pkl      
      β”œβ”€β”€ pc_img_folder
          β”œβ”€β”€ 1
              β”œβ”€β”€ pc # store the LiDAR point cloud
          β”œβ”€β”€ 2
              β”œβ”€β”€ pc 
          ...
  └── render_code/
  • data rendering

You can render the motion sequence on the scene:

python render.py --id 10 --pkl_path 'data/pub_datas.pkl' --path_root 'data/'

A window will first appear to let you adjust the camera view, adjust and close it to display the motion result.

  • data instruction

We provide data_tutorial.ipynb to give the detailed instruction of our data.

License

All datasets are published under the Creative Commons Attribution-NonCommercial-ShareAlike. This means that you must attribute the work in the manner specified by the authors, you may not use this work for commercial purposes and if you alter, transform, or build upon this work, you may distribute the resulting work only under the same license.

Citation

@misc{cong2024laserhuman,
     title={LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment}, 
     author={Peishan Cong and Ziyi Wang and Zhiyang Dou and Yiming Ren and Wei Yin and Kai Cheng and Yujing Sun and Xiaoxiao Long and Xinge Zhu and Yuexin Ma},
     year={2024},
     eprint={2403.13307},
     archivePrefix={arXiv},
     primaryClass={cs.CV}
}