This repository contains the official implementation associated with the paper: HumanVLA: Towards Vision-Language Directed Object Rearrangement by Physical Humanoid.
[arXiv]
If you find our work useful, please cite:
@misc{xu2024humanvla,
title={HumanVLA: Towards Vision-Language Directed Object Rearrangement by Physical Humanoid},
author={Xinyu Xu and Yizheng Zhang and Yong-Lu Li and Lei Han and Cewu Lu},
year={2024},
eprint={2406.19972},
archivePrefix={arXiv},
primaryClass={cs.RO},
url={https://arxiv.org/abs/2406.19972},
}