HumanVLA

This repository contains the official implementation associated with the paper: HumanVLA: Towards Vision-Language Directed Object Rearrangement by Physical Humanoid.

[arXiv]

Citation

If you find our work useful, please cite:

@misc{xu2024humanvla,
      title={HumanVLA: Towards Vision-Language Directed Object Rearrangement by Physical Humanoid}, 
      author={Xinyu Xu and Yizheng Zhang and Yong-Lu Li and Lei Han and Cewu Lu},
      year={2024},
      eprint={2406.19972},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2406.19972}, 
}