heng-hw's Stars
crane-papercode/3DMedPT
Project page: https://3dmedpt.github.io/
daveredrum/D3Net
[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Zhiyuan-Li-John/MuCR
MuCR is a benchmark designed to evaluate Vision Large Language Models' (VLLMs) ability to infer causal relationships using only visual cues
chaoyivision/python-dprint
An easy-to-use debug print tool for deep learning projects in python. PyPi: https://pypi.org/project/pydprint/