ch3cook-fdu/Vote2Cap-DETR
[CVPR 2023] Vote2Cap-DETR and [T-PAMI 2024] Vote2Cap-DETR++; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D Dense Captioning methods
PythonMIT
Issues
- 1
How to visualize the encoder scene and output bbox
#25 opened by HT0403 - 1
- 5
loss is negative when running SCST
#23 opened by 1301358882 - 3
CUDA kernel failed : no kernel image is available for execution on the device
#22 opened by 1301358882 - 5
Inference on Custom DB
#21 opened by jkstyle2 - 12
Questions about performance
#19 opened by jkstyle2 - 4
What should I do to visualize this data?
#20 opened by YigaoWang - 4
- 6
- 2
scannet_means.npz and scannet_reference_means.npz
#18 opened by cy94 - 2
Suddenly terminates during debugging
#17 opened by 1301358882 - 2
Question about evaluate metric
#16 opened by WeitaiKang - 15
- 3
Question for ScanRefer benchmark, not Scan2cap
#15 opened by jkstyle2 - 4
Thanks for your great work! I have some question
#13 opened by Leon1207 - 2
Question about caption evaluation results
#12 opened by TTXiann - 4
How to visualize the result?
#11 opened by iris0329 - 2
- 1
Why the 'nyu40id2class' of Vote2Cap is different with that of these detection methods?
#8 opened by linhaojia13 - 1
- 4
Question about caption evaluation
#5 opened by 8reaks - 6
dataset processing issue
#3 opened by cactusycy