EPCL pertained model details

Question

EPCL pertained model details

ZCMax opened this issue a year ago · 4 comments

Thanks for your code, I wonder how the EPCL pertained model is obtained? For example, training datasets and training approach? Since the name of checkpoint includes scannet, was it trained on ScanNet datasets?

Answer 1 · 2023-07-03T07:32:49.000Z

Hi, thanks for your issue.

The EPCL checkpoint we used is the methodology from FrozenCLIP. Since the 3D datsets are limited, we follow the setting in the paper and choose the pretrained checkpoint on ScanNet, which is trained for 3D detection task.

Answer 2 · 2023-07-12T06:41:53.000Z

Thanks for your reply, my next question is that since the pertained checkpoint is trained for 3D detection task on ScanNet, whether the 3D benchmark on ScanNet can still be regarded as zero-shot manner?

Answer 3 · 2023-07-20T07:29:43.000Z

Thanks. This method is limited by the existing pretrained encoder in 3D vision. Compared with 2D, the EPCL encoder indeed used the scannet data to pretrain. But in LAMM framework, ScanNet data is not exposured to LLM decoder, which is the major part of the framework.

Later, we will try to test with other point cloud encoder, contributions are also welcomed.

Answer 4 · 2023-08-01T06:10:13.000Z

This issue will be closed for no further discussions. Please reopen it if necessary.