Full Surround Monodepth from Multiple Cameras, I have some questions about this paper，Thank you for your answer！

Question

Full Surround Monodepth from Multiple Cameras, I have some questions about this paper，Thank you for your answer！

abing222 opened this issue 3 years ago · 3 comments

1、In multi camera training, do you only use spatio-temporal loss or temporal + spatio-temporal loss?
2、In multi camera training, do you use COLMAP depthmap supervise？

Answer 1 · 2022-01-05T07:18:12.000Z

"By introducing spatial contexts (Equation 3), we not only further improve performance, but also learn scale-aware models by leveraging the camera extrinsics", What does scale-aware mean?

Answer 2 · 2022-01-19T02:14:00.000Z

We use spatial + temporal + spatio-temporal
We do not use COLMAP supervision

Scale-aware means that the pointclouds are metrically accurate (i.e. estimates are in meters).

I hope that helps!

Answer 3 · 2022-01-19T02:19:53.000Z

thank you very much，In my experiment，without additional supervision, due to the introduction of camera external parameters, the scales of space and temporal are inconsistent (absolute scale and relative scale), resulting in poor model training results