Full Surround Monodepth from Multiple Cameras: I have some questions about this paper. Thank you for your answer!
abing222 opened this issue · 3 comments
1. In multi-camera training, do you use only the spatio-temporal loss, or the temporal loss plus the spatio-temporal loss?
2. In multi-camera training, do you use COLMAP depth maps as supervision?
3. "By introducing spatial contexts (Equation 3), we not only further improve performance, but also learn scale-aware models by leveraging the camera extrinsics". What does scale-aware mean?
- We use spatial + temporal + spatio-temporal losses
- We do not use COLMAP supervision
Scale-aware means that the point clouds are metrically accurate (i.e., depth estimates are in meters).
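As a rough illustration of why known extrinsics can pin down metric scale, here is a minimal sketch. It is not code from the paper or its repository. The `reproject` helper, the focal length, the principal point, and the 0.5 m baseline are all hypothetical values chosen for the example, and the rotation between views is assumed to be identity. It shows that when the translation is free (as with a predicted ego-motion), scaling depth and translation together leaves the reprojected pixel unchanged, whereas with a fixed, calibrated translation, a wrongly scaled depth lands on a different pixel.

```python
def reproject(pixel, depth, translation, focal=1000.0, center=(640.0, 360.0)):
    """Lift a pixel to 3D at the given depth, translate it into another
    camera's frame (identity rotation assumed), and project it again.

    All parameters are illustrative assumptions, not values from the paper.
    """
    u, v = pixel
    cx, cy = center
    # Back-project the pixel to a 3D point in the source camera frame.
    x = (u - cx) / focal * depth
    y = (v - cy) / focal * depth
    z = depth
    # Move the point into the target camera frame.
    tx, ty, tz = translation
    x, y, z = x + tx, y + ty, z + tz
    # Pinhole projection into the target image.
    return (focal * x / z + cx, focal * y / z + cy)


pixel = (700.0, 400.0)
depth = 10.0                 # assumed true depth in meters
baseline = (0.5, 0.0, 0.0)   # assumed translation between the two views
scale = 2.0

reference = reproject(pixel, depth, baseline)

# Temporal context: the translation is predicted, so it can absorb the scale.
scaled_translation = tuple(scale * t for t in baseline)
temporal = reproject(pixel, scale * depth, scaled_translation)

# Spatial context: the translation comes from calibration and stays fixed.
spatial = reproject(pixel, scale * depth, baseline)

print("reference:", reference)
print("depth and translation scaled together:", temporal)
print("depth scaled, translation fixed:", spatial)
```

In this toy setup the second result matches the reference while the third does not, so only the fixed-baseline case gives a photometric loss anything to penalize when the depth scale is wrong.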
I hope that helps!
Thank you very much. In my experiments, without additional supervision, introducing the camera extrinsics makes the scales of the spatial and temporal terms inconsistent (absolute scale versus relative scale), which leads to poor training results.