Using iPhone images as input data
Closed this issue · 5 comments
If I want to use RGB images taken with an iPhone as input data (no depth info), should I copy datasets_depth_iphone.py to create a datasets_depth_xyz.py? Or should I just use the original datasets_depth_iphone.py and nvs_rgbd_iphone.gin?
Or can you briefly explain the data-prepping pipeline if I want to use only RGB images as input?
Thank you for your question.
If you do not use depth info, you can directly use train_llff_dtu.py, as with the LLFF dataset.
More precisely, I am not sure what "if I want to use only RGB images as input" means here. In our paper, if you do not have depth maps from depth sensors, you can use depth estimation to predict depth maps, as we do for LLFF and DTU. Note that the LLFF and DTU datasets are split by holding out every 8th image as the test set.
Moreover, the different datasets_depth_xxx.py scripts are very similar. We keep separate scripts because Kinect depth maps contain some noise, and because some datasets use inverse depth maps while others use disparity maps.
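A minimal sketch of that LLFF/DTU-style split (every 8th image held out as a test view); the image directory is a placeholder, not a path from this repo:

```python
import glob

# Hypothetical folder of captured RGB images (placeholder path).
image_paths = sorted(glob.glob("my_scene/images/*.jpg"))

# LLFF/DTU convention: every 8th image is held out as a test view.
test_paths = image_paths[::8]
train_paths = [p for p in image_paths if p not in test_paths]
```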
Basically, you can prepare your data as follows:
- Capture a few RGB images. Prepare your dataset like the iPhone dataset: for example, training images are named with the '_train.jpg' suffix and test images with the '_test.jpg' suffix (see the renaming sketch after this list).
- Calculate camera poses with COLMAP.
- Calculate inverse depth maps with the DPT depth estimator (see the DPT sketch after this list).
- Organize the data as in the iPhone dataset (the depth maps provided by the iPhone are also inverse depth maps).
- Set the variables in nvs_rgbd_iphone.gin, and use datasets_depth_iphone.py and train_iphone.py.
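For the first step, a minimal renaming sketch following the '_train.jpg' / '_test.jpg' convention above; the folder name and the every-8th test split are assumptions you should adapt to your own capture:

```python
from pathlib import Path

image_dir = Path("my_scene/images")  # placeholder folder of captured iPhone photos
images = sorted(image_dir.glob("*.jpg"))

for i, path in enumerate(images):
    # Assumed split: every 8th image becomes a test view, the rest are training views.
    split = "test" if i % 8 == 0 else "train"
    path.rename(image_dir / f"{i:04d}_{split}.jpg")
```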
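For the depth-estimation step, a sketch of predicting inverse depth with DPT via the public MiDaS torch.hub entry point; the file paths and the .npy output format are assumptions, so save the maps in whatever format datasets_depth_iphone.py actually expects:

```python
import cv2
import numpy as np
import torch

# Load the DPT model published in the MiDaS repo via torch.hub.
midas = torch.hub.load("intel-isl/MiDaS", "DPT_Large")
midas.eval()
transforms = torch.hub.load("intel-isl/MiDaS", "transforms")
transform = transforms.dpt_transform

# Placeholder input/output paths.
img = cv2.imread("my_scene/images/0000_train.jpg")
img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)

with torch.no_grad():
    prediction = midas(transform(img))
    # Resize the prediction back to the original image resolution.
    prediction = torch.nn.functional.interpolate(
        prediction.unsqueeze(1),
        size=img.shape[:2],
        mode="bicubic",
        align_corners=False,
    ).squeeze()

inv_depth = prediction.cpu().numpy()  # DPT/MiDaS outputs relative inverse depth
np.save("my_scene/depths/0000_train.npy", inv_depth)
```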
Do you have any questions about this issue?
I am closing this issue since there are no further questions. Feel free to re-open it or open a new issue.