hzxie/Pix2Vox

Questions

Closed this issue · 2 comments

I hope you are doing great, I am trying to implement this code but I have certain questions, would you kindly guide me through the following please.

  1. After running runner.py, I get the output folder with checkpoints, images (with empty white image as 'voxels-001.png') and logs, could you guide me why is the image complete white and is it supposed to be 3d reconstructed output image?

  2. In config.py file, the changes you mentioned in the path are as below
    __C.DATASETS.PASCAL3D.VOXEL_PATH = '/path/to/Datasets/PASCAL3D/CAD/%s/%02d.binvox
    but there is no .binvox file in PASCAL3D/CAD but it is '.off' file in the original dataset
    and also in
    __C.DATASETS.PIX3D.VOXEL_PATH = '/path/to/Datasets/Pix3D/model/%s/%s/%s.binvox'
    there is no .binvox in Pix3D/model file either but instead there are.obj and .mat files.

I tried with all the combinations but I think my code isn't picking up the datasets correctly. Could you kindly let me know what changes should be made exactly.

  1. in Config.py file , could you please tell me what is line no. 61 means, '__C.DIR.RANDOM_BG_PATH = '/home/hzxie/Datasets/SUN2012/JPEGImages' and what are the changes that can be made here if at all anything is needed.

  2. And in file, Pix3d.json in datasets, the taxonomy file contains only chair but not any other object, but we are considering only aeroplane to run this code, so could you please elaborate on this.

Thanks for understanding.

  1. The images in the training dataset are with white background. If you feed the network with clutter-background images, there will be a large domain gap. You can use SAM to segment the object first.
  2. The .binvox file should be generated from the .off file. Please refer to this script: https://github.com/hzxie/Pix2Vox/blob/master/utils/binvox_converter.py
  3. Benefiting from SAM, I think pretrained with SUN2012 images is no longer needed.
  4. As far as I recall, Pix3D contains approximately 2800 chairs, but only a limited number of objects for other categories. Training on Pix3D for other categories appears to be infeasible.

Thank you so much for the clarity in your answers.