wohlert/generative-query-network-pytorch

Bad images in training

Closed this issue · 7 comments

While playing around with the sm5 dataset, I noticed that some of the images are badly rendered.
[attached image: grid of rendered views from the dataset]
I'm not sure whether this will pose any problem for training; I just wanted to point it out.

Are you using the dataset you created yourself?

Yes. I've seen several sequences with more than two objects in them, like the one I posted here. The images in row 2, column 2 and row 3, column 1 clearly show two different objects: one piece with three blocks and another with two (yellow and blue).

I've created my own dataloader based on https://github.com/l3robot/gqn_datasets_translator, and I've observed the same kind of poor-quality samples. Might such samples cause exploding gradients in the network?
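
For reference, a minimal sketch of such a dataloader. It assumes the converter wrote one torch-serialized `(frames, cameras)` tuple per `.pt` file; the file layout, names, and tensor shapes here are assumptions for illustration, not the converter's actual output format:

```python
import os
import torch
from torch.utils.data import Dataset

class ConvertedGQNDataset(Dataset):
    """Loads scenes converted from the GQN TFRecords (assumed layout)."""

    def __init__(self, root_dir):
        # One torch-serialized scene per file (assumed naming convention).
        self.files = sorted(
            os.path.join(root_dir, f)
            for f in os.listdir(root_dir)
            if f.endswith(".pt")
        )

    def __len__(self):
        return len(self.files)

    def __getitem__(self, idx):
        # Assumed contents: frames (V, H, W, C) uint8, camera poses (V, 5).
        frames, cameras = torch.load(self.files[idx])
        frames = frames.float() / 255.0  # uint8 -> [0, 1]
        return frames, cameras
```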

I believe the problem with these images is that the object is not at the center, but the rendering is actually correct.

Yeah, I agree, but non-centered data (as well as multiple objects in an image) is not representative of the rest of the data the network is trying to learn from, so I would guess it could result in large errors. I'm not sure whether it becomes a problem when the number of such images is small.
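
If exploding gradients do turn out to be an issue, a standard guard is to clip the global gradient norm before each optimizer step. A minimal sketch; the `max_norm` value is an arbitrary example, not a setting from this repo:

```python
import torch

def training_step(model, optimizer, loss_fn, batch, max_norm=5.0):
    optimizer.zero_grad()
    loss = loss_fn(model, batch)
    loss.backward()
    # Rescale gradients so their global norm never exceeds max_norm.
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm)
    optimizer.step()
    return loss.item()
```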

I was wondering if you could detect this kind of thing by looking at the pose. For example, if one fits a sphere to the camera translations, the center for normal scenes is at (0, 0, 0), while that of a bad one would be off by a bit.
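
A minimal sketch of that heuristic: a least-squares sphere fit to the camera translations, flagging scenes whose fitted center drifts from the origin. It assumes each scene gives an (N, 3) array of translations (the first three pose components); the function names and tolerance are hypothetical:

```python
import numpy as np

def fit_sphere(points):
    """Least-squares sphere fit: returns (center, radius).

    Linearizes |p - c|^2 = r^2 into 2 p.c + (r^2 - |c|^2) = |p|^2
    and solves the resulting overdetermined linear system.
    """
    A = np.hstack([2.0 * points, np.ones((points.shape[0], 1))])
    b = np.sum(points ** 2, axis=1)
    x, *_ = np.linalg.lstsq(A, b, rcond=None)
    center = x[:3]
    radius = np.sqrt(x[3] + center @ center)
    return center, radius

def is_off_center(translations, tol=0.1):
    """Flag a scene whose fitted camera-sphere center is far from the origin."""
    center, _ = fit_sphere(np.asarray(translations))
    return np.linalg.norm(center) > tol
```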

It looks like it's only a few examples at this point, so I conclude the problem is on DeepMind's side rather than in the converter.