facebookresearch/PoseDiffusion

RealEstate10k annotation

yuyu0927 opened this issue · 9 comments

Hi Jianyuan,

If I wanna train model on RealEstate10k, how should I preprocess the annotations of it? Is that in the same way with Co3D V2 dataset?

jytime commented

Hi yu,

Overall it follows the same flow but please be really careful with the coordinate conversion. If I remember correctly, Re10k also uses the COLMAP coordinate. Therefore, you can follow the sample here, and convert it from COLMAP coordinate to Pytorch3D one. Generally speaking, you need to permute/transpose the rotation matrix, and make the xy axis negative for both rotation and translation. You can refer to my code there. I would suggest to visualize the cameras to check all are good after finishing it.

Good luck.

Hi Jianyuan,

最近在处理RealEstate10k的数据,从官方下载的数据集都是txt文件,包含每个视频的链接和frame的信息,然后我从github另一处找到一个下载该数据的repo,改了改然后设置成了下载720p的视频再获取frame。下一步就是进行coordinate转换,xy均取反(因为两个坐标系定义中xy方向相反),rotation matrix要转置(因为要和pytorch3D相适应),RealEstate10k中是不包含bbox和catergory信息的,将其参照co3d annotation的格式保存为jgz文件即可(没有的信息就忽略)。请问我这样的处理方式正确吗?

jytime commented

Hi Yu,

Yes it is correct. Please be careful that you should do rotation matrix transpose first, and then take the negative of xy.

jytime commented

And if you want to do GGS on Re10K, please notice that resolution is really important for small baseline images like Re10k. It is always good to keep Re10k images the original resolution when doing GGS.

Thanks for your correction and advice! Actually I reversed the order in coordinate conversion. I will revise and visualize the corrected coordinate tomorrow. I understand that I should keep it in original resolution because it affects feature extraction in GGS. Btw, could you explain briefly why rotation matrix transpose should be done at first?

Hi @yuyu0927 , I just uploaded the dataset file for Re10K. I am going to train a new model on this version and check its performance.