Deep3D-pytorch

Team Members: Anish Madan, Apoorv Khattar, Yash Tomar

About the porject

Estimating a right-view from a monocular image(to make a stereo pair of images) which respects the geometry of the scene is an important problem in computer vision. This repository aims to achieve this by implementing Deep3D (see original repo) using PyTorch to generate right view of images. This generated pair of images can then be used to estimate depth in images, convert 2D video to 3D, etc.

Dataset

We used the KITTI Stereo 2015 Dataset. The dataset consists of 200 training scenes and 200 test scenes, which include 4 color images per scene, in a lossless png format. This means that we have 400 left and right image pairs for training.

Model Weights

The pretrained model can be downloaded from here

Results

The following are some results of our approach (from left to right- input left , ground truth right stereo image pair, generated stereo image pair, depth map generated using OpenCV for image pairs):

s1

s2

This work was done as part of our project for CSE344: Computer Vision course at IIIT Delhi.