/MFNet-pytorch

MFNet-pytorch, image semantic segmentation using RGB-Thermal images

Primary LanguagePython

MFNet-pytorch

Image semantic segmentation using RGB-Thermal images, for example:

Project home page: https://www.mi.t.u-tokyo.ac.jp/static/projects/mil_multispectral/

This is official pytorch implementation of MFNet: Towards real-time semantic segmentation for autonomous vehicles with multi-spectral scenes (IROS 2017). The pdf can be downloaded from HERE (Google Drive shared file).

Introduction

MFNet is a light CNN architecture for multispectral images semantic segmentation, with ~ 1/40x parameters and 6x ~ inference speed, while providing similar or higher accuracy compared to SegNet.

Requirements

* pytorch 0.4.0
* PIL 4.3.0
* numpy 1.14.0
* tqdm 4.19.4

Dataset

We published a new RGB-Thermal semantic segmentation dataset in support of further development of autonomous vehicles in the future. This dataset contains 1569 images (820 taken at daytime and 749 taken at nighttime) which is available in our project home page

Importance : .png images in our dataset are contain 4 channels, load them in this way to get the right format.

import numpy as np
from PIL import Image
im = Image.open('/path/to/dataset/01606D.png')
print(np.asarray(im).shape)

# (480, 640, 4)

Usage

  • run demo code use trained model
$ cd /path/to/this/repository
$ python run_demo.py
  • training
1. download our dataset
2. set directory of dataset in train.py
3. run train.py