Deep-Learning

Abstract

This project aims to classify accessibility feature of storefront on sidewalk from google street view images and it can help visually impaired people to avoid dangerous obstacles in the street and allow them to access each store. We are conducting some experiments on Faster-RCNN, a popular architecture in Object Detection. It enables us to have better understanding of how this model actually “sees” a physical object.

Dataset

Dataset Size:

Training Set: 928 images with labels
Validation Set: 100 images with labels

Detection Result

Testing without using depth-filtering

*************************Recall Precision **************************************
Door -> TP: 159 Predict: 350 Truth: 164 Precision:45.43% Recall:96.95%
Knob -> TP: 58 Predict: 211 Truth: 77 Precision:27.49% Recall:75.32%
Stairs -> TP: 94 Predict: 361 Truth: 96 Precision:26.04% Recall:97.92%
Ramp -> TP: 1 Predict: 13 Truth: 10 Precision:7.69% Recall:10.00%
*******************************************************************************

Testing using depth-filtering

*************************Recall Precision **************************************
Door -> TP: 154 Predict: 283 Truth: 164 Precision:54.42% Recall:93.90%
Knob -> TP: 58 Predict: 211 Truth: 77 Precision:27.49% Recall:75.32%
Stairs -> TP: 94 Predict: 361 Truth: 96 Precision:26.04% Recall:97.92%
Ramp -> TP: 1 Predict: 13 Truth: 10 Precision:7.69% Recall:10.00%

Non-filtering vs Depth-filtering

Non-filtering	Depth-Filtering

chrissuda/Deep-Learnng

Deep-Learning

Abstract

Dataset

Category:

Dataset Size:

Detection Result

Testing without using depth-filtering

Testing using depth-filtering

Non-filtering vs Depth-filtering

Predicting Doors' Geolocation

Prediction's Accuracy

Assiocated Stores' Names With Doors