CNN number detection

Project

The goal of this repository is to implement a number detection using Tensorflow with a custom Convolution Neural Net (CNN) architecture specifically for fast inference. The CNN will be trained using a custom created dataset that contains numbers from 1-9 and a category for 'not numbers (-1)'.

The CNN is fast enough to run in real-time on a Raspberry Pi.

Specs

Image size: 320 x 240
Camera fps: 38
Processing time (with isolator): 25ms - 45ms (on Raspberry Pi)

Includes

DataExtractor (to extract data for the dataset)
Isolator (to find the numbers inside an image)
Trainer (to train the model)
Tester (to test the model)

Example input images

Example extracted images (dataset images)

Install

Requirements

You need to have the following packages installed:

Python 3.6
Tensorflow 1.4.1+
OpenCV 4.0
Etc.

Install

Clone the repo and install 3rd-party libraries

$ git clone https://github.com/FabianGroeger96/cnn-number-detection
$ cd cnn-number-detection
$ pip3 install -r requirements.txt

Usage

Extract data with DataExtractor

Create a folder named images_to_extract in the data extractor directory (The folder can be named differently, but don't forget to change the INPUT_DATA_DIR variable in the constants.py file). This directory will be used to extract the regions of interest to train your CNN.
Copy the data to extract the regions of interest into the images_to_extract folder
Specify which categories you want to extract in the constants.py file, by changing the CATEGORIES variable
Run the extract_data.py file and call the method extract_data() from the Extractor
After the method is finished your extracted regions of interest are located in the data_extracted folder. In there you will also find folders for each of your categories. These folders are used to label the regions of interest for then training your CNN.

Label the Data (by Hand)

First of all you will have to extract the regions of interest with the DataExtractor (follow the step Extract data with DataExtractor)
Classify the images, by dragging them in the corresponding category folder

Label the Data (with existing Model)

First of all you will have to extract the regions of interest with the DataExtractor (follow the step Extract data with DataExtractor)
Specify in the constants.py file where your model will be located, by modifying the MODEL_DIR constant
Place your existing model in the directory that you specified before
Run the extract_data.py file and call the method categorize_with_trained_model(), this will categorize your regions of interest
Verify that the data was labeled correctly, by checking the data_extracted folder

Create dataset pickle files

If you are finished labeling the images, run the extract_data.py file and call the method rename_images_in_categories() from the Extractor, this will rename the images in each category
Run the extract_data.py file and call the method create_training_data(), this will create your pickle files (X.pickle and y.pickle) which contain the data and the labels of your data

Train the CNN

Check if the pickle files (X.pickle and y.pickle) were created in the root directory of the project
Run the train_model.py file within the trainer, this will train your model and save it to the directory specified in the constants.py (MODEL_DIR)

Test the CNN

Check if the model was created in the directory specified in constants.py (MODEL_DIR)
Upload a testing image to the tester directory
Run the test_model.py file within the tester and give the test_model_with_image(image_name) function the name of the image
Run the test_model.py file within the tester and give the test_model_with_folder(folder_name) function the name of the folder containing multiple images to test

AntonioHenriqueAC/cnn-number-detection

CNN number detection

Project

Specs

Includes

Example input images

Example extracted images (dataset images)

Install

Requirements

Install

Usage

Extract data with DataExtractor

Label the Data (by Hand)

Label the Data (with existing Model)

Create dataset pickle files

Train the CNN

Test the CNN