Dummy Labeling

Convert image datasets originally built for classification into object detection datasets.

You may want to check out its sister project too.

What differentiates this tool from a simple image annotator: once outputs from existing object detection models are saved, they can be loaded directly into the tool and serve as a starting point. See the process workflow image below.

Description

I need to train a model that can automatically detect food in an uploaded image. However, there are no open-source food image datasets for the object detection task. There are a handful of food image datasets for classification, though. So why not use the best of both worlds: classification labels provided as ground truths, and bounding boxes from pre-trained object detection models? Then I just need to pick the best bounding box and assign it the ground truth label.
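The idea of pairing a detector's box with a classification label can be sketched in a few lines. This is only an illustration, not the code in app.py: in the tool the best box is chosen by hand, whereas here the highest-scoring box stands in for that choice. The detection format (a list of dicts with "box" and "score" keys) and the function name pick_best_box are assumptions made for this example.

```python
from typing import Dict, List, Optional


def pick_best_box(detections: List[Dict], label: str,
                  min_score: float = 0.0) -> Optional[Dict]:
    """Keep the highest-scoring box and relabel it with the ground truth class.

    `detections` is an assumed format: each item has a "box" and a "score".
    The detector's own class prediction is ignored on purpose.
    """
    candidates = [d for d in detections if d["score"] >= min_score]
    if not candidates:
        return None  # nothing usable; the image needs a manual box
    best = max(candidates, key=lambda d: d["score"])
    return {"name": label, "box": best["box"], "score": best["score"]}


# Made-up detector output for one image whose classification label is "ramen".
detections = [
    {"box": [10, 20, 110, 220], "score": 0.42, "label": "bowl"},
    {"box": [15, 25, 200, 240], "score": 0.91, "label": "dining table"},
]
result = pick_best_box(detections, label="ramen")
print(result)
```

A score-based pick like this is only a starting point; the point of the tool is that a person can review and correct the selection.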

Load outputs from SSD result, select & Save

Saved results

Process Workflow

To see how bounding boxes and ground truth labels are obtained, refer to the notebook under data_prep. Note: the notebook only shows one way. You are more than welcome to use your own ground truth labels and model output files, but if so you might need to tweak the code (mainly app.py) a bit.

Usage

To quickly get started and see the tool in action, check out the example/ folder.

Prerequisites: Python >= 3.6. Install the required packages: pip install -r requirements.txt
After cloning the repo, cd into this directory.
Get bounding boxes and labels using existing tools. If you'd like to use the TF Object Detection API, check out this notebook.
Gather the ground truth text file and the OD outputs (json) and put them in a secure place.
Create a config file similar to example/sample_config.txt.
Start the app:

$ python app.py --dir /images/directory --config path/to/config.txt

You can also specify the file the annotations are written to (out.csv is the default):

$ python app.py --dir /images/directory --config path/to/config.txt --out test.csv

Open http://127.0.0.1:5000/tagger in your browser
- Only tested on Chrome

Output

To keep things simple, the output is written to a CSV file with the following fields:
- id - id of the bounding box within the image
- name - name of the bounding box within the image
- image - image the bounding box is associated with
- xMin - min x value of the bounding box
- xMax - max x value of the bounding box
- yMin - min y value of the bounding box
- yMax - max y value of the bounding box
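As a rough sketch of how the output could be consumed downstream, the snippet below reads rows with the fields listed above and derives each box's width and height. It assumes the columns appear in the order listed and that the file may or may not start with a header row; neither is confirmed here, so check your own out.csv first. The function name load_annotations and the sample row are made up for illustration.

```python
import csv
import io

# Assumed column order, taken from the field list above.
FIELDS = ["id", "name", "image", "xMin", "xMax", "yMin", "yMax"]
COORDS = ("xMin", "xMax", "yMin", "yMax")


def load_annotations(handle):
    """Parse annotation rows and add width/height for each bounding box."""
    rows = []
    for row in csv.DictReader(handle, fieldnames=FIELDS):
        if row["id"] == "id":
            continue  # skip a header row if the file has one (assumption)
        for key in COORDS:
            row[key] = float(row[key])
        row["width"] = row["xMax"] - row["xMin"]
        row["height"] = row["yMax"] - row["yMin"]
        rows.append(row)
    return rows


# Made-up sample standing in for out.csv.
sample = io.StringIO(
    "id,name,image,xMin,xMax,yMin,yMax\n"
    "0,ramen,img_001.jpg,15,200,25,240\n"
)
annotations = load_annotations(sample)
print(annotations[0]["name"], annotations[0]["width"], annotations[0]["height"])
```

For a real file, replace the StringIO sample with open("out.csv", newline="").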

HOWTOs