/cat-bbs-regression

Detect cat faces in images using CNNs with regression

Primary LanguagePythonMIT LicenseMIT

About

Simple convolutional neural network to detect cat bounding boxes in images. The system is restricted to one bounding box per image, which is localized using regression (i.e. directly predicting the bounding box coordinates). The model consists of 7 convolutional layers and 2 fully connected layers (including output layer).

Dependencies

  • python 2.7 (only tested with that version)
  • keras (tested in v1.06)
  • scipy
  • numpy
  • scikit-image

Usage

  • Download the 10k cats dataset and extract it, e.g. into directory /foo/bar/10k-cats. That directory should contain the subdirectories CAT_00, CAT_01, etc.
  • Train the model using train_convnet.py --dataset="/foo/bar/10k-cats".
  • Apply the model using train_convnet.py --dataset="/foo/bar/directory-with-cat-images".

Images

Example results:

Located cat face Located cat face Located cat face Located cat face Located cat face Located cat face