def build_cracknet(input_shape=(120,120,3),n_output_classes=2):
model = Sequential()
#bias_regularization = 0.0001
#kernel_regularization = 0.0001
# Convolution + Batch Norm. + ELU + Pooling #1
model.add(
Conv2D( 32, (11, 11),
input_shape=input_shape,
activation = 'relu',
strides=(1,1),
name="Conv1"
#kernel_regularizer=l2(kernel_regularization),
#bias_regularizer=l2(bias_regularization)
) )
model.add( BatchNormalization( name="Conv1BN"))
model.add( ELU( name="Conv1ELU" ))
model.add( MaxPooling2D(pool_size = (7,7),strides=(2,2), name="Conv1Pool" ) )
# Convolution + Batch Norm. + ELU + Pooling #2
model.add(
Conv2D( 48, (11, 11),
input_shape=(52,52,32),
activation = 'relu',
strides=(1,1), name="Conv2"
#kernel_regularizer=l2(kernel_regularization),
#bias_regularizer=l2(bias_regularization)
) )
model.add( BatchNormalization( name="Conv2BN" ))
model.add( ELU( name="Conv2ELU" ))
model.add( MaxPooling2D(pool_size = (5,5),strides=(2,2), name="Conv2Pool" ) )
# Convolution + Batch Norm. + ELU + Pooling #3
model.add(
Conv2D( 64, (7,7),
input_shape=(19,19,48), 96
activation = 'relu',
strides=(1,1),
name="Conv3"
#kernel_regularizer=l2(kernel_regularization),
#bias_regularizer=l2(bias_regularization)
) )
model.add(BatchNormalization(name="Conv3BN"))
model.add(ELU( name="Conv3ELU"))
model.add( MaxPooling2D(pool_size = (3,3),strides=(1,1), name="Conv3Pool" ) )
# Convolution + Batch Norm. + ELU + Pooling #4
model.add(
Conv2D( 80, (5,5),
activation = 'relu',
strides=(1,1),
name="Conv4"
#kernel_regularizer=l2(kernel_regularization),
#bias_regularizer=l2(bias_regularization
)
)
model.add(BatchNormalization( name="Conv4BN" ))
model.add(ELU( name="Conv4ELU" ))
model.add( MaxPooling2D(pool_size = (3,3),strides=(1,1), name="Conv4Pool" ) ) # Paper says (2,2) ?
# Flattening
model.add( Flatten() )
# FC #1
model.add( Dense( units = 5120, input_shape=(96,), activation = 'relu', ) )
model.add(ELU())
model.add(Dropout(0.2))
# FC #2
model.add( Dense( units = 96, input_shape=(2,), activation = 'relu' ) )
# Output Layer
model.add( Dense( units = n_output_classes, activation = 'softmax' ) )
# Compile
sgd_optimizer = SGD(lr=0.002, decay=0.1/350, momentum=1)
model.compile( optimizer = sgd_optimizer,
loss = 'categorical_crossentropy',
metrics = ['accuracy'] )
return model
This is repository is part of the nuclear plant crack inspection processing pipeline that attempts to reproduce the paper Deep Learning-based Crack Detection Using Convolutional Neural Network and Naıve Bayes Data Fusion. [1]
Two CNN models for crack detection are implemented in Tensorflow/Keras. A very simple model to test the workflow on low-end computers (that we call SimpleNet) and the model from the original paper (which we call CrackNet) is implemented in Tensorflow and Keras.
Also, transfer learning is applied to the VGG-16 model.
Any of these models is then embedded in the CNNDetector component that takes an image as an input, scans it for cracks, and returns a list of the bounding boxes with the probabilities of each being a crack. The CNNDetector in the following nuclear plant inspection processing pipeline can be implemented by locally instancing the tensorflow executor or consuming the model as as service with REST or GRPC.
The provided pre-trained models are the result of experimenting with different dataset preparations and model architectures and hyperparameter selection.
The following are publicly available datasets for crack and surface defect classification. A script is provided to download and prepare first dataset (instructions below).
Dataset | Description |
---|---|
Concrete Crack Images for Classification | Croncrete images having cracks collected from various buildings separated in two classes: "Positive crack" and "Negative crack". Each class has 20000 images with a total of 40000 images with 227 x 227 pixels with RGB channels. |
SDNET2018 | SDNET2018 is an annotated image dataset for training, validation, and benchmarking of artificial intelligence based crack detection algorithms for concrete. SDNET2018 contains over 56,000 images of cracked and non-cracked concrete bridge decks, walls, and pavements. The dataset includes cracks as narrow as 0.06 mm and as wide as 25 mm. |
NEU Surface Defect Database | In the Northeastern University (NEU) surface defect database, six kinds of typical surface defects of the hot-rolled steel strip are collected, i.e., rolled-in scale (RS), patches (Pa), crazing (Cr), pitted surface (PS), inclusion (In) and scratches (Sc). The database includes 1,800 grayscale images: 300 samples each of six different kinds of typical surface defects. |
Magentic tile defect dataset | This is the datasets of the upcoming paper "Saliency of magnetic tile surface defects"The images of 6 common magnetic tile defects were collected, and their pixel level ground-truth were labeled. (From original author description). |
Two notebooks to experiment with basic augmentation and PCA color augmentation are provided.
Datasets should be splitted in two or three sets (training and validation or training,validation, and testing set) and each set should a directory with the name of the class contain images of that class. This is the default directory structure used by Keras flow_from_directory method.
./data
datasets
training_set
crack
imgxyz.jpg
imgxyz.jpg
...
no_crack
imgxyz.jpg
imgxyz.jpg
...
validation_set
crack
imgxyz.jpg
imgxyz.jpg
...
no_crack
imgxyz.jpg
imgxyz.jpg
...
test_set
crack
imgxyz.jpg
imgxyz.jpg
...
no_crack
imgxyz.jpg
imgxyz.jpg
...
The following CNN model architectures are provided:
Model | Description |
---|---|
SimpleNet(Keras) | Very simple CNN. CPU friendly. |
CrackNet(Keras) | Implementation of CNN described in [1] in Keras with Tensorflow backend. |
CrackNet(Tensorflow) | Implementation of CNN described in [1] in Tensorflow. |
VGG16 | Transfer learning applied to VGG16. |
SimpleNET was trained on a public available cracks in concrete surface dataset [2] to be used as the model in the CNN detector for the nuclear plant crack inspection pipeline.
Original dataset consists of 20.000 samples for each class. For training session 18.000 samples were used for training and 2.000 samples were reserved for validation.
Some data augmentation was performed using Keras ImageDataGenerator ( see: https://keras.io/preprocessing/image/).
Parameter | Value |
---|---|
Optimizer | ADAM |
Loss function | Categorical Cross Entropy |
Epochs | 30 |
Batch size | 32 |
Data augmentation rescale | 1./255 |
Shear range | 0.2 |
Zoom range | 0.2 |
Horizontal flip | True |
Accuracy of near 98% was obtained against test set for checkpoint file: simplenet_cracks_weights.29-0.01.hdf5.
Testing against a real high res image containing cracks, the model fails to detect some positives and it is evident that more negative examples are needed for scenarios when surface contains elevations or other variations (bottom left section):
WIP
./
/data
/doc
/model-checkpoints
/models
/src
notebooks
data_preparation
cnn_model_development
examples
utils
/tensorboard_logs
/training_logs
where:
- data: contains datasets and other media.
- doc: documentation files.
- model-checkpoints: checkpoints generated during model training in hd5 format.
- models: models converted to Tensorflow SavedModel format ready for deployment with tensorflow serving.
- src: source code directory containing:
- notebooks:
- data_preparation: notebooks for EDA (Exploratory Data Analysis) and experiment with dataset augmentation.
- cnn_model_development: notebooks for development and training of CNN models.
- examples: examples and tests of using the cnn models.
- utils: utilities for dataset preparation and model reporting/evaluation.
- notebooks:
- tensorboard_logs: path to store tensorboard logs.
- training_logs: training logs in CSV to generate learning curves in reports.
Note: the following instruction steps use a custom docker image based on official Tensorflow Docker image for GPU.
Tested with Ubuntu 18.04 and Geforce GTX 950M with nvidia driver version 390.116.
- Clone repository.
git clone https://github.com/nhorro/tensorflow-crack-classification.git
- Download crack dataset.
wget https://data.mendeley.com/datasets/5y9wdsg2zt/1/files/c0d86f9f-852e-4d00-bf45-9a0e24e3b932/Concrete%20Crack%20Images%20for%20Classification.rar
mkdir -pv data/datasets/cracks
unrar x Concrete\ Crack\ Images\ for\ Classification.rar ./data/datasets/cracks
- Split the original dataset in training and evaluation (suggested: 80%-20%):
python src/utils/split_dataset.py --dir=data/datasets/cracks/ --train=80 --test=20 --output=data/datasets/cracks_splitted8020
- Run docker container for development (note: --network="host" is to enable access to the tensorflow serving API):
docker run -it --rm --runtime=nvidia -v $(realpath $PWD):/tf/notebooks --name tensorflowdev1 --network="host" nhorro/tensorflow1.12-py3-jupyter-opencv:1.1.0
-
Train the model from a notebook:
- Open notebook [SimpleNet - Development Notebook](src/cnn_model_development/SimpleNet - Development Notebook.ipynb) and run.
-
Optional: monitor and debug with tensorboard.
- Install tensorboard if not already installed:
pip install tensorboard # if not installed
Run tensorboard and connect to http://localhost:6006:
tensorboard --logdir=tensorboard_logs
Official image tensorflow/serving:1.12.0-gpu is used for serving.
export SERVING_MODEL=simplenet_cracks8020
docker run -t --rm --runtime=nvidia -p 8501:8501 -v $(realpath $PWD/models):/models/ --name crack_classification_service -e MODEL_NAME=$SERVING_MODEL tensorflow/serving:1.12.0-gpu
Query model status to check model availability.
curl http://localhost:8501/v1/models/simplenet_cracks8020
{
"model_version_status": [
{
"version": "1",
"state": "AVAILABLE",
"status": {
"error_code": "OK",
"error_message": ""
}
}
]
}
Query model metadata to obtain method signature definition (in this case, predict method input is a 64x64x3 image of DT_FLOAT elements).
curl http://localhost:8501/v1/models/simplenet_cracks8020/metadata
{
"model_spec":{
"name": "simplenet_cracks8020",
"signature_name": "",
"version": "1"
}
,
"metadata": {"signature_def": {
"signature_def": {
"serving_default": {
"inputs": {
"input_image": {
"dtype": "DT_FLOAT",
"tensor_shape": {
"dim": [
{
"size": "-1",
"name": ""
},
{
"size": "64",
"name": ""
},
{
"size": "64",
"name": ""
},
{
"size": "3",
"name": ""
}
],
"unknown_rank": false
},
"name": "conv2d_6_input:0"
}
},
"outputs": {
"dense_7/Softmax:0": {
"dtype": "DT_FLOAT",
"tensor_shape": {
"dim": [
{
"size": "-1",
"name": ""
},
{
"size": "2",
"name": ""
}
],
"unknown_rank": false
},
"name": "dense_7/Softmax:0"
}
},
"method_name": "tensorflow/serving/predict"
}
}
}
}
}
There is an example notebook of how to consume the API to classify an scan images.
WIP
- [1] Chen, Fu-Chen & Jahanshahi, Mohammad. (2017). NB-CNN: Deep Learning-based Crack Detection Using Convolutional Neural Network and Naïve Bayes Data Fusion. IEEE Transactions on Industrial Electronics. PP. 1-1. 10.1109/TIE.2017.2764844.
- [2] Özgenel, Çağlar Fırat (2018), “Concrete Crack Images for Classification”, Mendeley Data, v1http://dx.doi.org/10.17632/5y9wdsg2zt.1