The aim of the project is to create a convolutional neural network model that can identify cracks on concrete with a high accuracy. The problem is modelled as a binary classification problem (no cracks/negative and cracks/positive). The model is trained with a dataset of 40000 images (20000 images of concrete in good condition and 20000 images of concrete with cracks). The source of the dataset can be found here.
The project is completed with Spyder as the main IDE. The main frameworks used in this project are Numpy, Matplotlib and Tensorflow Keras.
The methodology for this project is inspired by a documentation on the official TensorFlow website. You can refer to it here.
The image data are loaded along with their corresponding labels. The data is first split into train-validation set, with a ratio of 70:30. The validation data is then further split into two portion to obtain some test data, with a ratio of 80:20. The overall train-validation-test split ratio is 70:24:6. No data augmentation is applied as the data size and variation are already sufficient.
The input layer is designed to receive coloured images with a dimension of 160x160. The full shape will be (160,160,3).
Transfer learning is applied for building the deep learning model of this project. Firstly, a preprocessing layer is created that will change the pixel values of input images to a range of -1 to 1. This layer serves as the feature scaler and it is also a requirement for the transfer learning model to output the correct signals.
For feature extractor, a pretrained model of MobileNet v2 is used. The model is readily available within TensorFlow Keras package, with ImageNet pretrained parameters. It is also frozen hence will not update during model training.
A global average pooling and dense layer are used as the classifier to output softmax signals. The softmax signals are used to identify the predicted class.
The simplified illustration of the model is shown in the figure below.
The model is trained with a batch size of 16 and 10 epochs. After training, the model reaches 99% training accuracy and 95% validation accuracy. The training results are shown in the figures below.
The graphs show a clear sign of convergence, indicating the model has been trained to reach an optimal level of accuracy and loss.
The model is evaluated with the test data. The loss and accuracy are shown in figure below.
Some predictions are also been made with the model, and compared with the actual results.