Dam surface crack detection method based on improved DeepLabV3+ network
ZHOU Xun1,2,3, LI Yonglong3, ZHOU Yingyue1,2, WANG Haoran3, LI Jiayang1,2, ZHAO Jiaqi1,2
1. School of Information Engineering, Southwest University of Science and Technology, Mianyang 621000, China; 2. Sichuan Provincial Key Laboratory Robot Technology Used for Special Environment, Southwest University of Science and Technology, Mianyang 621000, China; 3. Sichuan Energy Internet Research Institute, Tsinghua University, Chengdu 610213, China
Abstract:Image analysis is an efficient and accurate method for identifying hydropower dam surface defects. However, due to the complex background of dam crack images and the uneven proportion of cracks and background pixels, the detection effect of traditional algorithms is poor. Moreover, traditional artificial crack inspection is not only inefficient but also costly in the present day. Efficient and accurate dam surface crack detection techniques are crucial for dam maintenance and operation. In order to achieve accurate and efficient dam surface crack detection, a dam surface crack detection method based on an improved DeepLabV3+ model is proposed. Model training is carried out for the self-made dam surface crack image dataset of a hydropower station in Southwest China, and the model is evaluated by F1,score, ZMIoU, ZMPA, parameter quantity and other indicators. The following improvements are made to the traditional DeepLabV3+ network model: (1) A three line attention module is added to improve the model's ability to extract crack pixels and reduce proportion imbalance between background pixels and crack pixels. (2) The original pyramid pooling module is cascaded for model optimization so that the model can achieve more intensive pixel sampling, and subsequently obtain more abundant crack features. (3) In order to solve the problem of the significant/too large number of traditional DeepLabV3+ network parameters, MobileNetV2 network is used as the backbone of the model to extract the network, to reduce the network to a lightweight module, and to reduce model parameters. (4) Focal loss and Dice loss are used as the loss functions of the model to overcome the data imbalance and to improve the accuracy of network classification. The improved DeepLabV3+ network model in this paper could better realize the extraction of crack pixels, reduce the problems caused by the imbalance of pixel proportion, and better ensure the efficient and accurate detection of dam surface cracks. The experiment on the self-made dam surface crack image dataset of a hydropower station in Southwest China showed the following: (1) Compared with the original model, the improved DeepLabV3+ model increased F1, score by 3.33%, ZMIOU by 2.89%, ZMPA by 1.12%, and the parameters were reduced to 3 014 714. This finding showed that the improved model proposed in this paper had stronger performance than the original model, better ability to extract crack pixels, and could better complete the task of crack identification. (2) Compared with other attention mechanisms, the three line attention module proposed in this paper had certain advantages, which could increase the attention of the model to the crack pixels and enable the model to extract the crack features needed. Through an analysis of the experimental results, the model improved in this paper has stronger segmentation ability, less missing data and false detection, and can effectively complete the dam surface crack segmentation task. The improved method increases the efficiency and accuracy of dam surface crack detection and reduces the model parameters. It can provide powerful data support for crack detection and the safe operation of hydropower projects.
[1] 孙金华.我国水库大坝安全管理成就及面临的挑战[J].中国水利, 2018(20):1-6. SUN J H. Achievements and challenges of reservoir dam safety management in China[J]. China Water Resources, 2018(20):1-6.(in Chinese). [2] 中华人民共和国水利部,中华人民共和国国家统计局.第一次全国水利普查公报[N].中国水利报, 2013-03-26(2). Ministry of Water Resources of the People's Republic of China, National Bureau of Statistics of the People's Republic of China. Bulletin of first national census for water[N]. China Water Resources News, 2013-03-26(2).(in Chinese). [3] BUFFI G, MANCIOLA P, GRASSI S, et al. Survey of the Ridracoli Dam:UAV-based photogrammetry and traditional topographic techniques in the inspection of vertical structures[J]. Geomatics, Natural Hazards and Risk, 2017, 8(2):1562-1579. [4] 朱道雄.水电站建筑物病害分析及处理措施研究:以宝珠寺电站、紫兰坝电站为例[D].宜昌:三峡大学, 2020. ZHU D X. Disease analysis and treatment measures of hydropower station buildings:Taking Baozhusi Hydropower Station and Zilanba Hydropower Station as examples[D]. Yichang:China Three Gorges University, 2020.(in Chinese). [5] FUJITA Y, HAMAMOTO Y. A robust automatic crack detection method from noisy concrete surfaces[J]. Machine Vision and Applications, 2011, 22(2):245-254. [6] LIU Y Q, YEOH J K W. Automated crack pattern recognition from images for condition assessment of concrete structures[J]. Automation in Construction, 2021, 128(5):103765. [7] LEE B Y, KIM Y Y, YI S T, et al. Automated image processing technique for detecting and analysing concrete surface cracks[J]. Structure and Infrastructure Engineering, 2013, 9(6):567-577. [8] NOH Y, KOO D, KANG Y M, et al. Automatic crack detection on concrete images using segmentation via fuzzy C-means clustering[C]//2017 International Conference on Applied System Innovation (ICASI). Sapporo, Japan:IEEE, 2017:877-880. [9] YAMAGUCHI T, HASHIMOTO S. Fast crack detection method for large-size concrete surface images using percolation-based image processing[J]. Machine Vision and Applications, 2010, 21(5):797-809. [10] QU Z, CHEN Y X, LIU L, et al. The algorithm of concrete surface crack detection based on the genetic programming and percolation model[J]. IEEE Access, 2019, 7(1):57592-57603. [11] PRASANNA P, DANA K J, GUCUNSKI N, et al. Automated crack detection on concrete bridges[J]. IEEE Transactions on Automation Science and Engineering, 2016, 13(2):591-599. [12] YANG Y S, YANG C M, HUANG C W. Thin crack observation in a reinforced concrete bridge pier test using image processing and analysis[J]. Advances in Engineering Software, 2015, 83(1):99-108. [13] 王一兵,包腾飞,高治鑫.基于LabVIEW+VDM的混凝土坝裂缝检测方法[J].水利水电科技进展, 2021, 41(5):76-82. WANG Y B, BAO T F, GAO Z X. Crack detection method of concrete dams based on LabVIEW+VDM[J]. Advances in Science and Technology of Water Resources, 2021, 41(5):76-82.(in Chinese) [14] KUMAR P, SHARMA A, KOTA S R. Automatic multiclass instance segmentation of concrete damage using deep learning model[J]. IEEE Access, 2021, 9:90330-90345. [15] LIU Z Q, CAO Y W, WANG Y Z, et al. Computer vision-based concrete crack detection using U-Net fully convolutional networks[J]. Automation in Construction, 2019, 104(1):129-139. [16] CHA Y J, CHOI W, BVYVKÖZTVRK O. Deep learning-based crack damage detection using convolutional neural networks[J]. Computer-Aided Civil and Infrastructure Engineering, 2017, 32(5):361-378. [17] ZHENG M J, LEI Z J, ZHANG K. Intelligent detection of building cracks based on deep learning[J]. Image and Vision Computing, 2020, 103(1):103987. [18] WU X Y, MA J F, SUN Y, et al. Multi-scale deep pixel distribution learning for concrete crack detection[C]//202025th International Conference on Pattern Recognition (ICPR). Milan, Italy:IEEE, 2021:6577-6583. [19] ALI R, CHUAH J H, TALIP M S A, et al. Structural crack detection using deep convolutional neural networks[J]. Automation in Construction, 2022, 133(1):103989. [20] PARK S E, EEM S H, JEON H. Concrete crack detection and quantification using deep learning and structured light[J]. Construction and Building Materials, 2020, 252(1):119096. [21] BHOWMICK S, NAGARAJAIAH S, VEERARAGHAVAN A. Vision and deep learning-based algorithms to detect and quantify cracks on concrete surfaces from UAV videos[J]. Sensors, 2020, 20(21):6299. [22] 王君锋,刘凡,杨赛,等.基于多源迁移学习的大坝裂缝检测[J].计算机科学, 2022, 49(6A):319-324. WANG J F, LIU F, YANG S, et al. Dam crack detection based on multi-source transfer learning[J]. Computer Science, 2022, 49(6A):319-324.(in Chinese) [23] CHEN L C, ZHU Y K, PAPANDREOU G, et al. Encoder-decoder with atrous separable convolution for semantic image segmentation[C]//Proceedings of the 15th European Conference on Computer Vision. Munich, Germany:Springer, 2018:833-851. [24] SANDLER M, HOWARD A, ZHU M L, et al. MobileNetV2:Inverted residuals and linear bottlenecks[C]//Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, USA:IEEE, 2018:4510-4520. [25] RONNEBERGER O, FISCHER P, BROX T. U-Net:Convolutional networks for biomedical image segmentation[C]//18th International Conference on Medical Image Computing and Computer-Assisted Intervention. Munich, Germany:Springer, 2015:234-241. [26] BADRINARAYANAN V, KENDALL A, CIPOLLA R. SegNet:A deep convolutional encoder-decoder architecture for image segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12):2481-2495. [27] ZHAO H S, SHI J P, QI X J, et al. Pyramid scene parsing network[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu, USA:IEEE, 2017:2881-2890. [28] CHEN L C, PAPANDREOU G, KOKKINOS I, et al. Semantic image segmentation with deep convolutional nets and fully connected CRFs[C/OL].(2014-12-22)[2022-09-21]. https://arxiv.org/abs/1412.7062. [29] CHEN L C, PAPANDREOU G, KOKKINOS I, et al. DeepLab:Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 40(4):834-848. [30] VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach, USA:ACM, 2017:6000-6010. [31] HU J, SHEN L, ALBANIE S, et al. Squeeze-and-excitation networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 42(8):2011-2023. [32] WOO S, PARK J, LEE J Y, et al. CBAM:Convolutional block attention module[C]//Proceedings of the 15th European Conference on Computer Vision (ECCV). Munich, Germany:Springer, 2018:3-19. [33] WANG Q L, WU B G, ZHU P F, et al. ECA-Net:Efficient channel attention for deep convolutional neural networks[C]//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle, USA:IEEE, 2020:11531-11539. [34] LONG J, SHELHAMER E, DARRELL T. Fully convolutional networks for semantic segmentation[C]//Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Boston, USA:IEEE, 2015:3431-3440.