Fusion-Based Damage Segmentation for Multimodal Building Façade Images from an End-to-End Perspective

Multimodal image data have found widespread applications in visual-based building façade damage detection in recent years, offering comprehensive inspection of façade surfaces with the assistance of drones and infrared thermography. However, the comprehensive integration of such complementary data h...

Full description

Saved in:

Bibliographic Details
Main Authors:	Pujin Wang, Jiehui Wang, Qiong Liu, Lin Fang, Jie Xiao
Format:	Article
Language:	English
Published:	MDPI AG 2024-12-01
Series:	Buildings
Subjects:	building façade damage detection multimodal image registration multimodal image fusion instance segmentation
Online Access:	https://www.mdpi.com/2075-5309/15/1/63
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841549385417097216
author	Pujin Wang Jiehui Wang Qiong Liu Lin Fang Jie Xiao
author_facet	Pujin Wang Jiehui Wang Qiong Liu Lin Fang Jie Xiao
author_sort	Pujin Wang
collection	DOAJ
description	Multimodal image data have found widespread applications in visual-based building façade damage detection in recent years, offering comprehensive inspection of façade surfaces with the assistance of drones and infrared thermography. However, the comprehensive integration of such complementary data has been hindered by low levels of automation due to the absence of properly developed methods, resulting in high cost and low efficiency. Thus, this paper proposes an automatic end-to-end building façade damage detection method by integrating multimodal image registration, infrared–visible image fusion (IVIF), and damage segmentation. An infrared and visible image dataset consisting of 1761 pairs encompassing 4 main types of façade damage has been constructed for processing and training. A novel infrared–visible image registration method using main orientation assignment for feature point extraction is developed, reaching a high RMSE of 14.35 to align the multimodal images. Then, a deep learning-based infrared–visible image fusion (IVIF) network is trained to preserve damage characteristics between the modalities. For damage detection, a relatively high mean average precision (mAP) result of 85.4% is achieved by comparing four instance segmentation models, affirming the effective utilization of IVIF results.
format	Article
id	doaj-art-dea1da22213c43bbaafc48e21c3af341
institution	Kabale University
issn	2075-5309
language	English
publishDate	2024-12-01
publisher	MDPI AG
record_format	Article
series	Buildings
spelling	doaj-art-dea1da22213c43bbaafc48e21c3af3412025-01-10T13:15:55ZengMDPI AGBuildings2075-53092024-12-011516310.3390/buildings15010063Fusion-Based Damage Segmentation for Multimodal Building Façade Images from an End-to-End PerspectivePujin Wang0Jiehui Wang1Qiong Liu2Lin Fang3Jie Xiao4College of Civil Engineering, Tongji University, Shanghai 200092, ChinaCollege of Civil and Transportation Engineering, Shenzhen University, Shenzhen 518060, ChinaSchool of Environment and Architecture, University of Shanghai for Science and Technology, Shanghai 200093, ChinaShanghai Housing Quality Inspection Station Co., Ltd., Shanghai 200061, ChinaSchool of Civil and Transportation Engineering, Guangdong University of Technology, Guangzhou 510006, ChinaMultimodal image data have found widespread applications in visual-based building façade damage detection in recent years, offering comprehensive inspection of façade surfaces with the assistance of drones and infrared thermography. However, the comprehensive integration of such complementary data has been hindered by low levels of automation due to the absence of properly developed methods, resulting in high cost and low efficiency. Thus, this paper proposes an automatic end-to-end building façade damage detection method by integrating multimodal image registration, infrared–visible image fusion (IVIF), and damage segmentation. An infrared and visible image dataset consisting of 1761 pairs encompassing 4 main types of façade damage has been constructed for processing and training. A novel infrared–visible image registration method using main orientation assignment for feature point extraction is developed, reaching a high RMSE of 14.35 to align the multimodal images. Then, a deep learning-based infrared–visible image fusion (IVIF) network is trained to preserve damage characteristics between the modalities. For damage detection, a relatively high mean average precision (mAP) result of 85.4% is achieved by comparing four instance segmentation models, affirming the effective utilization of IVIF results.https://www.mdpi.com/2075-5309/15/1/63building façadedamage detectionmultimodal image registrationmultimodal image fusioninstance segmentation
spellingShingle	Pujin Wang Jiehui Wang Qiong Liu Lin Fang Jie Xiao Fusion-Based Damage Segmentation for Multimodal Building Façade Images from an End-to-End Perspective Buildings building façade damage detection multimodal image registration multimodal image fusion instance segmentation
title	Fusion-Based Damage Segmentation for Multimodal Building Façade Images from an End-to-End Perspective
title_full	Fusion-Based Damage Segmentation for Multimodal Building Façade Images from an End-to-End Perspective
title_fullStr	Fusion-Based Damage Segmentation for Multimodal Building Façade Images from an End-to-End Perspective
title_full_unstemmed	Fusion-Based Damage Segmentation for Multimodal Building Façade Images from an End-to-End Perspective
title_short	Fusion-Based Damage Segmentation for Multimodal Building Façade Images from an End-to-End Perspective
title_sort	fusion based damage segmentation for multimodal building facade images from an end to end perspective
topic	building façade damage detection multimodal image registration multimodal image fusion instance segmentation
url	https://www.mdpi.com/2075-5309/15/1/63
work_keys_str_mv	AT pujinwang fusionbaseddamagesegmentationformultimodalbuildingfacadeimagesfromanendtoendperspective AT jiehuiwang fusionbaseddamagesegmentationformultimodalbuildingfacadeimagesfromanendtoendperspective AT qiongliu fusionbaseddamagesegmentationformultimodalbuildingfacadeimagesfromanendtoendperspective AT linfang fusionbaseddamagesegmentationformultimodalbuildingfacadeimagesfromanendtoendperspective AT jiexiao fusionbaseddamagesegmentationformultimodalbuildingfacadeimagesfromanendtoendperspective

Fusion-Based Damage Segmentation for Multimodal Building Façade Images from an End-to-End Perspective

Similar Items