Semantic segmentation of camouflage objects via fusing reconstructed multispectral and RGB images

Accurate segmentation of camouflage objects in aerial imagery is vital for improving the efficiency of UAV-based reconnaissance and rescue missions. However, camouflage object segmentation is increasingly challenging due to advances in both camouflage materials and biological mimicry. Although multi...

Full description

Saved in:
Bibliographic Details
Main Authors: Feng Huang, Gonghan Yang, Jing Chen, Yixuan Xu, Jingze Su, Guimin Huang, Shu Wang, Wenxi Liu
Format: Article
Language:English
Published: KeAi Communications Co., Ltd. 2025-08-01
Series:Defence Technology
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2214914725001229
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Accurate segmentation of camouflage objects in aerial imagery is vital for improving the efficiency of UAV-based reconnaissance and rescue missions. However, camouflage object segmentation is increasingly challenging due to advances in both camouflage materials and biological mimicry. Although multispectral-RGB based technology shows promise, conventional dual-aperture multispectral-RGB imaging systems are constrained by imprecise and time-consuming registration and fusion across different modalities, limiting their performance. Here, we propose the Reconstructed Multispectral-RGB Fusion Network (RMRF-Net), which reconstructs RGB images into multispectral ones, enabling efficient multimodal segmentation using only an RGB camera. Specifically, RMRF-Net employs a divergent-similarity feature correction strategy to minimize reconstruction errors and includes an efficient boundary-aware decoder to enhance object contours. Notably, we establish the first real-world aerial multispectral-RGB semantic segmentation of camouflage objects dataset, including 11 object categories. Experimental results demonstrate that RMRF-Net outperforms existing methods, achieving 17.38 FPS on the NVIDIA Jetson AGX Orin, with only a 0.96% drop in mIoU compared to the RTX 3090, showing its practical applicability in multimodal remote sensing.
ISSN:2214-9147