BiAttentionNet: a dual-branch automatic driving image segmentation network integrating spatial and channel attention mechanisms
| Main Authors: | |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Nature Portfolio, 2025-04-01 |
| Series: | Scientific Reports |
| Online Access: | https://doi.org/10.1038/s41598-025-95470-4 |
| Summary: | Abstract Real-time semantic segmentation is among the most actively researched areas of computer vision, and dual-branch networks have become a popular direction in network architecture research. This paper proposes a dual-branch automatic-driving image segmentation network, named "BiAttentionNet", that integrates spatial and channel attention mechanisms. The network aims to balance accuracy and real-time performance by processing high-level semantic information and low-level detail information separately. BiAttentionNet consists of three main parts: the detail branch, the semantic branch, and the proposed attention-guided fusion layer. The detail branch extracts local and surrounding context features with the designed PCSD convolution module, which processes wide-channel low-level feature information. The semantic branch uses an improved lightweight UNet to extract semantic information from deep, narrow channels. Finally, the attention-guided fusion layer fuses the features of the two branches using detail attention and channel attention mechanisms to perform image segmentation in road scenes. In comparative experiments on the Cityscapes dataset against recent mainstream networks such as BiSeNet V2, Fast-SCNN, ConvNeXt, SegNeXt, SegFormer, and CGNet, BiAttentionNet achieves the highest mIoU, 65.89%, among the compared backbone networks. This validates the effectiveness of the proposed BiAttentionNet. |
| ISSN: | 2045-2322 |
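The abstract describes an attention-guided fusion layer that combines the two branches via detail (spatial) attention and channel attention. The record does not give the layer's exact equations, so the following is only a minimal NumPy sketch of one common pattern such fusion layers use: a sigmoid spatial gate derived from the detail branch and a sigmoid channel gate derived from globally pooled semantic features. The function name and gating choices are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def attention_guided_fusion(detail, semantic):
    """Illustrative dual-branch fusion (not BiAttentionNet's exact layer).

    detail, semantic: feature maps of shape (C, H, W).
    Each branch is gated by an attention map computed from the other:
    a per-pixel spatial gate from the detail branch and a per-channel
    gate from the semantic branch.
    """
    # Spatial (detail) attention: per-pixel gate from channel-averaged detail features
    spatial_gate = sigmoid(detail.mean(axis=0, keepdims=True))          # (1, H, W)
    # Channel attention: per-channel gate from globally pooled semantic features
    channel_gate = sigmoid(semantic.mean(axis=(1, 2), keepdims=True))   # (C, 1, 1)
    # Cross-gate the branches and sum them into the fused representation
    return detail * channel_gate + semantic * spatial_gate

rng = np.random.default_rng(0)
d = rng.standard_normal((8, 4, 4))   # detail-branch features
s = rng.standard_normal((8, 4, 4))   # semantic-branch features
fused = attention_guided_fusion(d, s)
print(fused.shape)  # (8, 4, 4)
```

Broadcasting keeps the fused output at the input shape, so a segmentation head can be applied directly afterwards; in the actual network the gates would be learned rather than parameter-free means.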