CGDINet: A Deep Learning-Based Salient Object Detection Algorithm

Salient object detection (SOD) is a key preprocessing step in computer vision, widely used in object tracking, action recognition, and image retrieval, among other fields. However, traditional SOD algorithms often face issues such as rough object boundaries, incomplete extraction of global image fea...

Full description

Saved in:
Bibliographic Details
Main Authors: Chengyu Hu, Jianxin Guo, Hanfei Xie, Qing Zhu, Baoxi Yuan, Yujie Gao, Xiangyang Ma, Jialu Chen, Juan Tian
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10820319/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Salient object detection (SOD) is a key preprocessing step in computer vision, widely used in object tracking, action recognition, and image retrieval, among other fields. However, traditional SOD algorithms often face issues such as rough object boundaries, incomplete extraction of global image features, and insufficient attention to key areas. To address these problems, an improved significance object detection network&#x2014;CGDINet (Coordinate Attention-Group Consensus Aggregation Module-Depth Auxiliary Module-Inverse Saliency Pyramid Reconstruction Network)&#x2014;is proposed. CGDINet introduces the Group Consensus Aggregation Module (GCAM) embedded with the Coordinate Attention (CA) mechanism, named CAGM (Coordinate Attention-Group Consensus Aggregation Module), to enhance global feature capture capabilities and improve the processing of directional features. Additionally, the Depth Auxiliary Module (DAM) is incorporated to enhance the focus on important regions. Experiments were conducted on five public datasets (DUT-OMRON, ECSSD, PASCAL-S, DUTS-TE, and HKU-IS). The results show that CGDINet outperforms other mainstream significance object detection models in evaluation metrics such as <inline-formula> <tex-math notation="LaTeX">${\mathrm {maxF}}_{\mathrm {\beta }}$ </tex-math></inline-formula>, <inline-formula> <tex-math notation="LaTeX">$\mathrm {S}_{\mathrm {\alpha }}$ </tex-math></inline-formula>, and MAE, with almost no increase in computational cost (FLOPs) and parameters. The experimental results validate that CGDINet can effectively address the issues of incomplete global feature extraction and insufficient attention to key areas, thereby significantly enhancing the performance of significance object detection.
ISSN:2169-3536