River floating object detection with transformer model in real time

Abstract The DEtection TRansformer (DETR) and the YOLO series have been at the forefront of advancements in object detection. The RT-DETR, a member of the DETR family, has notably addressed the speed limitations of its predecessors by utilizing a high-performance hybrid encoder that optimizes query...

Full description

Saved in:

Bibliographic Details
Main Authors:	Chong Zhang, Jie Yue, Jianglong Fu, Shouluan Wu
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-03-01
Series:	Scientific Reports
Online Access:	https://doi.org/10.1038/s41598-025-93659-1
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849774969667452928
author	Chong Zhang Jie Yue Jianglong Fu Shouluan Wu
author_facet	Chong Zhang Jie Yue Jianglong Fu Shouluan Wu
author_sort	Chong Zhang
collection	DOAJ
description	Abstract The DEtection TRansformer (DETR) and the YOLO series have been at the forefront of advancements in object detection. The RT-DETR, a member of the DETR family, has notably addressed the speed limitations of its predecessors by utilizing a high-performance hybrid encoder that optimizes query selection. Building upon this foundation, we introduce the LR-DETR, a lightweight evolution of RT-DETR for river floating object detection. This model incorporates the High-level Screening-feature Path Aggregation Network (HS-PAN), which refines feature fusion through a novel bottom-up fusion path, significantly enhancing its expressive power. Further innovation is evident in the introduction of the Residual Partial Convolutional Network (RPCN) as the backbone, which selectively applies convolutions to key channels, leveraging the concept of residuals to reduce computational redundancy and enhance accuracy. The enhancement of the RepBlock with Conv3XCBlock, along with the integration of a parameter-free attention mechanism within the convolutional layers, underscores our commitment to efficiency, ensuring that the model prioritizes valuable information while suppressing redundancy. A comparative analysis with existing detection models not only validates the effectiveness of our approach but also highlights its superiority and adaptability. Our experimental findings are compelling: LR-DETR achieves a 5% increase in mean Average Precision (mAP) at an Intersection over Union (IoU) threshold of 0.5, a 25.8% reduction in parameter count, and a 22.8% decrease in GFLOPs, compared to the RT-DETR algorithm. These improvements are particularly pronounced in the real-time detection of river floating objects, showcasing LR-DETR’s potential in specific environmental monitoring scenarios. The project page: https://github.com/zcfanhua/LR-DETR .
format	Article
id	doaj-art-e5363c2b24e24320b3848da3eb016cbb
institution	DOAJ
issn	2045-2322
language	English
publishDate	2025-03-01
publisher	Nature Portfolio
record_format	Article
series	Scientific Reports
spelling	doaj-art-e5363c2b24e24320b3848da3eb016cbb2025-08-20T03:01:34ZengNature PortfolioScientific Reports2045-23222025-03-0115111610.1038/s41598-025-93659-1River floating object detection with transformer model in real timeChong Zhang0Jie Yue1Jianglong Fu2Shouluan Wu3HeBei University of ArchitectureHeBei University of ArchitectureHeBei University of ArchitectureJiangxi University of Science and TechnologyAbstract The DEtection TRansformer (DETR) and the YOLO series have been at the forefront of advancements in object detection. The RT-DETR, a member of the DETR family, has notably addressed the speed limitations of its predecessors by utilizing a high-performance hybrid encoder that optimizes query selection. Building upon this foundation, we introduce the LR-DETR, a lightweight evolution of RT-DETR for river floating object detection. This model incorporates the High-level Screening-feature Path Aggregation Network (HS-PAN), which refines feature fusion through a novel bottom-up fusion path, significantly enhancing its expressive power. Further innovation is evident in the introduction of the Residual Partial Convolutional Network (RPCN) as the backbone, which selectively applies convolutions to key channels, leveraging the concept of residuals to reduce computational redundancy and enhance accuracy. The enhancement of the RepBlock with Conv3XCBlock, along with the integration of a parameter-free attention mechanism within the convolutional layers, underscores our commitment to efficiency, ensuring that the model prioritizes valuable information while suppressing redundancy. A comparative analysis with existing detection models not only validates the effectiveness of our approach but also highlights its superiority and adaptability. Our experimental findings are compelling: LR-DETR achieves a 5% increase in mean Average Precision (mAP) at an Intersection over Union (IoU) threshold of 0.5, a 25.8% reduction in parameter count, and a 22.8% decrease in GFLOPs, compared to the RT-DETR algorithm. These improvements are particularly pronounced in the real-time detection of river floating objects, showcasing LR-DETR’s potential in specific environmental monitoring scenarios. The project page: https://github.com/zcfanhua/LR-DETR .https://doi.org/10.1038/s41598-025-93659-1
spellingShingle	Chong Zhang Jie Yue Jianglong Fu Shouluan Wu River floating object detection with transformer model in real time Scientific Reports
title	River floating object detection with transformer model in real time
title_full	River floating object detection with transformer model in real time
title_fullStr	River floating object detection with transformer model in real time
title_full_unstemmed	River floating object detection with transformer model in real time
title_short	River floating object detection with transformer model in real time
title_sort	river floating object detection with transformer model in real time
url	https://doi.org/10.1038/s41598-025-93659-1
work_keys_str_mv	AT chongzhang riverfloatingobjectdetectionwithtransformermodelinrealtime AT jieyue riverfloatingobjectdetectionwithtransformermodelinrealtime AT jianglongfu riverfloatingobjectdetectionwithtransformermodelinrealtime AT shouluanwu riverfloatingobjectdetectionwithtransformermodelinrealtime

River floating object detection with transformer model in real time

Similar Items