River floating object detection with transformer model in real time

Abstract The DEtection TRansformer (DETR) and the YOLO series have been at the forefront of advancements in object detection. The RT-DETR, a member of the DETR family, has notably addressed the speed limitations of its predecessors by utilizing a high-performance hybrid encoder that optimizes query...

Full description

Saved in:
Bibliographic Details
Main Authors: Chong Zhang, Jie Yue, Jianglong Fu, Shouluan Wu
Format: Article
Language:English
Published: Nature Portfolio 2025-03-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-025-93659-1
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849774969667452928
author Chong Zhang
Jie Yue
Jianglong Fu
Shouluan Wu
author_facet Chong Zhang
Jie Yue
Jianglong Fu
Shouluan Wu
author_sort Chong Zhang
collection DOAJ
description Abstract The DEtection TRansformer (DETR) and the YOLO series have been at the forefront of advancements in object detection. The RT-DETR, a member of the DETR family, has notably addressed the speed limitations of its predecessors by utilizing a high-performance hybrid encoder that optimizes query selection. Building upon this foundation, we introduce the LR-DETR, a lightweight evolution of RT-DETR for river floating object detection. This model incorporates the High-level Screening-feature Path Aggregation Network (HS-PAN), which refines feature fusion through a novel bottom-up fusion path, significantly enhancing its expressive power. Further innovation is evident in the introduction of the Residual Partial Convolutional Network (RPCN) as the backbone, which selectively applies convolutions to key channels, leveraging the concept of residuals to reduce computational redundancy and enhance accuracy. The enhancement of the RepBlock with Conv3XCBlock, along with the integration of a parameter-free attention mechanism within the convolutional layers, underscores our commitment to efficiency, ensuring that the model prioritizes valuable information while suppressing redundancy. A comparative analysis with existing detection models not only validates the effectiveness of our approach but also highlights its superiority and adaptability. Our experimental findings are compelling: LR-DETR achieves a 5% increase in mean Average Precision (mAP) at an Intersection over Union (IoU) threshold of 0.5, a 25.8% reduction in parameter count, and a 22.8% decrease in GFLOPs, compared to the RT-DETR algorithm. These improvements are particularly pronounced in the real-time detection of river floating objects, showcasing LR-DETR’s potential in specific environmental monitoring scenarios. The project page: https://github.com/zcfanhua/LR-DETR .
format Article
id doaj-art-e5363c2b24e24320b3848da3eb016cbb
institution DOAJ
issn 2045-2322
language English
publishDate 2025-03-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-e5363c2b24e24320b3848da3eb016cbb2025-08-20T03:01:34ZengNature PortfolioScientific Reports2045-23222025-03-0115111610.1038/s41598-025-93659-1River floating object detection with transformer model in real timeChong Zhang0Jie Yue1Jianglong Fu2Shouluan Wu3HeBei University of ArchitectureHeBei University of ArchitectureHeBei University of ArchitectureJiangxi University of Science and TechnologyAbstract The DEtection TRansformer (DETR) and the YOLO series have been at the forefront of advancements in object detection. The RT-DETR, a member of the DETR family, has notably addressed the speed limitations of its predecessors by utilizing a high-performance hybrid encoder that optimizes query selection. Building upon this foundation, we introduce the LR-DETR, a lightweight evolution of RT-DETR for river floating object detection. This model incorporates the High-level Screening-feature Path Aggregation Network (HS-PAN), which refines feature fusion through a novel bottom-up fusion path, significantly enhancing its expressive power. Further innovation is evident in the introduction of the Residual Partial Convolutional Network (RPCN) as the backbone, which selectively applies convolutions to key channels, leveraging the concept of residuals to reduce computational redundancy and enhance accuracy. The enhancement of the RepBlock with Conv3XCBlock, along with the integration of a parameter-free attention mechanism within the convolutional layers, underscores our commitment to efficiency, ensuring that the model prioritizes valuable information while suppressing redundancy. A comparative analysis with existing detection models not only validates the effectiveness of our approach but also highlights its superiority and adaptability. Our experimental findings are compelling: LR-DETR achieves a 5% increase in mean Average Precision (mAP) at an Intersection over Union (IoU) threshold of 0.5, a 25.8% reduction in parameter count, and a 22.8% decrease in GFLOPs, compared to the RT-DETR algorithm. These improvements are particularly pronounced in the real-time detection of river floating objects, showcasing LR-DETR’s potential in specific environmental monitoring scenarios. The project page: https://github.com/zcfanhua/LR-DETR .https://doi.org/10.1038/s41598-025-93659-1
spellingShingle Chong Zhang
Jie Yue
Jianglong Fu
Shouluan Wu
River floating object detection with transformer model in real time
Scientific Reports
title River floating object detection with transformer model in real time
title_full River floating object detection with transformer model in real time
title_fullStr River floating object detection with transformer model in real time
title_full_unstemmed River floating object detection with transformer model in real time
title_short River floating object detection with transformer model in real time
title_sort river floating object detection with transformer model in real time
url https://doi.org/10.1038/s41598-025-93659-1
work_keys_str_mv AT chongzhang riverfloatingobjectdetectionwithtransformermodelinrealtime
AT jieyue riverfloatingobjectdetectionwithtransformermodelinrealtime
AT jianglongfu riverfloatingobjectdetectionwithtransformermodelinrealtime
AT shouluanwu riverfloatingobjectdetectionwithtransformermodelinrealtime