River floating object detection with transformer model in real time
Abstract The DEtection TRansformer (DETR) and the YOLO series have been at the forefront of advancements in object detection. The RT-DETR, a member of the DETR family, has notably addressed the speed limitations of its predecessors by utilizing a high-performance hybrid encoder that optimizes query...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-03-01
|
| Series: | Scientific Reports |
| Online Access: | https://doi.org/10.1038/s41598-025-93659-1 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849774969667452928 |
|---|---|
| author | Chong Zhang Jie Yue Jianglong Fu Shouluan Wu |
| author_facet | Chong Zhang Jie Yue Jianglong Fu Shouluan Wu |
| author_sort | Chong Zhang |
| collection | DOAJ |
| description | Abstract The DEtection TRansformer (DETR) and the YOLO series have been at the forefront of advancements in object detection. The RT-DETR, a member of the DETR family, has notably addressed the speed limitations of its predecessors by utilizing a high-performance hybrid encoder that optimizes query selection. Building upon this foundation, we introduce the LR-DETR, a lightweight evolution of RT-DETR for river floating object detection. This model incorporates the High-level Screening-feature Path Aggregation Network (HS-PAN), which refines feature fusion through a novel bottom-up fusion path, significantly enhancing its expressive power. Further innovation is evident in the introduction of the Residual Partial Convolutional Network (RPCN) as the backbone, which selectively applies convolutions to key channels, leveraging the concept of residuals to reduce computational redundancy and enhance accuracy. The enhancement of the RepBlock with Conv3XCBlock, along with the integration of a parameter-free attention mechanism within the convolutional layers, underscores our commitment to efficiency, ensuring that the model prioritizes valuable information while suppressing redundancy. A comparative analysis with existing detection models not only validates the effectiveness of our approach but also highlights its superiority and adaptability. Our experimental findings are compelling: LR-DETR achieves a 5% increase in mean Average Precision (mAP) at an Intersection over Union (IoU) threshold of 0.5, a 25.8% reduction in parameter count, and a 22.8% decrease in GFLOPs, compared to the RT-DETR algorithm. These improvements are particularly pronounced in the real-time detection of river floating objects, showcasing LR-DETR’s potential in specific environmental monitoring scenarios. The project page: https://github.com/zcfanhua/LR-DETR . |
| format | Article |
| id | doaj-art-e5363c2b24e24320b3848da3eb016cbb |
| institution | DOAJ |
| issn | 2045-2322 |
| language | English |
| publishDate | 2025-03-01 |
| publisher | Nature Portfolio |
| record_format | Article |
| series | Scientific Reports |
| spelling | doaj-art-e5363c2b24e24320b3848da3eb016cbb2025-08-20T03:01:34ZengNature PortfolioScientific Reports2045-23222025-03-0115111610.1038/s41598-025-93659-1River floating object detection with transformer model in real timeChong Zhang0Jie Yue1Jianglong Fu2Shouluan Wu3HeBei University of ArchitectureHeBei University of ArchitectureHeBei University of ArchitectureJiangxi University of Science and TechnologyAbstract The DEtection TRansformer (DETR) and the YOLO series have been at the forefront of advancements in object detection. The RT-DETR, a member of the DETR family, has notably addressed the speed limitations of its predecessors by utilizing a high-performance hybrid encoder that optimizes query selection. Building upon this foundation, we introduce the LR-DETR, a lightweight evolution of RT-DETR for river floating object detection. This model incorporates the High-level Screening-feature Path Aggregation Network (HS-PAN), which refines feature fusion through a novel bottom-up fusion path, significantly enhancing its expressive power. Further innovation is evident in the introduction of the Residual Partial Convolutional Network (RPCN) as the backbone, which selectively applies convolutions to key channels, leveraging the concept of residuals to reduce computational redundancy and enhance accuracy. The enhancement of the RepBlock with Conv3XCBlock, along with the integration of a parameter-free attention mechanism within the convolutional layers, underscores our commitment to efficiency, ensuring that the model prioritizes valuable information while suppressing redundancy. A comparative analysis with existing detection models not only validates the effectiveness of our approach but also highlights its superiority and adaptability. Our experimental findings are compelling: LR-DETR achieves a 5% increase in mean Average Precision (mAP) at an Intersection over Union (IoU) threshold of 0.5, a 25.8% reduction in parameter count, and a 22.8% decrease in GFLOPs, compared to the RT-DETR algorithm. These improvements are particularly pronounced in the real-time detection of river floating objects, showcasing LR-DETR’s potential in specific environmental monitoring scenarios. The project page: https://github.com/zcfanhua/LR-DETR .https://doi.org/10.1038/s41598-025-93659-1 |
| spellingShingle | Chong Zhang Jie Yue Jianglong Fu Shouluan Wu River floating object detection with transformer model in real time Scientific Reports |
| title | River floating object detection with transformer model in real time |
| title_full | River floating object detection with transformer model in real time |
| title_fullStr | River floating object detection with transformer model in real time |
| title_full_unstemmed | River floating object detection with transformer model in real time |
| title_short | River floating object detection with transformer model in real time |
| title_sort | river floating object detection with transformer model in real time |
| url | https://doi.org/10.1038/s41598-025-93659-1 |
| work_keys_str_mv | AT chongzhang riverfloatingobjectdetectionwithtransformermodelinrealtime AT jieyue riverfloatingobjectdetectionwithtransformermodelinrealtime AT jianglongfu riverfloatingobjectdetectionwithtransformermodelinrealtime AT shouluanwu riverfloatingobjectdetectionwithtransformermodelinrealtime |