A lightweight and optimized deep learning model for detecting banana bunches and stalks in autonomous harvesting vehicles
Developing algorithms to identify fruit cutting locations is important for the functionality of harvesting robots. However, existing studies often rely on multi-stage detection processes. This complicates system design and hinders real-time performance. To address these challenges, this study propos...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Elsevier
2025-08-01
|
| Series: | Smart Agricultural Technology |
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S2772375525002849 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Developing algorithms to identify fruit cutting locations is important for the functionality of harvesting robots. However, existing studies often rely on multi-stage detection processes. This complicates system design and hinders real-time performance. To address these challenges, this study proposes a novel detection model for banana-harvesting robots. The model simultaneously detects banana bunches and stalks in orchard environments. It is built upon the YOLOv8n (You Only Look Once version 8 nano) baseline and includes enhancements to improve accuracy while preserving a lightweight architecture. Specifically, the standard convolution layers are upgraded with a lightweight group-shuffle convolution module, reducing complexity while preserving efficiency. Additionally, a novel C2f-fast efficient channel attention module is proposed in the backbone, significantly enhancing the model's feature extraction capabilities. Furthermore, the bidirectional feature pyramid network is introduced in the original neck network, improving feature aggregation and adaptability to varying environmental conditions. Experimental results demonstrate that the proposed model achieves performance, with precision, recall, and mAP50 metrics of 96.3 %, 90 %, and 94.5 %, respectively, exceeding the baseline model by 0.5 %, 2.6 %, and 1 %. Moreover, the parameters and size of the proposed model are optimized to 1.7 M and 3.7 MB, reflecting reductions of 43.3 % and 40.3 %, respectively, in comparison to the baseline. Notably, the proposed model outperforms the previous detection models, offering high accuracy while optimizing computational efficiency. These advancements make the proposed model highly suitable for deployment on embedded systems in agricultural robots. |
|---|---|
| ISSN: | 2772-3755 |