D<sup>3</sup>-YOLOv10: Improved YOLOv10-Based Lightweight Tomato Detection Algorithm Under Facility Scenario

Accurate and efficient tomato detection is one of the key techniques for intelligent automatic picking in the area of precision agriculture. However, under the facility scenario, existing detection algorithms still have challenging problems such as weak feature extraction ability for occlusion condi...

Full description

Saved in:

Bibliographic Details
Main Authors:	Ao Li, Chunrui Wang, Tongtong Ji, Qiyang Wang, Tianxue Zhang
Format:	Article
Language:	English
Published:	MDPI AG 2024-12-01
Series:	Agriculture
Subjects:	tomato detection YOLOv10 occlusion recognition attention mechanism knowledge distillation
Online Access:	https://www.mdpi.com/2077-0472/14/12/2268
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850050666949509120
author	Ao Li Chunrui Wang Tongtong Ji Qiyang Wang Tianxue Zhang
author_facet	Ao Li Chunrui Wang Tongtong Ji Qiyang Wang Tianxue Zhang
author_sort	Ao Li
collection	DOAJ
description	Accurate and efficient tomato detection is one of the key techniques for intelligent automatic picking in the area of precision agriculture. However, under the facility scenario, existing detection algorithms still have challenging problems such as weak feature extraction ability for occlusion conditions and different fruit sizes, low accuracy on edge location, and heavy model parameters. To address these problems, this paper proposed D<sup>3</sup>-YOLOv10, a lightweight YOLOv10-based detection framework. Initially, a compact dynamic faster network (DyFasterNet) was developed, where multiple adaptive convolution kernels are aggregated to extract local effective features for fruit size adaption. Additionally, the deformable large kernel attention mechanism (D-LKA) was designed for the terminal phase of the neck network by adaptively adjusting the receptive field to focus on irregular tomato deformations and occlusions. Then, to further improve detection boundary accuracy and convergence, a dynamic FM-WIoU regression loss with a scaling factor was proposed. Finally, a knowledge distillation scheme using semantic frequency prompts was developed to optimize the model for lightweight deployment in practical applications. We evaluated the proposed framework using a self-made tomato dataset and designed a two-stage category balancing method based on diffusion models to address the sample class-imbalanced issue. The experimental results demonstrated that the D<sup>3</sup>-YOLOv10 model achieved an <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>m</mi><mi>A</mi><msub><mi>P</mi><mrow><mn>0.5</mn></mrow></msub></mrow></semantics></math></inline-formula> of 91.8%, with a substantial reduction of 54.0% in parameters and 64.9% in FLOPs, compared to the benchmark model. Meanwhile, the detection speed of 80.1 FPS more effectively meets the demand for real-time tomato detection. This study can effectively contribute to the advancement of smart agriculture research on the detection of fruit targets.
format	Article
id	doaj-art-e0ae85131a1f401fb69b2bb3ad230a8d
institution	DOAJ
issn	2077-0472
language	English
publishDate	2024-12-01
publisher	MDPI AG
record_format	Article
series	Agriculture
spelling	doaj-art-e0ae85131a1f401fb69b2bb3ad230a8d2025-08-20T02:53:22ZengMDPI AGAgriculture2077-04722024-12-011412226810.3390/agriculture14122268D<sup>3</sup>-YOLOv10: Improved YOLOv10-Based Lightweight Tomato Detection Algorithm Under Facility ScenarioAo Li0Chunrui Wang1Tongtong Ji2Qiyang Wang3Tianxue Zhang4School of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, ChinaSchool of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, ChinaSchool of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, ChinaSchool of Agricultural Engineering, Jiangsu University, Zhenjiang 212013, ChinaSchool of Mechanical Engineering and Automation, Beihang University, Beijing 100191, ChinaAccurate and efficient tomato detection is one of the key techniques for intelligent automatic picking in the area of precision agriculture. However, under the facility scenario, existing detection algorithms still have challenging problems such as weak feature extraction ability for occlusion conditions and different fruit sizes, low accuracy on edge location, and heavy model parameters. To address these problems, this paper proposed D<sup>3</sup>-YOLOv10, a lightweight YOLOv10-based detection framework. Initially, a compact dynamic faster network (DyFasterNet) was developed, where multiple adaptive convolution kernels are aggregated to extract local effective features for fruit size adaption. Additionally, the deformable large kernel attention mechanism (D-LKA) was designed for the terminal phase of the neck network by adaptively adjusting the receptive field to focus on irregular tomato deformations and occlusions. Then, to further improve detection boundary accuracy and convergence, a dynamic FM-WIoU regression loss with a scaling factor was proposed. Finally, a knowledge distillation scheme using semantic frequency prompts was developed to optimize the model for lightweight deployment in practical applications. We evaluated the proposed framework using a self-made tomato dataset and designed a two-stage category balancing method based on diffusion models to address the sample class-imbalanced issue. The experimental results demonstrated that the D<sup>3</sup>-YOLOv10 model achieved an <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>m</mi><mi>A</mi><msub><mi>P</mi><mrow><mn>0.5</mn></mrow></msub></mrow></semantics></math></inline-formula> of 91.8%, with a substantial reduction of 54.0% in parameters and 64.9% in FLOPs, compared to the benchmark model. Meanwhile, the detection speed of 80.1 FPS more effectively meets the demand for real-time tomato detection. This study can effectively contribute to the advancement of smart agriculture research on the detection of fruit targets.https://www.mdpi.com/2077-0472/14/12/2268tomato detectionYOLOv10occlusion recognitionattention mechanismknowledge distillation
spellingShingle	Ao Li Chunrui Wang Tongtong Ji Qiyang Wang Tianxue Zhang D<sup>3</sup>-YOLOv10: Improved YOLOv10-Based Lightweight Tomato Detection Algorithm Under Facility Scenario Agriculture tomato detection YOLOv10 occlusion recognition attention mechanism knowledge distillation
title	D<sup>3</sup>-YOLOv10: Improved YOLOv10-Based Lightweight Tomato Detection Algorithm Under Facility Scenario
title_full	D<sup>3</sup>-YOLOv10: Improved YOLOv10-Based Lightweight Tomato Detection Algorithm Under Facility Scenario
title_fullStr	D<sup>3</sup>-YOLOv10: Improved YOLOv10-Based Lightweight Tomato Detection Algorithm Under Facility Scenario
title_full_unstemmed	D<sup>3</sup>-YOLOv10: Improved YOLOv10-Based Lightweight Tomato Detection Algorithm Under Facility Scenario
title_short	D<sup>3</sup>-YOLOv10: Improved YOLOv10-Based Lightweight Tomato Detection Algorithm Under Facility Scenario
title_sort	d sup 3 sup yolov10 improved yolov10 based lightweight tomato detection algorithm under facility scenario
topic	tomato detection YOLOv10 occlusion recognition attention mechanism knowledge distillation
url	https://www.mdpi.com/2077-0472/14/12/2268
work_keys_str_mv	AT aoli dsup3supyolov10improvedyolov10basedlightweighttomatodetectionalgorithmunderfacilityscenario AT chunruiwang dsup3supyolov10improvedyolov10basedlightweighttomatodetectionalgorithmunderfacilityscenario AT tongtongji dsup3supyolov10improvedyolov10basedlightweighttomatodetectionalgorithmunderfacilityscenario AT qiyangwang dsup3supyolov10improvedyolov10basedlightweighttomatodetectionalgorithmunderfacilityscenario AT tianxuezhang dsup3supyolov10improvedyolov10basedlightweighttomatodetectionalgorithmunderfacilityscenario

D<sup>3</sup>-YOLOv10: Improved YOLOv10-Based Lightweight Tomato Detection Algorithm Under Facility Scenario

Similar Items