FGS-YOLOv8s-seg: A Lightweight and Efficient Instance Segmentation Model for Detecting Tomato Maturity Levels in Greenhouse Environments

In a greenhouse environment, the application of artificial intelligence technology for selective tomato harvesting still faces numerous challenges, including varying lighting, background interference, and indistinct fruit surface features. This study proposes an improved instance segmentation model...

Full description

Saved in:
Bibliographic Details
Main Authors: Dongfang Song, Ping Liu, Yanjun Zhu, Tianyuan Li, Kun Zhang
Format: Article
Language:English
Published: MDPI AG 2025-07-01
Series:Agronomy
Subjects:
Online Access:https://www.mdpi.com/2073-4395/15/7/1687
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In a greenhouse environment, the application of artificial intelligence technology for selective tomato harvesting still faces numerous challenges, including varying lighting, background interference, and indistinct fruit surface features. This study proposes an improved instance segmentation model called FGS-YOLOv8s-seg, which achieves accurate detection and maturity grading of tomatoes in greenhouse environments. The model incorporates a novel SegNext_Attention mechanism at the end of the backbone, while simultaneously replacing Bottleneck structures in the neck layer with FasterNet blocks and integrating Gaussian Context Transformer modules to form a lightweight C2f_FasterNet_GCT structure. Experiments show that this model performs significantly better than mainstream segmentation models in core indicators such as precision (86.9%), recall (76.3%), average precision (mAP<sub>@0.5</sub> 84.8%), F1-score (81.3%), and GFLOPs (35.6 M). Compared with the YOLOv8s-seg baseline model, these metrics show improvements of 2.6%, 3.8%, 5.1%, 3.3%, and 6.8 M, respectively. Ablation experiments demonstrate that the improved architecture contributes significantly to performance gains, with combined improvements yielding optimal results. The analysis of detection performance videos under different cultivation patterns demonstrates the generalizability of the improved model in complex environments, achieving an optimal balance between detection accuracy (86.9%) and inference speed (53.2 fps). This study provides a reliable technical solution for the selective harvesting of greenhouse tomatoes.
ISSN:2073-4395