An improved U-net and attention mechanism-based model for sugar beet and weed segmentation

IntroductionWeeds are a major factor affecting crop yield and quality. Accurate identification and localization of crops and weeds are essential for achieving automated weed management in precision agriculture, especially given the challenges in recognition accuracy and real-time processing in compl...

Full description

Saved in:
Bibliographic Details
Main Authors: Yadong Li, Ruinan Guo, Rujia Li, Rongbiao Ji, Mengyao Wu, Dinghao Chen, Cong Han, Ruilin Han, Yongxiu Liu, Yuwen Ruan, Jianping Yang
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-01-01
Series:Frontiers in Plant Science
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fpls.2024.1449514/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841543795541278720
author Yadong Li
Ruinan Guo
Rujia Li
Rongbiao Ji
Mengyao Wu
Dinghao Chen
Cong Han
Ruilin Han
Yongxiu Liu
Yuwen Ruan
Jianping Yang
Jianping Yang
author_facet Yadong Li
Ruinan Guo
Rujia Li
Rongbiao Ji
Mengyao Wu
Dinghao Chen
Cong Han
Ruilin Han
Yongxiu Liu
Yuwen Ruan
Jianping Yang
Jianping Yang
author_sort Yadong Li
collection DOAJ
description IntroductionWeeds are a major factor affecting crop yield and quality. Accurate identification and localization of crops and weeds are essential for achieving automated weed management in precision agriculture, especially given the challenges in recognition accuracy and real-time processing in complex field environments. To address this issue, this paper proposes an efficient crop-weed segmentation model based on an improved UNet architecture and attention mechanisms to enhance both recognition accuracy and processing speed.MethodsThe model adopts the encoder-decoder structure of UNet, utilizing MaxViT (Multi-Axis Vision Transformer) as the encoder to capture both global and local features within images. Additionally, CBAM (Convolutional Block Attention Module) is incorporated into the decoder as a multi-scale feature fusion module, adaptively adjusting feature map weights to enable the model to focus more accurately on the edges and textures of crops and weeds.Results and discussionExperimental results show that the proposed model achieved 84.28% mIoU and 88.59% mPA on the sugar beet dataset, representing improvements of 3.08% and 3.15% over the baseline UNet model, respectively, and outperforming mainstream models such as FCN, PSPNet, SegFormer, DeepLabv3+, and HRNet. Moreover, the model’s inference time is only 0.0559 seconds, reducing computational overhead while maintaining high accuracy. Its performance on a sunflower dataset further verifies the model’s generalizability and robustness. This study, therefore, provides an efficient and accurate solution for crop-weed segmentation, laying a foundation for future research on automated crop and weed identification.
format Article
id doaj-art-dca311b612e94a39a32153b9986124a3
institution Kabale University
issn 1664-462X
language English
publishDate 2025-01-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Plant Science
spelling doaj-art-dca311b612e94a39a32153b9986124a32025-01-13T06:11:05ZengFrontiers Media S.A.Frontiers in Plant Science1664-462X2025-01-011510.3389/fpls.2024.14495141449514An improved U-net and attention mechanism-based model for sugar beet and weed segmentationYadong Li0Ruinan Guo1Rujia Li2Rongbiao Ji3Mengyao Wu4Dinghao Chen5Cong Han6Ruilin Han7Yongxiu Liu8Yuwen Ruan9Jianping Yang10Jianping Yang11College of Big Data, Yunnan Agricultural University, Kunming, ChinaModern Educational Technology Center, Yunnan Agricultural University, Kunming, ChinaSchool of Information Technology and Artificial Intelligence, Zhejiang University of Finance and Economics, Hangzhou, ChinaCollege of Big Data, Yunnan Agricultural University, Kunming, ChinaCollege of Big Data, Yunnan Agricultural University, Kunming, ChinaCollege of Big Data, Baoshan University, Baoshan, ChinaCollege of Economics and Management, Yunnan Agricultural University, Kunming, ChinaCollege of Economics and Management, Yunnan Agricultural University, Kunming, ChinaCollege of plant protection, Yunnan Agricultural University, Kunming, ChinaFaculty of Sciences, University of Copenhagen, Copenhagen, DenmarkCollege of Big Data, Yunnan Agricultural University, Kunming, ChinaThe Key Laboratory for Crop Production and Smart Agriculture of Yunnan Province, Yunnan Agricultural University, Kunming, ChinaIntroductionWeeds are a major factor affecting crop yield and quality. Accurate identification and localization of crops and weeds are essential for achieving automated weed management in precision agriculture, especially given the challenges in recognition accuracy and real-time processing in complex field environments. To address this issue, this paper proposes an efficient crop-weed segmentation model based on an improved UNet architecture and attention mechanisms to enhance both recognition accuracy and processing speed.MethodsThe model adopts the encoder-decoder structure of UNet, utilizing MaxViT (Multi-Axis Vision Transformer) as the encoder to capture both global and local features within images. Additionally, CBAM (Convolutional Block Attention Module) is incorporated into the decoder as a multi-scale feature fusion module, adaptively adjusting feature map weights to enable the model to focus more accurately on the edges and textures of crops and weeds.Results and discussionExperimental results show that the proposed model achieved 84.28% mIoU and 88.59% mPA on the sugar beet dataset, representing improvements of 3.08% and 3.15% over the baseline UNet model, respectively, and outperforming mainstream models such as FCN, PSPNet, SegFormer, DeepLabv3+, and HRNet. Moreover, the model’s inference time is only 0.0559 seconds, reducing computational overhead while maintaining high accuracy. Its performance on a sunflower dataset further verifies the model’s generalizability and robustness. This study, therefore, provides an efficient and accurate solution for crop-weed segmentation, laying a foundation for future research on automated crop and weed identification.https://www.frontiersin.org/articles/10.3389/fpls.2024.1449514/fullsemantic segmentationUNETdeep learningMaxViTCBAMattention mechanism
spellingShingle Yadong Li
Ruinan Guo
Rujia Li
Rongbiao Ji
Mengyao Wu
Dinghao Chen
Cong Han
Ruilin Han
Yongxiu Liu
Yuwen Ruan
Jianping Yang
Jianping Yang
An improved U-net and attention mechanism-based model for sugar beet and weed segmentation
Frontiers in Plant Science
semantic segmentation
UNET
deep learning
MaxViT
CBAM
attention mechanism
title An improved U-net and attention mechanism-based model for sugar beet and weed segmentation
title_full An improved U-net and attention mechanism-based model for sugar beet and weed segmentation
title_fullStr An improved U-net and attention mechanism-based model for sugar beet and weed segmentation
title_full_unstemmed An improved U-net and attention mechanism-based model for sugar beet and weed segmentation
title_short An improved U-net and attention mechanism-based model for sugar beet and weed segmentation
title_sort improved u net and attention mechanism based model for sugar beet and weed segmentation
topic semantic segmentation
UNET
deep learning
MaxViT
CBAM
attention mechanism
url https://www.frontiersin.org/articles/10.3389/fpls.2024.1449514/full
work_keys_str_mv AT yadongli animprovedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT ruinanguo animprovedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT rujiali animprovedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT rongbiaoji animprovedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT mengyaowu animprovedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT dinghaochen animprovedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT conghan animprovedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT ruilinhan animprovedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT yongxiuliu animprovedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT yuwenruan animprovedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT jianpingyang animprovedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT jianpingyang animprovedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT yadongli improvedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT ruinanguo improvedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT rujiali improvedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT rongbiaoji improvedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT mengyaowu improvedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT dinghaochen improvedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT conghan improvedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT ruilinhan improvedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT yongxiuliu improvedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT yuwenruan improvedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT jianpingyang improvedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation
AT jianpingyang improvedunetandattentionmechanismbasedmodelforsugarbeetandweedsegmentation