Swin Attention Augmented Residual Network: a fine-grained pest image recognition method

Pest infestation is a major cause of crop losses and a significant factor contributing to agricultural economic damage. Accurate identification of pests is therefore critical to ensuring crop safety. However, existing pest recognition methods often struggle to distinguish fine-grained visual differe...

Full description

Saved in:
Bibliographic Details
Main Authors: Xiang Wang, Zhiyong Xiao, Zhaohong Deng
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-06-01
Series:Frontiers in Plant Science
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fpls.2025.1619551/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850117860933763072
author Xiang Wang
Zhiyong Xiao
Zhaohong Deng
author_facet Xiang Wang
Zhiyong Xiao
Zhaohong Deng
author_sort Xiang Wang
collection DOAJ
description Pest infestation is a major cause of crop losses and a significant factor contributing to agricultural economic damage. Accurate identification of pests is therefore critical to ensuring crop safety. However, existing pest recognition methods often struggle to distinguish fine-grained visual differences between pest species and are susceptible to background interference from crops and environments. To address these challenges, we propose an improved pest identification method based on the Swin Transformer architecture, named Swin-AARNet (Attention Augmented Residual Network). Our method achieves efficient and accurate pest recognition. On the one hand, Swin-AARNet enhances local key features and establishes a feature complementation mechanism, thereby improving the extraction capability of local features. On the other hand, it integrates multi-scale information to effectively alleviate the problem of fine-grained feature ambiguity or loss. Furthermore, Swin-AARNet attained a classification accuracy of 78.77% on IP102, the largest publicly available pest dataset to date. To further validate its effectiveness and generalization ability, we conducted additional training and evaluation on the citrus benchmark dataset CPB and Li, achieving impressive accuracies of 82.17% and 99.48%, respectively. SwinAARNet demonstrates strong capability in distinguishing pests with highly similar appearances while remaining robust against complex and variable backgrounds. This makes it a promising tool for enhancing agricultural safety management, including crop environment monitoring and early invasion warning. Compared with other state-of-the-art models, our proposed method exhibits superior performance in pest image classification tasks, highlighting its potential for real-world agricultural applications.
format Article
id doaj-art-bee629f54a2d41d5aa12ed00fd8bd83b
institution OA Journals
issn 1664-462X
language English
publishDate 2025-06-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Plant Science
spelling doaj-art-bee629f54a2d41d5aa12ed00fd8bd83b2025-08-20T02:36:01ZengFrontiers Media S.A.Frontiers in Plant Science1664-462X2025-06-011610.3389/fpls.2025.16195511619551Swin Attention Augmented Residual Network: a fine-grained pest image recognition methodXiang WangZhiyong XiaoZhaohong DengPest infestation is a major cause of crop losses and a significant factor contributing to agricultural economic damage. Accurate identification of pests is therefore critical to ensuring crop safety. However, existing pest recognition methods often struggle to distinguish fine-grained visual differences between pest species and are susceptible to background interference from crops and environments. To address these challenges, we propose an improved pest identification method based on the Swin Transformer architecture, named Swin-AARNet (Attention Augmented Residual Network). Our method achieves efficient and accurate pest recognition. On the one hand, Swin-AARNet enhances local key features and establishes a feature complementation mechanism, thereby improving the extraction capability of local features. On the other hand, it integrates multi-scale information to effectively alleviate the problem of fine-grained feature ambiguity or loss. Furthermore, Swin-AARNet attained a classification accuracy of 78.77% on IP102, the largest publicly available pest dataset to date. To further validate its effectiveness and generalization ability, we conducted additional training and evaluation on the citrus benchmark dataset CPB and Li, achieving impressive accuracies of 82.17% and 99.48%, respectively. SwinAARNet demonstrates strong capability in distinguishing pests with highly similar appearances while remaining robust against complex and variable backgrounds. This makes it a promising tool for enhancing agricultural safety management, including crop environment monitoring and early invasion warning. Compared with other state-of-the-art models, our proposed method exhibits superior performance in pest image classification tasks, highlighting its potential for real-world agricultural applications.https://www.frontiersin.org/articles/10.3389/fpls.2025.1619551/fullartificial intelligencedeep learningfine-grained insect imageSwin Transformerimage classification
spellingShingle Xiang Wang
Zhiyong Xiao
Zhaohong Deng
Swin Attention Augmented Residual Network: a fine-grained pest image recognition method
Frontiers in Plant Science
artificial intelligence
deep learning
fine-grained insect image
Swin Transformer
image classification
title Swin Attention Augmented Residual Network: a fine-grained pest image recognition method
title_full Swin Attention Augmented Residual Network: a fine-grained pest image recognition method
title_fullStr Swin Attention Augmented Residual Network: a fine-grained pest image recognition method
title_full_unstemmed Swin Attention Augmented Residual Network: a fine-grained pest image recognition method
title_short Swin Attention Augmented Residual Network: a fine-grained pest image recognition method
title_sort swin attention augmented residual network a fine grained pest image recognition method
topic artificial intelligence
deep learning
fine-grained insect image
Swin Transformer
image classification
url https://www.frontiersin.org/articles/10.3389/fpls.2025.1619551/full
work_keys_str_mv AT xiangwang swinattentionaugmentedresidualnetworkafinegrainedpestimagerecognitionmethod
AT zhiyongxiao swinattentionaugmentedresidualnetworkafinegrainedpestimagerecognitionmethod
AT zhaohongdeng swinattentionaugmentedresidualnetworkafinegrainedpestimagerecognitionmethod