Prediction of soybean yellow mottle mosaic virus in soybean using hyperspectral imaging

Abstract Disease incidence is a key factor contributing to reduced crop yield. Thus, early identification of crop diseases is crucial for minimizing the effects of disease incidence and maximizing crop yield. Therefore, this study aims to identify soybean yellow mottle mosaic virus (SYMMV) using the...

Full description

Saved in:
Bibliographic Details
Main Authors: Amit Ghimire, Hong Seok Lee, Youngnam Yoon, Yoonha Kim
Format: Article
Language:English
Published: BMC 2025-08-01
Series:Plant Methods
Subjects:
Online Access:https://doi.org/10.1186/s13007-025-01428-y
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Disease incidence is a key factor contributing to reduced crop yield. Thus, early identification of crop diseases is crucial for minimizing the effects of disease incidence and maximizing crop yield. Therefore, this study aims to identify soybean yellow mottle mosaic virus (SYMMV) using the hyperspectral imaging (HSI) method combined with the machine learning (ML) technique. The soybeans were cultivated under two different environmental conditions, namely, EN I and EN II. In EN I, soybean plants were infected with SYMMV at the third vegetative growth stage, whereas in EN II, infected seeds were used. A reverse transcription polymerase chain reaction was conducted to distinguish the infected from noninfected plants. Mean spectrum values obtained from regions of interest in the Environmental Visualizing Images software served as data, while their respective wavelengths were used as features for ML models. The information gain method was used for the selection of characteristic wavelengths associated with disease identification. Continuous wavelengths ranging from 653 nm to 682 nm showed more information gain in both environments, indicating their significant role in SYMMV classification. Two classification models, random forest and k-nearest neighbor, classified the infected and noninfected plants at an early stage with over 90% accuracy. The support vector machine classified the disease with an average accuracy of > 95% across both environments, showing the best performance among the selected models. The logistic regression model showed lower accuracy, exceeding 82% in EN I, but improved to > 90% in EN II. These findings suggest that HSI combined with ML is the best alternative to the traditional method of disease identification in plants.
ISSN:1746-4811