AYOLO: Development of a Real-Time Object Detection Model for the Detection of Secretly Cultivated Plants
AYOLO introduces a novel fusion architecture that integrates unsupervised learning techniques with Vision Transformers, leveraging the YOLO series models as its foundation. This innovation enables the effective utilization of rich, unlabeled data, establishing a new pretraining methodology tailored...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-03-01
|
| Series: | Applied Sciences |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2076-3417/15/5/2718 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | AYOLO introduces a novel fusion architecture that integrates unsupervised learning techniques with Vision Transformers, leveraging the YOLO series models as its foundation. This innovation enables the effective utilization of rich, unlabeled data, establishing a new pretraining methodology tailored to YOLO architectures. On a custom dataset comprising 80 images of poppy plants, AYOLO achieved a remarkable Average Precision (AP) of 38.7% while maintaining a high rendering speed of 239 FPS (Frames Per Second) on a Tesla K80 GPU. Real-time performance is demonstrated by achieving 239 FPS, and feature fusion optimally combines spatial and semantic information across scales. This performance surpasses the previous state-of-the-art YOLO v6-3.0 by +2.2% AP while retaining comparable speed. AYOLO exemplifies the potential of integrating advanced information fusion techniques with supervised pretraining, significantly enhancing precision and efficiency for object detection models optimized for small, specialized datasets. |
|---|---|
| ISSN: | 2076-3417 |