A generalizable model for the facial recognition of sika deer with enhanced cross-domain performance

Animal recognition technology is recognized as a fundamental tool for ecological conservation, species monitoring, and scientific research. The population management of sika deer, a first-class protected species in China, heavily depends on precise individual identification. However, challenges such...

Full description

Saved in:
Bibliographic Details
Main Authors: Ye Mu, Jinghuan Hu, Zhipeng Li, Heyang Wang, He Gong, Yu Sun, Tianli Hu
Format: Article
Language:English
Published: Elsevier 2025-08-01
Series:Smart Agricultural Technology
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2772375525002047
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Animal recognition technology is recognized as a fundamental tool for ecological conservation, species monitoring, and scientific research. The population management of sika deer, a first-class protected species in China, heavily depends on precise individual identification. However, challenges such as complex backgrounds, occlusions, and age-related facial variations pose significant obstacles to robust facial recognition. To address these domain-specific limitations, an enhanced YOLOv11-based architecture, DeerFace-YOLO, is proposed. The model is validated through transfer learning across developmental stages using a curated dataset of 40 individuals with diverse postures in complex environments. To assess scalability, the framework is extended to pig facial recognition and cattle individual detection tasks, demonstrating cross-species generalization capabilities. A dual-channel architecture is implemented, integrating Vision Transformer (ViT) and Convolutional Neural Networks (CNNs) to synergize local feature extraction with global contextual modeling, thereby mitigating CNN’s limited receptive fields and ViT’s computational complexity. Background interference is suppressed through adaptive attention mechanisms. For multi-scale feature optimization, a Multi-Scale Feature Enhancement (MSFE) module is designed to dynamically prioritize task-relevant features. Additionally, a Reparameterizable Lightweight Detection Head (RLDH) is developed to maintain accuracy while reducing parameter redundancy by 33 %. Experimental results demonstrate state-of-the-art performance, achieving 98.7 % recognition accuracy and a threefold reduction in computational load compared to baseline models. The framework establishes a standardized dataset for non-invasive animal identification and provides a scalable solution for ecological conservation and precision livestock management.
ISSN:2772-3755