MDA-DETR: Enhancing Offending Animal Detection with Multi-Channel Attention and Multi-Scale Feature Aggregation

Conflicts between humans and animals in agricultural and settlement areas have recently increased, resulting in significant resource loss and risks to human and animal lives. This growing issue presents a global challenge. This paper addresses the detection and identification of offending animals, p...

Full description

Saved in:
Bibliographic Details
Main Authors: Haiyan Zhang, Huiqi Li, Guodong Sun, Feng Yang
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Animals
Subjects:
Online Access:https://www.mdpi.com/2076-2615/15/2/259
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832589345840693248
author Haiyan Zhang
Huiqi Li
Guodong Sun
Feng Yang
author_facet Haiyan Zhang
Huiqi Li
Guodong Sun
Feng Yang
author_sort Haiyan Zhang
collection DOAJ
description Conflicts between humans and animals in agricultural and settlement areas have recently increased, resulting in significant resource loss and risks to human and animal lives. This growing issue presents a global challenge. This paper addresses the detection and identification of offending animals, particularly in obscured or blurry nighttime images. This article introduces Multi-Channel Coordinated Attention and Multi-Dimension Feature Aggregation (MDA-DETR). It integrates multi-scale features for enhanced detection accuracy, employing a Multi-Channel Coordinated Attention (MCCA) mechanism to incorporate location, semantic, and long-range dependency information and a Multi-Dimension Feature Aggregation Module (DFAM) for cross-scale feature aggregation. Additionally, the VariFocal Loss function is utilized to assign pixel weights, enhancing detail focus and maintaining accuracy. In the dataset section, this article uses a dataset from the Northeast China Tiger and Leopard National Park, which includes images of six common offending animal species. In the comprehensive experiments on the dataset, the <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>m</mi><mi>A</mi><msub><mi>P</mi><mn>50</mn></msub></mrow></semantics></math></inline-formula> index of MDA-DETR was 1.3%, 0.6%, 0.3%, 3%, 1.1%, and 0.5% higher than RT-DETR-r18, yolov8n, yolov9-C, DETR, Deformable-detr, and DCA-yolov8, respectively, indicating that MDA-DETR is superior to other advanced methods.
format Article
id doaj-art-9cfe5310202b47128f90ee4e939d645f
institution Kabale University
issn 2076-2615
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Animals
spelling doaj-art-9cfe5310202b47128f90ee4e939d645f2025-01-24T13:18:19ZengMDPI AGAnimals2076-26152025-01-0115225910.3390/ani15020259MDA-DETR: Enhancing Offending Animal Detection with Multi-Channel Attention and Multi-Scale Feature AggregationHaiyan Zhang0Huiqi Li1Guodong Sun2Feng Yang3School of Information Science and Technology, Beijing Forestry University, Beijing 100083, ChinaSchool of Information Science and Technology, Beijing Forestry University, Beijing 100083, ChinaSchool of Information Science and Technology, Beijing Forestry University, Beijing 100083, ChinaSchool of Information Science and Technology, Beijing Forestry University, Beijing 100083, ChinaConflicts between humans and animals in agricultural and settlement areas have recently increased, resulting in significant resource loss and risks to human and animal lives. This growing issue presents a global challenge. This paper addresses the detection and identification of offending animals, particularly in obscured or blurry nighttime images. This article introduces Multi-Channel Coordinated Attention and Multi-Dimension Feature Aggregation (MDA-DETR). It integrates multi-scale features for enhanced detection accuracy, employing a Multi-Channel Coordinated Attention (MCCA) mechanism to incorporate location, semantic, and long-range dependency information and a Multi-Dimension Feature Aggregation Module (DFAM) for cross-scale feature aggregation. Additionally, the VariFocal Loss function is utilized to assign pixel weights, enhancing detail focus and maintaining accuracy. In the dataset section, this article uses a dataset from the Northeast China Tiger and Leopard National Park, which includes images of six common offending animal species. In the comprehensive experiments on the dataset, the <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>m</mi><mi>A</mi><msub><mi>P</mi><mn>50</mn></msub></mrow></semantics></math></inline-formula> index of MDA-DETR was 1.3%, 0.6%, 0.3%, 3%, 1.1%, and 0.5% higher than RT-DETR-r18, yolov8n, yolov9-C, DETR, Deformable-detr, and DCA-yolov8, respectively, indicating that MDA-DETR is superior to other advanced methods.https://www.mdpi.com/2076-2615/15/2/259object detectionRT-DETRtransformercomputer visionattention mechanism
spellingShingle Haiyan Zhang
Huiqi Li
Guodong Sun
Feng Yang
MDA-DETR: Enhancing Offending Animal Detection with Multi-Channel Attention and Multi-Scale Feature Aggregation
Animals
object detection
RT-DETR
transformer
computer vision
attention mechanism
title MDA-DETR: Enhancing Offending Animal Detection with Multi-Channel Attention and Multi-Scale Feature Aggregation
title_full MDA-DETR: Enhancing Offending Animal Detection with Multi-Channel Attention and Multi-Scale Feature Aggregation
title_fullStr MDA-DETR: Enhancing Offending Animal Detection with Multi-Channel Attention and Multi-Scale Feature Aggregation
title_full_unstemmed MDA-DETR: Enhancing Offending Animal Detection with Multi-Channel Attention and Multi-Scale Feature Aggregation
title_short MDA-DETR: Enhancing Offending Animal Detection with Multi-Channel Attention and Multi-Scale Feature Aggregation
title_sort mda detr enhancing offending animal detection with multi channel attention and multi scale feature aggregation
topic object detection
RT-DETR
transformer
computer vision
attention mechanism
url https://www.mdpi.com/2076-2615/15/2/259
work_keys_str_mv AT haiyanzhang mdadetrenhancingoffendinganimaldetectionwithmultichannelattentionandmultiscalefeatureaggregation
AT huiqili mdadetrenhancingoffendinganimaldetectionwithmultichannelattentionandmultiscalefeatureaggregation
AT guodongsun mdadetrenhancingoffendinganimaldetectionwithmultichannelattentionandmultiscalefeatureaggregation
AT fengyang mdadetrenhancingoffendinganimaldetectionwithmultichannelattentionandmultiscalefeatureaggregation