Multi-axis compression fusion network for vehicle re-identification

Abstract Vehicle re-identification (Re-ID) has become a challenging retrieval task due to the high inter-class similarity and low intra-class similarity among vehicles. To address this challenge, the self-attention mechanism has been extensively studied and applied, demonstrating its effectiveness i...

Full description

Saved in:
Bibliographic Details
Main Authors: Tengda Ma, Ke Sun, Xiyu Pang, Wei Si, Tongxin Liu, Cheng Wang
Format: Article
Language:English
Published: Nature Portfolio 2025-08-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-025-15854-4
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Vehicle re-identification (Re-ID) has become a challenging retrieval task due to the high inter-class similarity and low intra-class similarity among vehicles. To address this challenge, the self-attention mechanism has been extensively studied and applied, demonstrating its effectiveness in capturing long-range dependencies in vehicle Re-ID. Traditional spatial self-attention and channel self-attention assign different weights to each node (position/channel) based on pairwise dependencies at a global scale to model long-term dependencies, but this approach is not only computationally complex but also unable to fully mine refined features. In this paper, we propose a vehicle Re-ID network design based on a multi-axis compression fusion (MCF) attention mechanism. The MCF attention mechanism preserves feature information on different axes through compression operations while maintaining high computational efficiency. It utilizes single-axis self-attention calculations to update the weights and strengthens the regions of common interest across multiple axes by fusing information from multiple axes, thereby enhancing the effect of attention learning. On the basis of this mechanism, we propose a multi-axis compression fusion network (MCF-Net), which combines the spatial multi-axis compression fusion (S-MCF) module and the channel multi-axis compression fusion (C-MCF) module, and uses a rigid partitioning strategy to capture both global and fine-grained features. Experiments show that MCF-Net achieves state-of-the-art performance on the vehicle Re-ID datasets VeRi-776 and VehicleID.
ISSN:2045-2322