AttenFlow: Context-Aware Architecture with Consensus-Based Retrieval and Graph Attention for Automated Document Processing

Automated document processing and circulation systems face critical challenges in achieving reliable retrieval accuracy and robust classification performance, particularly in security-critical organizational environments. Traditional approaches suffer from fundamental limitations, including fixed fu...

Full description

Saved in:
Bibliographic Details
Main Authors: Xianfeng Zhang, Bin Hu, Shukan Liu, Qiao Sun, Lin Chen
Format: Article
Language:English
Published: MDPI AG 2025-07-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/13/7517
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Automated document processing and circulation systems face critical challenges in achieving reliable retrieval accuracy and robust classification performance, particularly in security-critical organizational environments. Traditional approaches suffer from fundamental limitations, including fixed fusion strategies in hybrid retrieval systems, inability to model inter-document relationships in classification tasks, and lack of confidence estimation for result reliability. This paper introduces AttenFlow, a novel context-aware architecture that revolutionizes document management through two core technical innovations. First, we propose the retriever consensus confidence fusion (RCCF) method, which addresses the limitations of conventional hybrid retrieval approaches by introducing consensus-based fusion strategies that dynamically adapt to retriever agreement levels while providing confidence estimates for results. RCCF measures the consensus between different retrievers through sophisticated ranking and scoring consistency metrics, enabling adaptive weight assignment that amplifies high-consensus results while adopting conservative approaches for uncertain cases. Second, we develop adversarial mutual-attention hybrid-dimensional graph attention network (AM-HDGAT) for text, which transforms document classification by modeling inter-document relationships through graph structures while integrating high-dimensional semantic features and low-dimensional statistical features through mutual-attention mechanisms. The approach incorporates adversarial training to enhance robustness against potential security threats, making it particularly suitable for critical document processing applications. Comprehensive experimental evaluation across multiple benchmark datasets demonstrates the substantial effectiveness of our innovations. RCCF achieves improvements of up to 16.9% in retrieval performance metrics compared to traditional fusion methods while providing reliable confidence estimates. AM-HDGAT for text demonstrates superior classification performance with an average F1-score improvement of 2.23% compared to state-of-the-art methods, maintaining 82.4% performance retention under adversarial attack scenarios. Real-world deployment validation shows a 34.5% reduction in manual processing time and 95.7% user satisfaction scores, establishing AttenFlow as a significant advancement in intelligent document management technology.
ISSN:2076-3417