Optimization of deep learning architecture based on multi-path convolutional neural network algorithm

Abstract Current multi-stream convolutional neural network (MSCNN) exhibits notable limitations in path cooperation, feature fusion, and resource utilization when handling complex tasks. To enhance MSCNN’s feature extraction ability, computational efficiency, and model robustness, this study conduct...

Full description

Saved in:
Bibliographic Details
Main Authors: Chuan Zhou, Yan Liu, Xinghan An, Xiyao Xu, Hao Wang
Format: Article
Language:English
Published: Nature Portfolio 2025-06-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-03765-3
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Current multi-stream convolutional neural network (MSCNN) exhibits notable limitations in path cooperation, feature fusion, and resource utilization when handling complex tasks. To enhance MSCNN’s feature extraction ability, computational efficiency, and model robustness, this study conducts an in-depth investigation of these architectural deficiencies and proposes corresponding improvements. At present, there are some problems in multi-path architecture, such as isolated information among paths, low efficiency of feature fusion mechanism, and high computational complexity. These issues lead to insufficient performance of the model in robustness indicators such as noise resistance, occlusion sensitivity, and resistance to sample attacks. The architecture also faces challenges in data scalability efficiency and resource scalability requirements. Therefore, this study proposes an optimized model based on a dynamic path cooperation mechanism and lightweight design, innovatively introducing a path attention mechanism and feature-sharing module to enhance information interaction between paths. Self-attention fusion method is adopted to improve the efficiency of feature fusion. At the same time, by combining path selection and model pruning technology, the effective balance between model performance and computational resources demand is realized. The study employs three datasets, Canadian Institute for Advanced Research-10 (CIFAR-10), ImageNet, and Custom Dataset for performance comparison and simulation. The results show that the proposed optimized model is superior to the current mainstream model in many indicators. For example, on the Medical Images dataset, the optimized model’s noise robustness, occlusion sensitivity, and sample attack resistance are 0.931, 0.950, and 0.709, respectively. On E-commerce Data, the optimized model’s data scalability efficiency reaches 0.969, and the resource scalability requirement is only 0.735, showing excellent task adaptability and resource utilization efficiency. Therefore, the study provides a critical reference for the optimization and practical application of MSCNN, contributing to the application research of deep learning in complex tasks.
ISSN:2045-2322