Parameter Disentanglement for Diverse Representations
Recent advances in neural network architectures reveal the importance of diverse representations. However, simply integrating more branches or increasing the width for the diversity would inevitably increase model complexity, leading to prohibitive inference costs. In this paper, we revisit the learnable parameters in neural networks and showcase that it is feasible to disentangle learnable parameters to latent sub-parameters, which focus on different patterns and representations. This important finding leads us to study further the aggregation of diverse representations in a network structure. To this end, we propose Parameter Disentanglement for Diverse Representations (PDDR), which considers diverse patterns in parallel during training, and aggregates them into one for efficient inference. To further enhance the diverse representations, we develop a lightweight refinement module in PDDR, which adaptively refines the combination of diverse representations according to the input. PDDR can be seamlessly integrated into modern networks, significantly improving the learning capacity of a network while maintaining the same complexity for inference. Experimental results show great improvements on various tasks, with an improvement of 1.47% over Residual Network 50 (ResNet50) on ImageNet, and we improve the detection results of Retina Residual Network 50 (Retina-ResNet50) by 1.7% Mean Average Precision (mAP). Integrating PDDR into recent lightweight vision transformer models, the resulting model outperforms related works by a clear margin.
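The abstract describes a layer whose learnable parameters are disentangled into several latent sub-parameters that are trained in parallel, combined by a lightweight input-adaptive refinement module, and aggregated into a single operator for inference. Below is a minimal, hypothetical PyTorch sketch of that general idea, written in the spirit of dynamic or re-parameterized convolutions; it is not the authors' implementation, and all names (`PDDRConv2d`, `num_branches`, `refine`, `fuse`) are invented for illustration.

```python
# Illustrative sketch only, not the paper's released code. Assumption: K latent
# sub-kernels share one convolution slot, a tiny pooling+linear module produces
# per-input mixing coefficients during training, and the branches can be
# collapsed into one kernel once the coefficients are frozen for deployment.
import torch
import torch.nn as nn
import torch.nn.functional as F


class PDDRConv2d(nn.Module):
    def __init__(self, in_ch, out_ch, kernel_size=3, num_branches=4, padding=1):
        super().__init__()
        self.num_branches = num_branches
        self.padding = padding
        # K latent sub-kernels ("disentangled" parameters).
        self.sub_weights = nn.Parameter(
            torch.randn(num_branches, out_ch, in_ch, kernel_size, kernel_size) * 0.01
        )
        # Lightweight refinement: global pooling + linear -> K mixing coefficients.
        self.refine = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(in_ch, num_branches), nn.Softmax(dim=1)
        )
        self.fused_weight = None  # single aggregated kernel used at inference

    def forward(self, x):
        if self.fused_weight is not None:            # inference path: one conv
            return F.conv2d(x, self.fused_weight, padding=self.padding)
        coeff = self.refine(x)                        # (B, K) per-sample coefficients
        # Mix the K sub-kernels per sample, then apply them as a grouped conv.
        w = torch.einsum("bk,koihw->boihw", coeff, self.sub_weights)
        b, c, h, wd = x.shape
        out = F.conv2d(x.reshape(1, b * c, h, wd),
                       w.reshape(-1, c, *w.shape[-2:]),
                       padding=self.padding, groups=b)
        return out.reshape(b, -1, out.shape[-2], out.shape[-1])

    @torch.no_grad()
    def fuse(self, coeff=None):
        """Aggregate the K sub-kernels into a single kernel with fixed coefficients."""
        if coeff is None:
            coeff = torch.full((self.num_branches,), 1.0 / self.num_branches)
        self.fused_weight = torch.einsum("k,koihw->oihw", coeff, self.sub_weights)
```

In this simplified reading, training pays for K parallel branches, while calling `fuse()` with fixed coefficients reduces the layer to a single standard convolution, so inference cost stays that of the plain network.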
| Main Authors: | Jingxu Wang, Jingda Guo, Ruili Wang, Zhao Zhang, Liyong Fu, Qiaolin Ye |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Tsinghua University Press, 2025-05-01 |
| Series: | Big Data Mining and Analytics |
| Subjects: | representation learning; efficient network; computer vision |
| Online Access: | https://www.sciopen.com/article/10.26599/BDMA.2024.9020087 |
| _version_ | 1849304126976950272 |
|---|---|
| author | Jingxu Wang, Jingda Guo, Ruili Wang, Zhao Zhang, Liyong Fu, Qiaolin Ye |
| author_sort | Jingxu Wang |
| collection | DOAJ |
| description | Recent advances in neural network architectures reveal the importance of diverse representations. However, simply integrating more branches or increasing the width for the diversity would inevitably increase model complexity, leading to prohibitive inference costs. In this paper, we revisit the learnable parameters in neural networks and showcase that it is feasible to disentangle learnable parameters to latent sub-parameters, which focus on different patterns and representations. This important finding leads us to study further the aggregation of diverse representations in a network structure. To this end, we propose Parameter Disentanglement for Diverse Representations (PDDR), which considers diverse patterns in parallel during training, and aggregates them into one for efficient inference. To further enhance the diverse representations, we develop a lightweight refinement module in PDDR, which adaptively refines the combination of diverse representations according to the input. PDDR can be seamlessly integrated into modern networks, significantly improving the learning capacity of a network while maintaining the same complexity for inference. Experimental results show great improvements on various tasks, with an improvement of 1.47% over Residual Network 50 (ResNet50) on ImageNet, and we improve the detection results of Retina Residual Network 50 (Retina-ResNet50) by 1.7% Mean Average Precision (mAP). Integrating PDDR into recent lightweight vision transformer models, the resulting model outperforms related works by a clear margin. |
| format | Article |
| id | doaj-art-6a9fe109a21041bba2c370da5ecabe01 |
| institution | Kabale University |
| issn | 2096-0654; 2097-406X |
| language | English |
| publishDate | 2025-05-01 |
| publisher | Tsinghua University Press |
| record_format | Article |
| series | Big Data Mining and Analytics |
| spelling | Jingxu Wang (the College of Information Science and Technology & Artificial Intelligence, State Key Laboratory of Tree Genetics and Breeding, and also with the Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, China); Jingda Guo (the Research Institute of New Technology, Hillstone Networks, Santa Clara, CA 95054, USA); Ruili Wang (the School of Mathematical and Computational Sciences, Massey University, Auckland 102-904, New Zealand); Zhao Zhang (the Key Laboratory of Knowledge Engineering with Big Data, Ministry of Education, Hefei University of Technology, Hefei 230009, China); Liyong Fu (the College of Forestry, Hebei Agricultural University, Baoding 071000, China, and also with the Institute of Forest Resource Information Techniques, Chinese Academy of Forestry, Beijing 100091, China); Qiaolin Ye (the College of Information Science and Technology & Artificial Intelligence, State Key Laboratory of Tree Genetics and Breeding, and also with the Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, China). Parameter Disentanglement for Diverse Representations. Big Data Mining and Analytics (Tsinghua University Press), 2025-05-01, Vol. 8, No. 3, pp. 606-623. ISSN 2096-0654, eISSN 2097-406X. DOI: 10.26599/BDMA.2024.9020087. https://www.sciopen.com/article/10.26599/BDMA.2024.9020087 |
| title | Parameter Disentanglement for Diverse Representations |
| topic | representation learning; efficient network; computer vision |
| url | https://www.sciopen.com/article/10.26599/BDMA.2024.9020087 |