Head information bottleneck (HIB): leveraging information bottleneck for efficient transformer head attribution and pruning
Abstract Multi-head attention mechanisms have been widely applied in speech pre-training. However, their roles and effectiveness in various downstream tasks have not been fully explored. Attention heads may vary in importance depending on the downstream task. We assume that the attention allocation...
Saved in:
| Main Authors: | Yukun Qian, Xuyi Zhuang, Mingjiang Wang |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
SpringerOpen
2025-07-01
|
| Series: | EURASIP Journal on Audio, Speech, and Music Processing |
| Subjects: | |
| Online Access: | https://doi.org/10.1186/s13636-025-00411-8 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
The Supervised Information Bottleneck
by: Nir Z. Weingarten, et al.
Published: (2025-04-01) -
Advancing Model Explainability: Visual Concept Knowledge Distillation for Concept Bottleneck Models
by: Ju-Hwan Lee, et al.
Published: (2025-01-01) -
Hadoop bottleneck detection algorithm based on information gain
by: Zaole TAN, et al.
Published: (2016-07-01) -
Information Bottleneck Driven Deep Video Compression—IBOpenDVCW
by: Timor Leiderman, et al.
Published: (2024-09-01) -
Multi-attribute bottleneck identification method for hybrid flow shops in panel furniture intelligent manufacturing
by: Xinyi Yue, et al.
Published: (2025-06-01)