Leveraging logit uncertainty for better knowledge distillation
Abstract: Knowledge distillation improves student model performance. However, using a larger teacher model does not necessarily yield better distillation gains, because of the significant architecture and output gaps between the teacher and smaller student networks. To address this issue, we reconsider teacher outputs and f...
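For context, the abstract refers to logit-based knowledge distillation. Below is a minimal sketch of the conventional (Hinton-style) distillation loss, assuming a PyTorch setup; it is background only and does not implement the logit-uncertainty weighting the article proposes. The function name `distillation_loss` and the hyperparameters `temperature` and `alpha` are illustrative, not taken from the paper.

```python
# Minimal sketch of standard (Hinton-style) knowledge distillation, shown for
# background only; the paper's logit-uncertainty method is NOT reproduced here.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits: torch.Tensor,
                      teacher_logits: torch.Tensor,
                      labels: torch.Tensor,
                      temperature: float = 4.0,
                      alpha: float = 0.5) -> torch.Tensor:
    """Combine a hard-label cross-entropy term with a soft-label KL term."""
    # Soften both distributions with the temperature; a higher T spreads
    # probability mass over non-target classes ("dark knowledge").
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)

    # KL divergence between teacher and student soft distributions, scaled by
    # T^2 so gradient magnitudes stay comparable across temperatures.
    kd_term = F.kl_div(soft_student, soft_teacher,
                       reduction="batchmean") * temperature ** 2

    # Ordinary cross-entropy against the ground-truth labels.
    ce_term = F.cross_entropy(student_logits, labels)

    return alpha * kd_term + (1.0 - alpha) * ce_term

# Toy usage: random logits for a batch of 8 examples over 10 classes.
if __name__ == "__main__":
    student = torch.randn(8, 10)
    teacher = torch.randn(8, 10)
    labels = torch.randint(0, 10, (8,))
    print(distillation_loss(student, teacher, labels).item())
```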
| Main Authors: | Zhen Guo, Dong Wang, Qiang He, Pengzhou Zhang |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Nature Portfolio, 2024-12-01 |
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-024-82647-6 |
Similar Items
- Optimal Knowledge Distillation through Non-Heuristic Control of Dark Knowledge
  by: Darian Onchis, et al.
  Published: (2024-08-01)
- A Review of Knowledge Distillation in Object Detection
  by: Shengjie Cheng, et al.
  Published: (2025-01-01)
- Decoupled Time-Dimensional Progressive Self-Distillation With Knowledge Calibration for Edge Computing-Enabled AIoT
  by: Yingchao Wang, et al.
  Published: (2024-01-01)
- Autocorrelation Matrix Knowledge Distillation: A Task-Specific Distillation Method for BERT Models
  by: Kai Zhang, et al.
  Published: (2024-10-01)
- Aligning to the teacher: multilevel feature-aligned knowledge distillation
  by: Yang Zhang, et al.
  Published: (2025-08-01)