Improving Low-Resource Neural Machine Translation With Teacher-Free Knowledge Distillation
Knowledge Distillation (KD) aims to distill the knowledge of a cumbersome teacher model into a lightweight student model. Its success is generally attributed to the privileged information on similarities among categories provided by the teacher model, and in this sense, only strong teacher models ar...
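The abstract refers to the standard teacher–student distillation objective, in which the student is trained on a mix of hard labels and the teacher's temperature-softened output distribution. As a rough, generic illustration only (not the paper's teacher-free variant), a minimal PyTorch sketch of that loss might look like the following; the `temperature` and `alpha` hyperparameters are assumed for illustration:

```python
import torch
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, targets, temperature=2.0, alpha=0.5):
    """Generic KD objective: weighted sum of hard-label cross-entropy and the
    KL divergence between temperature-softened teacher and student outputs."""
    # Hard-label term: cross-entropy against the gold targets.
    ce = F.cross_entropy(student_logits, targets)
    # Soft-label term: KL divergence on temperature-scaled distributions,
    # rescaled by T^2 to keep gradient magnitudes comparable.
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    kl = F.kl_div(soft_student, soft_teacher, reduction="batchmean") * temperature ** 2
    return alpha * ce + (1.0 - alpha) * kl
```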
| Main Authors: | Xinlu Zhang, Xiao Li, Yating Yang, Rui Dong |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | IEEE, 2020-01-01 |
| Series: | IEEE Access |
| Online Access: | https://ieeexplore.ieee.org/document/9257421/ |
Similar Items
- Confidence-Based Knowledge Distillation to Reduce Training Costs and Carbon Footprint for Low-Resource Neural Machine Translation
  by: Maria Zafar, et al.
  Published: (2025-07-01)
- Non‐Autoregressive Translation Algorithm Based on LLM Knowledge Distillation in English Corpus
  by: Fang Ju, et al.
  Published: (2025-01-01)
- Decoupled Time-Dimensional Progressive Self-Distillation With Knowledge Calibration for Edge Computing-Enabled AIoT
  by: Yingchao Wang, et al.
  Published: (2024-01-01)
- Leveraging logit uncertainty for better knowledge distillation
  by: Zhen Guo, et al.
  Published: (2024-12-01)
- Knowledge distillation for spiking neural networks: aligning features and saliency
  by: Yifan Hu, et al.
  Published: (2025-01-01)