Smoothed per-tensor weight quantization: a robust solution for neural network deployment

This paper introduces a novel method to improve quantization outcomes for per-tensor weight quantization, focusing on enhancing computational efficiency and compatibility with resource-constrained hardware. Addressing the inherent challenges of depth-wise convolutions, the proposed smooth quantizati...

Bibliographic Details
Main Author:	Xin Chang
Format:	Article
Language:	English
Published:	Polish Academy of Sciences 2025-07-01
Series:	International Journal of Electronics and Telecommunications
Subjects:	per-tensor quantization edge device neural network compression
Online Access:	https://journals.pan.pl/Content/135755/23_4966_Chang_L_sk.pdf

Similar Items

Convolution Smooth: A Post-Training Quantization Method for Convolutional Neural Networks
by: Yongyuan Chen, et al.
Published: (2025-01-01)

Ultimate Compression: Joint Method of Quantization and Tensor Decomposition for Compact Models on the Edge
by: Mohammed Alnemari, et al.
Published: (2024-10-01)

Reducing Memory and Computational Cost for Deep Neural Network Training with Quantized Parameter Updates
by: Leo Buron, et al.
Published: (2025-08-01)

Fully Quantized Neural Networks for Audio Source Separation
by: Elad Cohen, et al.
Published: (2024-01-01)

Edge-Optimized Deep Learning Architectures for Classification of Agricultural Insects with Mobile Deployment
by: Muhammad Hannan Akhtar, et al.
Published: (2025-04-01)

An Adaptive Approach in Channel Quantization for Small Cells Based on Per-Receiver Antenna Quantization
by: Sanjeeb Shrestha, et al.
Published: (2025-01-01)

On-Edge Deployment of Vision Transformers for Medical Diagnostics Using the Kvasir-Capsule Dataset
by: Dara Varam, et al.
Published: (2024-09-01)

Quantization for a Condensation System
by: Shivam Dubey, et al.
Published: (2025-04-01)

Source Quantization and Coding over Noisy Channel Analysis
by: Runfeng Wang, et al.
Published: (2024-11-01)

Quantized Convolutional Neural Networks Robustness under Perturbation [version 1; peer review: 2 approved]
by: Guy Kember, et al.
Published: (2025-04-01)

Randomized Quantization for Privacy in Resource Constrained Machine Learning at-the-Edge and Federated Learning
by: Ce Feng, et al.
Published: (2025-01-01)

Accelerated Tensor Robust Principal Component Analysis via Factorized Tensor Norm Minimization
by: Geunseop Lee
Published: (2025-07-01)

Mixed precision quantization based on information entropy
by: Ting Qin, et al.
Published: (2025-04-01)

Forward-backward box smoothing with quantized measurements
by: Sun Wen
Published: (2022-05-01)

A Novel Mixed-Precision Quantization Approach for CNNs
by: Dan Wu, et al.
Published: (2025-01-01)

Utilizing the Attention Mechanism for Accuracy Prediction in Quantized Neural Networks
by: Lu Wei, et al.
Published: (2025-02-01)

Optimizing binary neural network quantization for fixed pattern noise robustness
by: Francisco Javier Andreo-Oliver, et al.
Published: (2025-07-01)

HLQ: Hardware-Friendly Logarithmic Quantization Aware Training for Power-Efficient Low-Precision CNN Models
by: Dahun Choi, et al.
Published: (2024-01-01)

Conditional Quantization for Uniform Distributions on Line Segments and Regular Polygons
by: Pigar Biteng, et al.
Published: (2025-03-01)

Mitigating Quantization Errors Due to Activation Spikes in Gated Linear Unit-Based Large Language Models
by: Jaewoo Yang, et al.
Published: (2025-04-01)

Efficient Spectral Compression of Wavelength-Shifting Soliton and Its Application in Integratable All-Optical Quantization
by: Chao Mei, et al.
Published: (2019-01-01)

An interpolated quantized guard band algorithm for physical layer key generation
by: Yongli An, et al.
Published: (2025-03-01)

Addressing Activation Outliers in LLMs: A Systematic Review of Post-Training Quantization Techniques
by: Patrik Czako, et al.
Published: (2025-01-01)

Data-oriented optimized nonuniform quantization for CR-enhanced communication efficiency in federated learning
by: Shuai Luo, et al.
Published: (2025-06-01)

Quantized convolutional neural networks: a hardware perspective
by: Li Zhang, et al.
Published: (2025-07-01)

Slim-sugarcane: a lightweight and high-precision method for sugarcane node detection and edge deployment in natural environments
by: Lijiao Wei, et al.
Published: (2025-07-01)

WAPS-Quant: Low-Bit Post-Training Quantization Using Weight-Activation Product Scaling
by: Geunjae Choi, et al.
Published: (2025-01-01)

Conditional Optimal Sets and the Quantization Coefficients for Some Uniform Distributions
by: Evans Nyanney, et al.
Published: (2025-07-01)

Speaker Authentication Using Vector Quantization
by: Bushra Q. Al-Abudi, et al.
Published: (2009-12-01)

Nonperturbative Lorentz violation and field quantization
by: V. Alan Kostelecký, et al.
Published: (2025-06-01)

Efficient Deep Learning Model Compression for Sensor-Based Vision Systems via Outlier-Aware Quantization
by: Joonhyuk Yoo, et al.
Published: (2025-05-01)

Enhanced Vector Quantization for Embedded Machine Learning: A Post-Training Approach With Incremental Clustering
by: Thommas K. S. Flores, et al.
Published: (2025-01-01)

Self-Supervised Pretraining and Quantization for Fault Tolerant Neural Networks: Friend or Foe?
by: Rosario Milazzo, et al.
Published: (2025-01-01)

ClipQ: Clipping Optimization for the Post-Training Quantization of Convolutional Neural Network
by: Yiming Chen, et al.
Published: (2025-04-01)

Optimization Strategies Applied to Deep Learning Models for Image Steganalysis: Application of Pruning, Quantization and Weight Clustering
by: Gabriel Ferreira, et al.
Published: (2025-04-01)

Optimization design of video encoder quantizer for general DSPs
by: GAN Yong, et al.
Published: (2007-01-01)

Covid-19 pandemic data analysis using tensor methods
by: Dipak Dulal, et al.
Published: (2024-03-01)

Quantization-Aware Training With Dynamic and Static Pruning
by: Sangho An, et al.
Published: (2025-01-01)

TCL: Time-Dependent Clustering Loss for Optimizing Post-Training Feature Map Quantization for Partitioned DNNs
by: Oscar Artur Bernd Berg, et al.
Published: (2025-01-01)

A Simultaneous Decomposition for a Quaternion Tensor Quaternity with Applications
by: Jia-Wei Huo, et al.
Published: (2025-05-01)