Addressing Activation Outliers in LLMs: A Systematic Review of Post-Training Quantization Techniques
Large Language Models (LLMs) have transformed natural language processing, yet their deployment remains challenging due to substantial computational, memory, and energy demands. Post-training quantization has emerged as a key strategy for enabling efficient inference, particularly in resource-constrained…
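The core difficulty the abstract alludes to is that a handful of extreme activation values inflate the quantization range, leaving few effective levels for typical values. As a generic illustration (not taken from the reviewed paper; the function names and the percentile threshold are assumptions for this sketch), the snippet below contrasts plain absmax int8 quantization with a simple percentile-clipping variant:

```python
# Minimal sketch: why an activation outlier degrades symmetric int8
# quantization, and a simple percentile-clipping mitigation.
import numpy as np

def quantize_absmax_int8(x: np.ndarray):
    """Per-tensor symmetric int8: scale set by the absolute maximum."""
    scale = np.max(np.abs(x)) / 127.0
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

def quantize_clipped_int8(x: np.ndarray, pct: float = 99.9):
    """Same scheme, but the scale comes from a high percentile of |x|,
    so a few extreme outliers saturate instead of inflating the step size."""
    scale = np.percentile(np.abs(x), pct) / 127.0
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
acts = rng.normal(0.0, 1.0, size=4096).astype(np.float32)
acts[7] = 80.0  # single outlier, as often seen in LLM activations

for name, fn in [("absmax", quantize_absmax_int8),
                 ("clipped", quantize_clipped_int8)]:
    q, s = fn(acts)
    mse = np.mean((acts - dequantize(q, s)) ** 2)
    print(f"{name}: scale={s:.4f}  MSE={mse:.6f}")
```

With the outlier present, the absmax scale is about 80/127, so the bulk of the (roughly unit-variance) activations collapse onto a few quantization levels; the clipped variant keeps a far smaller scale at the cost of saturating the one outlier.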
| Main Authors: | Patrik Czako, Gabor Kertesz, Sandor Szenasi |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | IEEE, 2025-01-01 |
| Series: | IEEE Access |
| Online Access: | https://ieeexplore.ieee.org/document/10994764/ |
Similar Items
- Efficient Deep Learning Model Compression for Sensor-Based Vision Systems via Outlier-Aware Quantization
  by: Joonhyuk Yoo, et al. Published: (2025-05-01)
- Mitigating Quantization Errors Due to Activation Spikes in Gated Linear Unit-Based Large Language Models
  by: Jaewoo Yang, et al. Published: (2025-04-01)
- Systematic Analysis of Retrieval-Augmented Generation-Based LLMs for Medical Chatbot Applications
  by: Arunabh Bora, et al. Published: (2024-10-01)
- Efficient LLMs Training and Inference: An Introduction
  by: Rui Li, et al. Published: (2025-01-01)
- ResDecode: Accelerating Large Language Models Inference via Residual Decoding Heads
  by: Ziqian Zeng, et al. Published: (2025-06-01)