Efficient LLMs Training and Inference: An Introduction
ChatGPT was released in late November 2022 and made a significant impact globally. Following its release, numerous domestic and international open-source projects for large-model training emerged, including Alpaca, BLOOM, LLaMA, ChatGLM, DeepSpeed-Chat, and ColossalChat. Both academia and industry ha...
| Main Authors: | Rui Li, Deji Fu, Chunyu Shi, Zhilan Huang, Gang Lu |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | IEEE, 2025-01-01 |
| Series: | IEEE Access |
| Online Access: | https://ieeexplore.ieee.org/document/10756602/ |
Similar Items
- Addressing Activation Outliers in LLMs: A Systematic Review of Post-Training Quantization Techniques
  by: Patrik Czako, et al.
  Published: (2025-01-01)
- Long-context inference optimization for large language models: a survey
  by: TAO Wei, et al.
  Published: (2025-01-01)
- AsymGroup: Asymmetric Grouping and Communication Optimization for 2D Tensor Parallelism in LLM Inference
  by: Ki Tae Kim, et al.
  Published: (2025-01-01)
- Hive: A secure, scalable framework for distributed Ollama inference
  by: Domen Vake, et al.
  Published: (2025-05-01)
- ResDecode: Accelerating Large Language Models Inference via Residual Decoding Heads
  by: Ziqian Zeng, et al.
  Published: (2025-06-01)