TextNeX: Text Network of eXperts for Robust Text Classification—Case Study on Machine-Generated-Text Detection

Efficient and accurate text classification is essential for a wide range of natural language processing applications, including sentiment analysis, spam detection and machine-generated text identification. While recent advancements in transformer-based large language models have achieved remarkable...

Full description

Saved in:

Bibliographic Details
Main Authors:	Emmanuel Pintelas, Athanasios Koursaris, Ioannis E. Livieris, Vasilis Tampakas
Format:	Article
Language:	English
Published:	MDPI AG 2025-05-01
Series:	Mathematics
Subjects:	natural language processing text classification lightweight transformer-based models machine-generated text detection
Online Access:	https://www.mdpi.com/2227-7390/13/10/1555
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Efficient and accurate text classification is essential for a wide range of natural language processing applications, including sentiment analysis, spam detection and machine-generated text identification. While recent advancements in transformer-based large language models have achieved remarkable performance, they often come with significant computational costs, limiting their applicability in resource-constrained environments. In this work, we propose TextNeX, a new ensemble model that leverages lightweight language models to achieve state-of-the-art performance while maintaining computational efficiency. The development process of TextNeX model follows a three-phase procedure: (i) <i>Expansion</i>: generation of a pool of diverse lightweight models via randomized model setups and variations of training data; (ii) <i>Selection</i>: application of a clustering-based heterogeneity-driven selection to retain the most complementary models and (iii) <i>Ensemble optimization</i>: optimization of the selected models’ contributions using sequential quadratic programming. Experimental evaluations on three challenging text classification datasets demonstrate that TextNeX outperforms existing state-of-the-art ensemble models in accuracy, robustness and computational effectiveness, offering a practical alternative to large-scale models for real-world applications.
ISSN:	2227-7390

TextNeX: Text Network of eXperts for Robust Text Classification—Case Study on Machine-Generated-Text Detection

Similar Items