Transfer learning driven fake news detection and classification using large language models

Abstract Today, the problem of using social media to spread false information is not only widespread but also quite serious. The extensive dissemination of fake news, regardless of whether it is produced by human beings or computer programs, has a negative impact not only on society but also on indi...

Full description

Saved in:

Bibliographic Details
Main Authors:	Basma S. Alqadi, Suliman A. Alsuhibany, Samia Nawaz Yousafzai, Sharf Alzu’bi, Deema Mohammed Alsekait, Diaa Salama AbdElminaam
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-08-01
Series:	Scientific Reports
Subjects:	Transfer learning Large language models Fake news detection Deep learning RoBERTa Word embedding
Online Access:	https://doi.org/10.1038/s41598-025-10670-2
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849344347817902080
author	Basma S. Alqadi Suliman A. Alsuhibany Samia Nawaz Yousafzai Sharf Alzu’bi Deema Mohammed Alsekait Diaa Salama AbdElminaam
author_facet	Basma S. Alqadi Suliman A. Alsuhibany Samia Nawaz Yousafzai Sharf Alzu’bi Deema Mohammed Alsekait Diaa Salama AbdElminaam
author_sort	Basma S. Alqadi
collection	DOAJ
description	Abstract Today, the problem of using social media to spread false information is not only widespread but also quite serious. The extensive dissemination of fake news, regardless of whether it is produced by human beings or computer programs, has a negative impact not only on society but also on individuals in terms of politics and society. Currently of social networks, the quick dissemination of news provides a challenge when it comes to establishing the reliability of the information in a satisfactory manner. Because of this, the requirement for automated technologies that can identify fake news has become of the utmost importance. Existing fake news detection methods often suffer from challenges such as limited labeled data, inability to fully capture complex linguistic nuances, and inadequate integration of different embedding techniques, which restrict their effectiveness and generalizability. In this work, we propose a novel multi-stage transfer learning framework that leverages the strengths of pre-trained large language models, particularly RoBERTa, tailored specifically for fake news detection in limited data scenarios. Unlike prior studies which primarily rely on standard fine-tuning, our approach introduces a systematic comparison of word embedding techniques such as Word2Vec and one-hot encoding, combined with a refined fine-tuning process to enhance model performance and interpretability. The experimental results on two real-world benchmark datasets demonstrate that our method achieves a significant accuracy improvement of at least 3.9% over existing state-of-the-art models, while also providing insights into the role of embedding techniques in fake news classification. To address these limitations, our approach fills the gap by combining multi-stage transfer learning with embedding comparisons and task-specific optimizations, enabling more robust and accurate detection on small datasets. Based on the findings of our experiments conducted on two datasets derived from the real world, we have determined that the transfer learning-based strategy that we have developed can outperform the most advanced approaches by a minimum of 3.9% in terms of accuracy and offering a rational explanation.
format	Article
id	doaj-art-72ba2084e24d45eab7fc608dc6593f6e
institution	Kabale University
issn	2045-2322
language	English
publishDate	2025-08-01
publisher	Nature Portfolio
record_format	Article
series	Scientific Reports
spelling	doaj-art-72ba2084e24d45eab7fc608dc6593f6e2025-08-20T03:42:41ZengNature PortfolioScientific Reports2045-23222025-08-0115111710.1038/s41598-025-10670-2Transfer learning driven fake news detection and classification using large language modelsBasma S. Alqadi0Suliman A. Alsuhibany1Samia Nawaz Yousafzai2Sharf Alzu’bi3Deema Mohammed Alsekait4Diaa Salama AbdElminaam5Computer Science Department,College of Computer and Information Science, Imam Mohammad Ibn Saud Islamic UniversityDepartment of Computer Science,College of Computer, Qassim UniversityDepartment of Computer Science, HITEC University TaxilaDepartment of Information Technology,College of Engineering and Technology, Royal University for WomenDepartment of Information Technology,College of Computer and Information Sciences, Princess Nourah bint Abdulrahman UniversityFaculty of Computers and Artificial Intelligence, Benha UniversityAbstract Today, the problem of using social media to spread false information is not only widespread but also quite serious. The extensive dissemination of fake news, regardless of whether it is produced by human beings or computer programs, has a negative impact not only on society but also on individuals in terms of politics and society. Currently of social networks, the quick dissemination of news provides a challenge when it comes to establishing the reliability of the information in a satisfactory manner. Because of this, the requirement for automated technologies that can identify fake news has become of the utmost importance. Existing fake news detection methods often suffer from challenges such as limited labeled data, inability to fully capture complex linguistic nuances, and inadequate integration of different embedding techniques, which restrict their effectiveness and generalizability. In this work, we propose a novel multi-stage transfer learning framework that leverages the strengths of pre-trained large language models, particularly RoBERTa, tailored specifically for fake news detection in limited data scenarios. Unlike prior studies which primarily rely on standard fine-tuning, our approach introduces a systematic comparison of word embedding techniques such as Word2Vec and one-hot encoding, combined with a refined fine-tuning process to enhance model performance and interpretability. The experimental results on two real-world benchmark datasets demonstrate that our method achieves a significant accuracy improvement of at least 3.9% over existing state-of-the-art models, while also providing insights into the role of embedding techniques in fake news classification. To address these limitations, our approach fills the gap by combining multi-stage transfer learning with embedding comparisons and task-specific optimizations, enabling more robust and accurate detection on small datasets. Based on the findings of our experiments conducted on two datasets derived from the real world, we have determined that the transfer learning-based strategy that we have developed can outperform the most advanced approaches by a minimum of 3.9% in terms of accuracy and offering a rational explanation.https://doi.org/10.1038/s41598-025-10670-2Transfer learningLarge language modelsFake news detectionDeep learningRoBERTaWord embedding
spellingShingle	Basma S. Alqadi Suliman A. Alsuhibany Samia Nawaz Yousafzai Sharf Alzu’bi Deema Mohammed Alsekait Diaa Salama AbdElminaam Transfer learning driven fake news detection and classification using large language models Scientific Reports Transfer learning Large language models Fake news detection Deep learning RoBERTa Word embedding
title	Transfer learning driven fake news detection and classification using large language models
title_full	Transfer learning driven fake news detection and classification using large language models
title_fullStr	Transfer learning driven fake news detection and classification using large language models
title_full_unstemmed	Transfer learning driven fake news detection and classification using large language models
title_short	Transfer learning driven fake news detection and classification using large language models
title_sort	transfer learning driven fake news detection and classification using large language models
topic	Transfer learning Large language models Fake news detection Deep learning RoBERTa Word embedding
url	https://doi.org/10.1038/s41598-025-10670-2
work_keys_str_mv	AT basmasalqadi transferlearningdrivenfakenewsdetectionandclassificationusinglargelanguagemodels AT sulimanaalsuhibany transferlearningdrivenfakenewsdetectionandclassificationusinglargelanguagemodels AT samianawazyousafzai transferlearningdrivenfakenewsdetectionandclassificationusinglargelanguagemodels AT sharfalzubi transferlearningdrivenfakenewsdetectionandclassificationusinglargelanguagemodels AT deemamohammedalsekait transferlearningdrivenfakenewsdetectionandclassificationusinglargelanguagemodels AT diaasalamaabdelminaam transferlearningdrivenfakenewsdetectionandclassificationusinglargelanguagemodels

Transfer learning driven fake news detection and classification using large language models

Similar Items