Transfer learning driven fake news detection and classification using large language models

Abstract Today, the problem of using social media to spread false information is not only widespread but also quite serious. The extensive dissemination of fake news, regardless of whether it is produced by human beings or computer programs, has a negative impact not only on society but also on indi...

Full description

Saved in:
Bibliographic Details
Main Authors: Basma S. Alqadi, Suliman A. Alsuhibany, Samia Nawaz Yousafzai, Sharf Alzu’bi, Deema Mohammed Alsekait, Diaa Salama AbdElminaam
Format: Article
Language:English
Published: Nature Portfolio 2025-08-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-10670-2
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849344347817902080
author Basma S. Alqadi
Suliman A. Alsuhibany
Samia Nawaz Yousafzai
Sharf Alzu’bi
Deema Mohammed Alsekait
Diaa Salama AbdElminaam
author_facet Basma S. Alqadi
Suliman A. Alsuhibany
Samia Nawaz Yousafzai
Sharf Alzu’bi
Deema Mohammed Alsekait
Diaa Salama AbdElminaam
author_sort Basma S. Alqadi
collection DOAJ
description Abstract Today, the problem of using social media to spread false information is not only widespread but also quite serious. The extensive dissemination of fake news, regardless of whether it is produced by human beings or computer programs, has a negative impact not only on society but also on individuals in terms of politics and society. Currently of social networks, the quick dissemination of news provides a challenge when it comes to establishing the reliability of the information in a satisfactory manner. Because of this, the requirement for automated technologies that can identify fake news has become of the utmost importance. Existing fake news detection methods often suffer from challenges such as limited labeled data, inability to fully capture complex linguistic nuances, and inadequate integration of different embedding techniques, which restrict their effectiveness and generalizability. In this work, we propose a novel multi-stage transfer learning framework that leverages the strengths of pre-trained large language models, particularly RoBERTa, tailored specifically for fake news detection in limited data scenarios. Unlike prior studies which primarily rely on standard fine-tuning, our approach introduces a systematic comparison of word embedding techniques such as Word2Vec and one-hot encoding, combined with a refined fine-tuning process to enhance model performance and interpretability. The experimental results on two real-world benchmark datasets demonstrate that our method achieves a significant accuracy improvement of at least 3.9% over existing state-of-the-art models, while also providing insights into the role of embedding techniques in fake news classification. To address these limitations, our approach fills the gap by combining multi-stage transfer learning with embedding comparisons and task-specific optimizations, enabling more robust and accurate detection on small datasets. Based on the findings of our experiments conducted on two datasets derived from the real world, we have determined that the transfer learning-based strategy that we have developed can outperform the most advanced approaches by a minimum of 3.9% in terms of accuracy and offering a rational explanation.
format Article
id doaj-art-72ba2084e24d45eab7fc608dc6593f6e
institution Kabale University
issn 2045-2322
language English
publishDate 2025-08-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-72ba2084e24d45eab7fc608dc6593f6e2025-08-20T03:42:41ZengNature PortfolioScientific Reports2045-23222025-08-0115111710.1038/s41598-025-10670-2Transfer learning driven fake news detection and classification using large language modelsBasma S. Alqadi0Suliman A. Alsuhibany1Samia Nawaz Yousafzai2Sharf Alzu’bi3Deema Mohammed Alsekait4Diaa Salama AbdElminaam5Computer Science Department,College of Computer and Information Science, Imam Mohammad Ibn Saud Islamic UniversityDepartment of Computer Science,College of Computer, Qassim UniversityDepartment of Computer Science, HITEC University TaxilaDepartment of Information Technology,College of Engineering and Technology, Royal University for WomenDepartment of Information Technology,College of Computer and Information Sciences, Princess Nourah bint Abdulrahman UniversityFaculty of Computers and Artificial Intelligence, Benha UniversityAbstract Today, the problem of using social media to spread false information is not only widespread but also quite serious. The extensive dissemination of fake news, regardless of whether it is produced by human beings or computer programs, has a negative impact not only on society but also on individuals in terms of politics and society. Currently of social networks, the quick dissemination of news provides a challenge when it comes to establishing the reliability of the information in a satisfactory manner. Because of this, the requirement for automated technologies that can identify fake news has become of the utmost importance. Existing fake news detection methods often suffer from challenges such as limited labeled data, inability to fully capture complex linguistic nuances, and inadequate integration of different embedding techniques, which restrict their effectiveness and generalizability. In this work, we propose a novel multi-stage transfer learning framework that leverages the strengths of pre-trained large language models, particularly RoBERTa, tailored specifically for fake news detection in limited data scenarios. Unlike prior studies which primarily rely on standard fine-tuning, our approach introduces a systematic comparison of word embedding techniques such as Word2Vec and one-hot encoding, combined with a refined fine-tuning process to enhance model performance and interpretability. The experimental results on two real-world benchmark datasets demonstrate that our method achieves a significant accuracy improvement of at least 3.9% over existing state-of-the-art models, while also providing insights into the role of embedding techniques in fake news classification. To address these limitations, our approach fills the gap by combining multi-stage transfer learning with embedding comparisons and task-specific optimizations, enabling more robust and accurate detection on small datasets. Based on the findings of our experiments conducted on two datasets derived from the real world, we have determined that the transfer learning-based strategy that we have developed can outperform the most advanced approaches by a minimum of 3.9% in terms of accuracy and offering a rational explanation.https://doi.org/10.1038/s41598-025-10670-2Transfer learningLarge language modelsFake news detectionDeep learningRoBERTaWord embedding
spellingShingle Basma S. Alqadi
Suliman A. Alsuhibany
Samia Nawaz Yousafzai
Sharf Alzu’bi
Deema Mohammed Alsekait
Diaa Salama AbdElminaam
Transfer learning driven fake news detection and classification using large language models
Scientific Reports
Transfer learning
Large language models
Fake news detection
Deep learning
RoBERTa
Word embedding
title Transfer learning driven fake news detection and classification using large language models
title_full Transfer learning driven fake news detection and classification using large language models
title_fullStr Transfer learning driven fake news detection and classification using large language models
title_full_unstemmed Transfer learning driven fake news detection and classification using large language models
title_short Transfer learning driven fake news detection and classification using large language models
title_sort transfer learning driven fake news detection and classification using large language models
topic Transfer learning
Large language models
Fake news detection
Deep learning
RoBERTa
Word embedding
url https://doi.org/10.1038/s41598-025-10670-2
work_keys_str_mv AT basmasalqadi transferlearningdrivenfakenewsdetectionandclassificationusinglargelanguagemodels
AT sulimanaalsuhibany transferlearningdrivenfakenewsdetectionandclassificationusinglargelanguagemodels
AT samianawazyousafzai transferlearningdrivenfakenewsdetectionandclassificationusinglargelanguagemodels
AT sharfalzubi transferlearningdrivenfakenewsdetectionandclassificationusinglargelanguagemodels
AT deemamohammedalsekait transferlearningdrivenfakenewsdetectionandclassificationusinglargelanguagemodels
AT diaasalamaabdelminaam transferlearningdrivenfakenewsdetectionandclassificationusinglargelanguagemodels