Cyberbullying-Related Automated Hate Speech Detection on Social Media Platforms Using Stack Ensemble Classification Method

Abstract Hate speech (HS) has grown because of increasing social media platform usage, which includes Twitter, YouTube, and Facebook. The frequent attempts to implement automated detection systems remain unsuccessful at separating hate speech from objectionable language, because user-generated conte...

Full description

Saved in:
Bibliographic Details
Main Authors: Muhammad Mubeen, Aliza Muskan, Arslan Akram, Javed Rashid, Tagrid Abdullah N. Alshalali, Nadeem Sarwar
Format: Article
Language:English
Published: Springer 2025-07-01
Series:International Journal of Computational Intelligence Systems
Subjects:
Online Access:https://doi.org/10.1007/s44196-025-00919-z
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Hate speech (HS) has grown because of increasing social media platform usage, which includes Twitter, YouTube, and Facebook. The frequent attempts to implement automated detection systems remain unsuccessful at separating hate speech from objectionable language, because user-generated content tends toward informal, brief, and diverse expressions. The determination of hate speech within texts proves exceptionally hard, since precise context detection is needed to distinguish abusive language from neutral statements. Precision in hate speech identification and filtering stands essential, because these online content forms have negative impacts on both minority and majority groups while heightening their conflicts. The research presents a stacked ensemble classification system that classifies tweets into three groups: hate speech, abusive language, or neutral. The framework uses term frequency–inverse document frequency (TF–IDF) extracted from tweet texts for which support vector machine (SVM), together with Random Forest, XGBoost, and Logistic Regression, base machine learning models function as classifiers. The final model outcome results from linking several base learning models into an ensemble configuration. The Kaggle Hate Speech data set served as training material for the system, because it contained 24,784 tweets along with eight attributes. The model performance received improvement through exclusion of manually derived features. The proposed ensemble model demonstrated superior performance with 96% accuracy, while each single classifier had lower accuracy rates (SVM: 93%, Random Forest: 94%, and XGBoost: 88%). The research outcomes show stacking represents an effective method to enhance systems for detecting hate speech operating on social media platforms.
ISSN:1875-6883