MLP-SVM: a hybrid approach for improving the performance of the classification model for health-related documents from social media using multi-layer perceptron and support vector machine
Abstract In this digital age, due to the rise in medical-related data, there is a concentration in data preprocessing, analysis, prediction, and classification of documents written in natural languages. The aim of this paper is to form a Hybrid model (MLP-SVM) using multilayer perceptron and support...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Springer
2025-04-01
|
| Series: | Discover Applied Sciences |
| Subjects: | |
| Online Access: | https://doi.org/10.1007/s42452-025-06851-3 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Abstract In this digital age, due to the rise in medical-related data, there is a concentration in data preprocessing, analysis, prediction, and classification of documents written in natural languages. The aim of this paper is to form a Hybrid model (MLP-SVM) using multilayer perceptron and support vector machine aimed at analyzing and classifying the Afaan Oromo health-related data into predefined labels. Due to its quickness and affordability, automatic classification has gained more interest in this domain. We collected health text data, comments, and posts from the social media in Afaan Oromo. The text preprocessing tasks are applied to data to remove unnecessary texts, punctuation, and numbers. A feature extraction task is applied to obtain a standard Afaan Oromo health dataset using TF-IDF. The demonstration was done using the proposed model, in which the output from the last hidden-layer of the neural network becomes input to the SVM, which finally classifies into various desired classes. Additionally, we employed other methods, such as multi-layer perceptron, logistic regression, SVM, MLP-LR, and the K-NN classifier, for the soundness of the proposed methods in our experiment. Grounded on the comparative study, the proposed model outperformed the other by achieving 98.23% in both accuracy and F1-score. This experimental result presents that the hybrid model is able to enhance the performance of the classifier alone. |
|---|---|
| ISSN: | 3004-9261 |