Enhancing telemedicine service quality through sentiment analysis of user review dataset in IndonesiaMendeley Data

Sentiment analysis, a field within natural language processing, text mining, and computational linguistics, evaluates user opinions and product ratings. This article describes a dataset of user reviews collected from telemedicine applications in Indonesia to understand sentiments related to service...

Full description

Saved in:
Bibliographic Details
Main Authors: Edi Sutoyo, Muhammad Cekas Permana
Format: Article
Language:English
Published: Elsevier 2025-08-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S235234092500602X
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Sentiment analysis, a field within natural language processing, text mining, and computational linguistics, evaluates user opinions and product ratings. This article describes a dataset of user reviews collected from telemedicine applications in Indonesia to understand sentiments related to service quality. The dataset comprises 255,679 textual reviews containing positive and negative feedback, offering valuable input for analyzing user experiences. Reviews were sourced from publicly available platforms, ensuring diversity in user perspectives.The dataset exhibits significant class imbalance, with negative reviews constituting a small proportion compared to positive reviews (ratio exceeding 1:14). To address this imbalance, advanced resampling techniques, including Easy Data Augmentation (EDA), were applied. The dataset underwent rigorous preprocessing to remove noise, standardize content, and tokenize reviews for compatibility with deep learning models.This dataset has been utilized with architectures such as SRNN, 1D-CNN, 1L-LSTM, and BiLSTM for sentiment classification. Generated word clouds highlight frequently mentioned terms, enabling exploratory analysis. The dataset is publicly available, providing a resource for benchmarking sentiment classification algorithms and studying the impact of imbalanced data handling on model performance. This work contributes to enhancing telemedicine service quality and advancing Indonesian natural language processing research.
ISSN:2352-3409