Reddit comment analysis: sentiment prediction and topic modeling using VADER and BERTopic

Bibliographic Details
Main Authors: Denilson de Oliveira Silva, Richard Matheus Avelino da Silva, Patrícia Virgínia de Santana Lima, Jéssica Cristina Pereira Batista, Sílvio Fernando Alves Xavier Júnior
Format: Article
Language: English
Published: Universidade Federal de Pernambuco (UFPE) 2024-12-01
Series: Socioeconomic Analytics
Subjects:
Online Access: https://periodicos.ufpe.br/revistas/index.php/SECAN/article/view/265074
Description
Summary: This work explores data analysis techniques applied to the social media platform Reddit, with emphasis on an Exploratory Data Analysis (EDA) that identifies trends and patterns of interaction among users. Sentiment analysis of the comments uses the VADER model ("Valence Aware Dictionary and sEntiment Reasoner"), and topic modeling is performed with BERTopic ("Bidirectional Encoder Representations from Transformers for Topic Modeling"). The goal is to compare the accuracy and effectiveness of the models in classifying the emotions and themes expressed in the comments. The comparison identifies which approach yields the most accurate results in the context of Reddit discussions, providing insight into user behavior and preferences.
ISSN: 2965-4661
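
Note on the sentiment step: the summary names VADER but does not describe how comments are labeled. As a minimal sketch only, not the authors' pipeline, the following shows the normalization that VADER's reference implementation applies to a summed valence score (constant alpha = 15) and the conventional ±0.05 cut-offs for positive, negative, and neutral labels. The lexicon `TOY_LEXICON` and the helper names are hypothetical, with made-up valence values; the real VADER lexicon and its rules for negation, intensifiers, punctuation, and capitalization are omitted.

```python
import math

ALPHA = 15.0  # normalization constant used by VADER's reference implementation

# Hypothetical toy lexicon with made-up valences, for illustration only.
TOY_LEXICON = {"love": 3.0, "great": 3.0, "bad": -2.5, "awful": -2.0}


def normalize(raw_score: float, alpha: float = ALPHA) -> float:
    """Squash an unbounded summed valence into the [-1, 1] compound range."""
    compound = raw_score / math.sqrt(raw_score * raw_score + alpha)
    return max(-1.0, min(1.0, compound))


def label(compound: float, threshold: float = 0.05) -> str:
    """Apply the conventional compound-score cut-offs."""
    if compound >= threshold:
        return "positive"
    if compound <= -threshold:
        return "negative"
    return "neutral"


def score_comment(text: str) -> float:
    """Sum toy valences over whitespace tokens, then normalize.

    Assumption: plain lowercase matching with edge punctuation stripped;
    real VADER applies many additional heuristics.
    """
    tokens = (tok.strip(".,!?") for tok in text.lower().split())
    raw = sum(TOY_LEXICON.get(tok, 0.0) for tok in tokens)
    return normalize(raw)


if __name__ == "__main__":
    for comment in ["I love this, great thread!", "This is awful", "See the sidebar"]:
        compound = score_comment(comment)
        print(f"{comment!r}: {compound:+.3f} -> {label(compound)}")
```

In practice the `vaderSentiment` package would supply the lexicon and heuristics; the sketch only isolates how a compound score maps to a label.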