Investigating Reddit Data on Type 2 Diabetes Management During the COVID-19 Pandemic Using Latent Dirichlet Allocation Topic Modeling and Valence Aware Dictionary for Sentiment Reasoning Analysis: Content Analysis

Abstract BackgroundType 2 diabetes (T2D) is a chronic disease that can be partially managed through healthy behaviors. However, the COVID-19 pandemic impacted how people managed T2D due to work and school closures and social isolation. Moreover, individuals with T2D were at in...

Full description

Saved in:
Bibliographic Details
Main Authors: Meghan Nagpal, Niloofar Jalali, Diana Sherifali, Plinio Morita, Joseph A Cafazzo
Format: Article
Language:English
Published: JMIR Publications 2025-02-01
Series:JMIR Formative Research
Online Access:https://formative.jmir.org/2025/1/e51154
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850205035716149248
author Meghan Nagpal
Niloofar Jalali
Diana Sherifali
Plinio Morita
Joseph A Cafazzo
author_facet Meghan Nagpal
Niloofar Jalali
Diana Sherifali
Plinio Morita
Joseph A Cafazzo
author_sort Meghan Nagpal
collection DOAJ
description Abstract BackgroundType 2 diabetes (T2D) is a chronic disease that can be partially managed through healthy behaviors. However, the COVID-19 pandemic impacted how people managed T2D due to work and school closures and social isolation. Moreover, individuals with T2D were at increased risk of complications from COVID-19 and experienced worsened mental health due to stress and anxiety. ObjectiveThis study aims to synthesize emerging themes related to the health behaviors of people living with T2D, and how they were affected during the early stages of the COVID-19 pandemic by examining Reddit forums dedicated to people living with T2D. MethodsData from Reddit forums related to T2D, from January 2018 to early March 2021, were downloaded using the Pushshift API; support vector machines were used to classify whether a post was made in the context of the pandemic. Latent Dirichlet allocation topic modelling was performed to identify topics of discussion across the entire dataset and a subsequent iteration was performed to identify topics specific to the COVID-19 pandemic. Sentiment analysis using the VADER (Valence Aware Dictionary for Sentiment Reasoning) algorithm was performed to assess attitudes towards the pandemic. ResultsFrom all posts, the identified topics of discussion were classified into the following themes: managing lifestyle (sentiment score 0.25, 95% CI 0.25-0.26), managing blood glucose (sentiment score 0.19, 95% CI 0.18-0.19), obtaining diabetes care (sentiment score 0.19, 95% CI 0.18-0.20), and coping and receiving support (sentiment score 0.34, 95% CI 0.33-0.35). Among the COVID-19–specific posts, the topics of discussion were coping with poor mental health (sentiment score 0.04, 95% CI −0.01 to0.11), accessing doctor and medications and controlling blood glucose (sentiment score 0.14, 95% CI 0.09-0.20), changing food habits during the pandemic (sentiment score 0.25, 95% CI 0.20-0.31), impact of stress on blood glucose levels (sentiment score 0.03, 95% CI −0.03 to 0.08), changing status of employment and insurance (sentiment score 0.17, 95% CI 0.13-0.22), and risk of COVID-19 complications (sentiment score 0.09, 95% CI 0.03-0.14). Overall, posts classified as COVID-19–related (0.12, 95% CI 0.01-0.15) were associated with a lower sentiment score than those classified as nonCOVID (0.25, 95% CI 0.24-0.25). This study was limited due to the lack of a method for assessing the demographics of users and verifying whether users had T2D. ConclusionsThemes identified from Reddit data suggested that the COVID-19 pandemic significantly influenced how people with T2D managed their disease, particularly in terms of accessing care and dealing with the complications of the virus. Overall, the early stages of the pandemic negatively impacted the attitudes of people living with T2D. This study demonstrates that social media data can be a qualitative data source for understanding patient perspectives.
format Article
id doaj-art-4c3beacc129f4ac68a99d505e809a45c
institution OA Journals
issn 2561-326X
language English
publishDate 2025-02-01
publisher JMIR Publications
record_format Article
series JMIR Formative Research
spelling doaj-art-4c3beacc129f4ac68a99d505e809a45c2025-08-20T02:11:11ZengJMIR PublicationsJMIR Formative Research2561-326X2025-02-019e51154e5115410.2196/51154Investigating Reddit Data on Type 2 Diabetes Management During the COVID-19 Pandemic Using Latent Dirichlet Allocation Topic Modeling and Valence Aware Dictionary for Sentiment Reasoning Analysis: Content AnalysisMeghan Nagpalhttp://orcid.org/0000-0002-5290-8809Niloofar Jalalihttp://orcid.org/0000-0002-7356-0573Diana Sherifalihttp://orcid.org/0000-0002-4423-3848Plinio Moritahttp://orcid.org/0000-0001-9515-6478Joseph A Cafazzohttp://orcid.org/0000-0002-3114-4440 Abstract BackgroundType 2 diabetes (T2D) is a chronic disease that can be partially managed through healthy behaviors. However, the COVID-19 pandemic impacted how people managed T2D due to work and school closures and social isolation. Moreover, individuals with T2D were at increased risk of complications from COVID-19 and experienced worsened mental health due to stress and anxiety. ObjectiveThis study aims to synthesize emerging themes related to the health behaviors of people living with T2D, and how they were affected during the early stages of the COVID-19 pandemic by examining Reddit forums dedicated to people living with T2D. MethodsData from Reddit forums related to T2D, from January 2018 to early March 2021, were downloaded using the Pushshift API; support vector machines were used to classify whether a post was made in the context of the pandemic. Latent Dirichlet allocation topic modelling was performed to identify topics of discussion across the entire dataset and a subsequent iteration was performed to identify topics specific to the COVID-19 pandemic. Sentiment analysis using the VADER (Valence Aware Dictionary for Sentiment Reasoning) algorithm was performed to assess attitudes towards the pandemic. ResultsFrom all posts, the identified topics of discussion were classified into the following themes: managing lifestyle (sentiment score 0.25, 95% CI 0.25-0.26), managing blood glucose (sentiment score 0.19, 95% CI 0.18-0.19), obtaining diabetes care (sentiment score 0.19, 95% CI 0.18-0.20), and coping and receiving support (sentiment score 0.34, 95% CI 0.33-0.35). Among the COVID-19–specific posts, the topics of discussion were coping with poor mental health (sentiment score 0.04, 95% CI −0.01 to0.11), accessing doctor and medications and controlling blood glucose (sentiment score 0.14, 95% CI 0.09-0.20), changing food habits during the pandemic (sentiment score 0.25, 95% CI 0.20-0.31), impact of stress on blood glucose levels (sentiment score 0.03, 95% CI −0.03 to 0.08), changing status of employment and insurance (sentiment score 0.17, 95% CI 0.13-0.22), and risk of COVID-19 complications (sentiment score 0.09, 95% CI 0.03-0.14). Overall, posts classified as COVID-19–related (0.12, 95% CI 0.01-0.15) were associated with a lower sentiment score than those classified as nonCOVID (0.25, 95% CI 0.24-0.25). This study was limited due to the lack of a method for assessing the demographics of users and verifying whether users had T2D. ConclusionsThemes identified from Reddit data suggested that the COVID-19 pandemic significantly influenced how people with T2D managed their disease, particularly in terms of accessing care and dealing with the complications of the virus. Overall, the early stages of the pandemic negatively impacted the attitudes of people living with T2D. This study demonstrates that social media data can be a qualitative data source for understanding patient perspectives.https://formative.jmir.org/2025/1/e51154
spellingShingle Meghan Nagpal
Niloofar Jalali
Diana Sherifali
Plinio Morita
Joseph A Cafazzo
Investigating Reddit Data on Type 2 Diabetes Management During the COVID-19 Pandemic Using Latent Dirichlet Allocation Topic Modeling and Valence Aware Dictionary for Sentiment Reasoning Analysis: Content Analysis
JMIR Formative Research
title Investigating Reddit Data on Type 2 Diabetes Management During the COVID-19 Pandemic Using Latent Dirichlet Allocation Topic Modeling and Valence Aware Dictionary for Sentiment Reasoning Analysis: Content Analysis
title_full Investigating Reddit Data on Type 2 Diabetes Management During the COVID-19 Pandemic Using Latent Dirichlet Allocation Topic Modeling and Valence Aware Dictionary for Sentiment Reasoning Analysis: Content Analysis
title_fullStr Investigating Reddit Data on Type 2 Diabetes Management During the COVID-19 Pandemic Using Latent Dirichlet Allocation Topic Modeling and Valence Aware Dictionary for Sentiment Reasoning Analysis: Content Analysis
title_full_unstemmed Investigating Reddit Data on Type 2 Diabetes Management During the COVID-19 Pandemic Using Latent Dirichlet Allocation Topic Modeling and Valence Aware Dictionary for Sentiment Reasoning Analysis: Content Analysis
title_short Investigating Reddit Data on Type 2 Diabetes Management During the COVID-19 Pandemic Using Latent Dirichlet Allocation Topic Modeling and Valence Aware Dictionary for Sentiment Reasoning Analysis: Content Analysis
title_sort investigating reddit data on type 2 diabetes management during the covid 19 pandemic using latent dirichlet allocation topic modeling and valence aware dictionary for sentiment reasoning analysis content analysis
url https://formative.jmir.org/2025/1/e51154
work_keys_str_mv AT meghannagpal investigatingredditdataontype2diabetesmanagementduringthecovid19pandemicusinglatentdirichletallocationtopicmodelingandvalenceawaredictionaryforsentimentreasoninganalysiscontentanalysis
AT niloofarjalali investigatingredditdataontype2diabetesmanagementduringthecovid19pandemicusinglatentdirichletallocationtopicmodelingandvalenceawaredictionaryforsentimentreasoninganalysiscontentanalysis
AT dianasherifali investigatingredditdataontype2diabetesmanagementduringthecovid19pandemicusinglatentdirichletallocationtopicmodelingandvalenceawaredictionaryforsentimentreasoninganalysiscontentanalysis
AT pliniomorita investigatingredditdataontype2diabetesmanagementduringthecovid19pandemicusinglatentdirichletallocationtopicmodelingandvalenceawaredictionaryforsentimentreasoninganalysiscontentanalysis
AT josephacafazzo investigatingredditdataontype2diabetesmanagementduringthecovid19pandemicusinglatentdirichletallocationtopicmodelingandvalenceawaredictionaryforsentimentreasoninganalysiscontentanalysis