Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC Context

Novel methods of using smartphones to collect speech records of dietary intake facilitate self-reporting of intake in free-living conditions, and may further reduce biases affecting reliable capture such as complexity and reactivity. However, the processing of speech records for dietary assessment i...

Full description

Saved in:
Bibliographic Details
Main Authors: Connor T. Dodd, Marc T. P. Adam, Janelle L. Windus, Megan E. Rollo
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10945822/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849738967957635072
author Connor T. Dodd
Marc T. P. Adam
Janelle L. Windus
Megan E. Rollo
author_facet Connor T. Dodd
Marc T. P. Adam
Janelle L. Windus
Megan E. Rollo
author_sort Connor T. Dodd
collection DOAJ
description Novel methods of using smartphones to collect speech records of dietary intake facilitate self-reporting of intake in free-living conditions, and may further reduce biases affecting reliable capture such as complexity and reactivity. However, the processing of speech records for dietary assessment is a time-consuming task for analysts and thus restricted by research infrastructure. Low- and Low-Middle Income Countries have been disproportionately affected by barriers to dietary assessment (e.g., due to varying literacy or limited research infrastructure). As such, speech records could be a promising avenue to address the research gap in Low and Low-Middle Income regions. While recent advances in speech recognition and natural language processing technology have facilitated automation of this process, no existing studies have evaluated the effectiveness of this technology when applied to an LLMIC context. To this end we adapt the methods identified in a review of the literature to this new context and evaluate their performance on a data set of Khmer speech recordings describing dietary data captured in a free-living context in Cambodia.
format Article
id doaj-art-da09a5d0eb664068b0e68e81aae6edc0
institution DOAJ
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-da09a5d0eb664068b0e68e81aae6edc02025-08-20T03:06:24ZengIEEEIEEE Access2169-35362025-01-0113599115992210.1109/ACCESS.2025.355599810945822Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC ContextConnor T. Dodd0https://orcid.org/0000-0002-4141-4387Marc T. P. Adam1https://orcid.org/0000-0002-6036-4282Janelle L. Windus2https://orcid.org/0000-0003-0108-9505Megan E. Rollo3https://orcid.org/0000-0003-1303-2063School of Information and Physical Sciences, The University of Newcastle, Callaghan, NSW, AustraliaSchool of Information and Physical Sciences, The University of Newcastle, Callaghan, NSW, AustraliaSchool of Health Sciences, The University of Newcastle, Callaghan, NSW, AustraliaSchool of Health Sciences, The University of Newcastle, Callaghan, NSW, AustraliaNovel methods of using smartphones to collect speech records of dietary intake facilitate self-reporting of intake in free-living conditions, and may further reduce biases affecting reliable capture such as complexity and reactivity. However, the processing of speech records for dietary assessment is a time-consuming task for analysts and thus restricted by research infrastructure. Low- and Low-Middle Income Countries have been disproportionately affected by barriers to dietary assessment (e.g., due to varying literacy or limited research infrastructure). As such, speech records could be a promising avenue to address the research gap in Low and Low-Middle Income regions. While recent advances in speech recognition and natural language processing technology have facilitated automation of this process, no existing studies have evaluated the effectiveness of this technology when applied to an LLMIC context. To this end we adapt the methods identified in a review of the literature to this new context and evaluate their performance on a data set of Khmer speech recordings describing dietary data captured in a free-living context in Cambodia.https://ieeexplore.ieee.org/document/10945822/Dietary assessmentfood recordingnatural language processingsemi-automationspeech-to-text
spellingShingle Connor T. Dodd
Marc T. P. Adam
Janelle L. Windus
Megan E. Rollo
Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC Context
IEEE Access
Dietary assessment
food recording
natural language processing
semi-automation
speech-to-text
title Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC Context
title_full Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC Context
title_fullStr Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC Context
title_full_unstemmed Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC Context
title_short Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC Context
title_sort automated processing of speech recordings for dietary assessment evaluation in the llmic context
topic Dietary assessment
food recording
natural language processing
semi-automation
speech-to-text
url https://ieeexplore.ieee.org/document/10945822/
work_keys_str_mv AT connortdodd automatedprocessingofspeechrecordingsfordietaryassessmentevaluationinthellmiccontext
AT marctpadam automatedprocessingofspeechrecordingsfordietaryassessmentevaluationinthellmiccontext
AT janellelwindus automatedprocessingofspeechrecordingsfordietaryassessmentevaluationinthellmiccontext
AT meganerollo automatedprocessingofspeechrecordingsfordietaryassessmentevaluationinthellmiccontext