Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC Context
Novel methods of using smartphones to collect speech records of dietary intake facilitate self-reporting of intake in free-living conditions, and may further reduce biases affecting reliable capture such as complexity and reactivity. However, the processing of speech records for dietary assessment i...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10945822/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849738967957635072 |
|---|---|
| author | Connor T. Dodd Marc T. P. Adam Janelle L. Windus Megan E. Rollo |
| author_facet | Connor T. Dodd Marc T. P. Adam Janelle L. Windus Megan E. Rollo |
| author_sort | Connor T. Dodd |
| collection | DOAJ |
| description | Novel methods of using smartphones to collect speech records of dietary intake facilitate self-reporting of intake in free-living conditions, and may further reduce biases affecting reliable capture such as complexity and reactivity. However, the processing of speech records for dietary assessment is a time-consuming task for analysts and thus restricted by research infrastructure. Low- and Low-Middle Income Countries have been disproportionately affected by barriers to dietary assessment (e.g., due to varying literacy or limited research infrastructure). As such, speech records could be a promising avenue to address the research gap in Low and Low-Middle Income regions. While recent advances in speech recognition and natural language processing technology have facilitated automation of this process, no existing studies have evaluated the effectiveness of this technology when applied to an LLMIC context. To this end we adapt the methods identified in a review of the literature to this new context and evaluate their performance on a data set of Khmer speech recordings describing dietary data captured in a free-living context in Cambodia. |
| format | Article |
| id | doaj-art-da09a5d0eb664068b0e68e81aae6edc0 |
| institution | DOAJ |
| issn | 2169-3536 |
| language | English |
| publishDate | 2025-01-01 |
| publisher | IEEE |
| record_format | Article |
| series | IEEE Access |
| spelling | doaj-art-da09a5d0eb664068b0e68e81aae6edc02025-08-20T03:06:24ZengIEEEIEEE Access2169-35362025-01-0113599115992210.1109/ACCESS.2025.355599810945822Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC ContextConnor T. Dodd0https://orcid.org/0000-0002-4141-4387Marc T. P. Adam1https://orcid.org/0000-0002-6036-4282Janelle L. Windus2https://orcid.org/0000-0003-0108-9505Megan E. Rollo3https://orcid.org/0000-0003-1303-2063School of Information and Physical Sciences, The University of Newcastle, Callaghan, NSW, AustraliaSchool of Information and Physical Sciences, The University of Newcastle, Callaghan, NSW, AustraliaSchool of Health Sciences, The University of Newcastle, Callaghan, NSW, AustraliaSchool of Health Sciences, The University of Newcastle, Callaghan, NSW, AustraliaNovel methods of using smartphones to collect speech records of dietary intake facilitate self-reporting of intake in free-living conditions, and may further reduce biases affecting reliable capture such as complexity and reactivity. However, the processing of speech records for dietary assessment is a time-consuming task for analysts and thus restricted by research infrastructure. Low- and Low-Middle Income Countries have been disproportionately affected by barriers to dietary assessment (e.g., due to varying literacy or limited research infrastructure). As such, speech records could be a promising avenue to address the research gap in Low and Low-Middle Income regions. While recent advances in speech recognition and natural language processing technology have facilitated automation of this process, no existing studies have evaluated the effectiveness of this technology when applied to an LLMIC context. To this end we adapt the methods identified in a review of the literature to this new context and evaluate their performance on a data set of Khmer speech recordings describing dietary data captured in a free-living context in Cambodia.https://ieeexplore.ieee.org/document/10945822/Dietary assessmentfood recordingnatural language processingsemi-automationspeech-to-text |
| spellingShingle | Connor T. Dodd Marc T. P. Adam Janelle L. Windus Megan E. Rollo Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC Context IEEE Access Dietary assessment food recording natural language processing semi-automation speech-to-text |
| title | Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC Context |
| title_full | Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC Context |
| title_fullStr | Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC Context |
| title_full_unstemmed | Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC Context |
| title_short | Automated Processing of Speech Recordings for Dietary Assessment: Evaluation in the LLMIC Context |
| title_sort | automated processing of speech recordings for dietary assessment evaluation in the llmic context |
| topic | Dietary assessment food recording natural language processing semi-automation speech-to-text |
| url | https://ieeexplore.ieee.org/document/10945822/ |
| work_keys_str_mv | AT connortdodd automatedprocessingofspeechrecordingsfordietaryassessmentevaluationinthellmiccontext AT marctpadam automatedprocessingofspeechrecordingsfordietaryassessmentevaluationinthellmiccontext AT janellelwindus automatedprocessingofspeechrecordingsfordietaryassessmentevaluationinthellmiccontext AT meganerollo automatedprocessingofspeechrecordingsfordietaryassessmentevaluationinthellmiccontext |