Multivariate time series clustering analysis of the Global Dietary Database to uncover patterns in dietary trends (1990–2018)

Abstract Objective: Understanding country-level nutrition intake is crucial to global nutritional policies that aim to reduce disparities and relevant disease burdens. Still, there are limited numbers of studies using clustering techniques to analyse the recent Global Dietary Database (GDD). This...

Full description

Saved in:
Bibliographic Details
Main Authors: Adriano Matousek, Tiffany H Leung, Herbert Pang
Format: Article
Language:English
Published: Cambridge University Press 2025-01-01
Series:Public Health Nutrition
Subjects:
Online Access:https://www.cambridge.org/core/product/identifier/S136898002500059X/type/journal_article
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850274879371214848
author Adriano Matousek
Tiffany H Leung
Herbert Pang
author_facet Adriano Matousek
Tiffany H Leung
Herbert Pang
author_sort Adriano Matousek
collection DOAJ
description Abstract Objective: Understanding country-level nutrition intake is crucial to global nutritional policies that aim to reduce disparities and relevant disease burdens. Still, there are limited numbers of studies using clustering techniques to analyse the recent Global Dietary Database (GDD). This study aims to extend an existing multivariate time series (MTS) clustering algorithm to allow for greater customisability and provide the first cluster analysis of the GDD to explore temporal trends in country-level nutrition profiles (1990–2018). Design: Trends in sugar-sweetened beverage intake and nutritional deficiency were explored using the newly developed programme ‘MTSclust’. Time series clustering algorithms are different from simple clustering approaches in their ability to appreciate temporal elements. Setting: Nutritional and demographical data from 176 countries were analysed from the GDD. Participants: Population representative samples of the 176 in the GDD. Results: In a three-class test specific to the domain, the MTSclust programme achieved a mean accuracy of 71·5 % (adjusted Rand Index [ARI] = 0·381) while the mean accuracy of a popular algorithm, DTWclust, was 58 % (ARI = 0·224). The clustering of nutritional deficiency and sugar-sweetened beverage intake identified several common trends among countries and found that these did not change by demographics. MTS clustering demonstrated a global convergence towards a Western diet. Conclusion: While global nutrition trends are associated with geography, demographic variables such as sex and age are less influential to the trends of certain nutrition intake. The literature could be further supplemented by applying outcome-guided methods to explore how these trends link to disease burdens.
format Article
id doaj-art-e97c348aadde4be8ada755072c9b672c
institution OA Journals
issn 1368-9800
1475-2727
language English
publishDate 2025-01-01
publisher Cambridge University Press
record_format Article
series Public Health Nutrition
spelling doaj-art-e97c348aadde4be8ada755072c9b672c2025-08-20T01:51:00ZengCambridge University PressPublic Health Nutrition1368-98001475-27272025-01-012810.1017/S136898002500059XMultivariate time series clustering analysis of the Global Dietary Database to uncover patterns in dietary trends (1990–2018)Adriano Matousek0Tiffany H Leung1https://orcid.org/0009-0002-1154-8234Herbert Pang2https://orcid.org/0000-0002-7896-6716Department of Public Health and Primary Care, University of Cambridge, Robinson Way, Cambridge, UKDepartment of Medicine, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, ChinaPD Data Science & Analytics, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC 27705, USA Abstract Objective: Understanding country-level nutrition intake is crucial to global nutritional policies that aim to reduce disparities and relevant disease burdens. Still, there are limited numbers of studies using clustering techniques to analyse the recent Global Dietary Database (GDD). This study aims to extend an existing multivariate time series (MTS) clustering algorithm to allow for greater customisability and provide the first cluster analysis of the GDD to explore temporal trends in country-level nutrition profiles (1990–2018). Design: Trends in sugar-sweetened beverage intake and nutritional deficiency were explored using the newly developed programme ‘MTSclust’. Time series clustering algorithms are different from simple clustering approaches in their ability to appreciate temporal elements. Setting: Nutritional and demographical data from 176 countries were analysed from the GDD. Participants: Population representative samples of the 176 in the GDD. Results: In a three-class test specific to the domain, the MTSclust programme achieved a mean accuracy of 71·5 % (adjusted Rand Index [ARI] = 0·381) while the mean accuracy of a popular algorithm, DTWclust, was 58 % (ARI = 0·224). The clustering of nutritional deficiency and sugar-sweetened beverage intake identified several common trends among countries and found that these did not change by demographics. MTS clustering demonstrated a global convergence towards a Western diet. Conclusion: While global nutrition trends are associated with geography, demographic variables such as sex and age are less influential to the trends of certain nutrition intake. The literature could be further supplemented by applying outcome-guided methods to explore how these trends link to disease burdens. https://www.cambridge.org/core/product/identifier/S136898002500059X/type/journal_articleClustering algorithmMachine learningTime-series clusteringTrendGlobal Dietary Database
spellingShingle Adriano Matousek
Tiffany H Leung
Herbert Pang
Multivariate time series clustering analysis of the Global Dietary Database to uncover patterns in dietary trends (1990–2018)
Public Health Nutrition
Clustering algorithm
Machine learning
Time-series clustering
Trend
Global Dietary Database
title Multivariate time series clustering analysis of the Global Dietary Database to uncover patterns in dietary trends (1990–2018)
title_full Multivariate time series clustering analysis of the Global Dietary Database to uncover patterns in dietary trends (1990–2018)
title_fullStr Multivariate time series clustering analysis of the Global Dietary Database to uncover patterns in dietary trends (1990–2018)
title_full_unstemmed Multivariate time series clustering analysis of the Global Dietary Database to uncover patterns in dietary trends (1990–2018)
title_short Multivariate time series clustering analysis of the Global Dietary Database to uncover patterns in dietary trends (1990–2018)
title_sort multivariate time series clustering analysis of the global dietary database to uncover patterns in dietary trends 1990 2018
topic Clustering algorithm
Machine learning
Time-series clustering
Trend
Global Dietary Database
url https://www.cambridge.org/core/product/identifier/S136898002500059X/type/journal_article
work_keys_str_mv AT adrianomatousek multivariatetimeseriesclusteringanalysisoftheglobaldietarydatabasetouncoverpatternsindietarytrends19902018
AT tiffanyhleung multivariatetimeseriesclusteringanalysisoftheglobaldietarydatabasetouncoverpatternsindietarytrends19902018
AT herbertpang multivariatetimeseriesclusteringanalysisoftheglobaldietarydatabasetouncoverpatternsindietarytrends19902018