Leveraging machine learning techniques to analyze nutritional content in processed foods

Abstract The global shift towards plant-based diets, particularly in India, is driven by environmental and ethical considerations. While plant foods are often regarded as more sustainable, concerns persist regarding protein quality, especially after processing. With protein deficiencies being preval...

Full description

Saved in:
Bibliographic Details
Main Authors: K. A. Muthukumar, Soumya Gupta, Doli Saikia
Format: Article
Language:English
Published: Springer 2024-12-01
Series:Discover Food
Subjects:
Online Access:https://doi.org/10.1007/s44187-024-00253-x
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850102151304445952
author K. A. Muthukumar
Soumya Gupta
Doli Saikia
author_facet K. A. Muthukumar
Soumya Gupta
Doli Saikia
author_sort K. A. Muthukumar
collection DOAJ
description Abstract The global shift towards plant-based diets, particularly in India, is driven by environmental and ethical considerations. While plant foods are often regarded as more sustainable, concerns persist regarding protein quality, especially after processing. With protein deficiencies being prevalent among Indians, it is crucial to understand the impact of food processing on nutrient retention. This research integrates machine learning with food science to develop a comprehensive AI framework for forecasting the protein content of various plant-based sources following both traditional and non-conventional processing methods. A robust database was compiled using sources such as Web of Science, Scopus, PubMed, and Google Scholar, covering a wide range of plant-based foods and their protein content before and after processing. After data preprocessing, two primary machine learning algorithms were employed: Support Vector Regression (SVR) and Random Forest (RF), both implemented using Scikit-learn. The SVR model was optimized to identify the best-fitting hyperplane in high-dimensional space, while the RF model utilized GridSearchCV for hyperparameter tuning and performed a “Feature Importance Analysis” to identify key factors influencing the outcomes. Model performance was evaluated using Normalized Mean Squared Error (NMSE) as the evaluation metric. The results indicated that the RF model achieved an NMSE of approximately 0.35, reflecting a moderate level of prediction error relative to data variance. In contrast, the SVR model significantly outperformed the RF model, with an NMSE of approximately 0.03, demonstrating superior accuracy and efficiency in predicting nutrient retention. This study leverages machine learning to bridge a critical gap in understanding nutrient retention in plant-based foods during processing. The findings reveal that the SVR model is particularly effective in predicting nutrient retention, outperforming the RF model. This novel approach holds significant potential to optimize nutrient retention in plant-based food products, offering important implications for public health and food quality.
format Article
id doaj-art-d90ec78769fb4a879cb78b521112e119
institution DOAJ
issn 2731-4286
language English
publishDate 2024-12-01
publisher Springer
record_format Article
series Discover Food
spelling doaj-art-d90ec78769fb4a879cb78b521112e1192025-08-20T02:39:50ZengSpringerDiscover Food2731-42862024-12-014111710.1007/s44187-024-00253-xLeveraging machine learning techniques to analyze nutritional content in processed foodsK. A. Muthukumar0Soumya Gupta1Doli Saikia2School of Computer Science, UPES DehardunSchool of Health Science and Technology, UPES DehardunSchool of Health Science and Technology, UPES DehardunAbstract The global shift towards plant-based diets, particularly in India, is driven by environmental and ethical considerations. While plant foods are often regarded as more sustainable, concerns persist regarding protein quality, especially after processing. With protein deficiencies being prevalent among Indians, it is crucial to understand the impact of food processing on nutrient retention. This research integrates machine learning with food science to develop a comprehensive AI framework for forecasting the protein content of various plant-based sources following both traditional and non-conventional processing methods. A robust database was compiled using sources such as Web of Science, Scopus, PubMed, and Google Scholar, covering a wide range of plant-based foods and their protein content before and after processing. After data preprocessing, two primary machine learning algorithms were employed: Support Vector Regression (SVR) and Random Forest (RF), both implemented using Scikit-learn. The SVR model was optimized to identify the best-fitting hyperplane in high-dimensional space, while the RF model utilized GridSearchCV for hyperparameter tuning and performed a “Feature Importance Analysis” to identify key factors influencing the outcomes. Model performance was evaluated using Normalized Mean Squared Error (NMSE) as the evaluation metric. The results indicated that the RF model achieved an NMSE of approximately 0.35, reflecting a moderate level of prediction error relative to data variance. In contrast, the SVR model significantly outperformed the RF model, with an NMSE of approximately 0.03, demonstrating superior accuracy and efficiency in predicting nutrient retention. This study leverages machine learning to bridge a critical gap in understanding nutrient retention in plant-based foods during processing. The findings reveal that the SVR model is particularly effective in predicting nutrient retention, outperforming the RF model. This novel approach holds significant potential to optimize nutrient retention in plant-based food products, offering important implications for public health and food quality.https://doi.org/10.1007/s44187-024-00253-xPlant proteinTotal proteinProcessing techniquesMachine learningSupport vector regressionRandom forest
spellingShingle K. A. Muthukumar
Soumya Gupta
Doli Saikia
Leveraging machine learning techniques to analyze nutritional content in processed foods
Discover Food
Plant protein
Total protein
Processing techniques
Machine learning
Support vector regression
Random forest
title Leveraging machine learning techniques to analyze nutritional content in processed foods
title_full Leveraging machine learning techniques to analyze nutritional content in processed foods
title_fullStr Leveraging machine learning techniques to analyze nutritional content in processed foods
title_full_unstemmed Leveraging machine learning techniques to analyze nutritional content in processed foods
title_short Leveraging machine learning techniques to analyze nutritional content in processed foods
title_sort leveraging machine learning techniques to analyze nutritional content in processed foods
topic Plant protein
Total protein
Processing techniques
Machine learning
Support vector regression
Random forest
url https://doi.org/10.1007/s44187-024-00253-x
work_keys_str_mv AT kamuthukumar leveragingmachinelearningtechniquestoanalyzenutritionalcontentinprocessedfoods
AT soumyagupta leveragingmachinelearningtechniquestoanalyzenutritionalcontentinprocessedfoods
AT dolisaikia leveragingmachinelearningtechniquestoanalyzenutritionalcontentinprocessedfoods