Leveraging machine learning techniques to analyze nutritional content in processed foods
Abstract The global shift towards plant-based diets, particularly in India, is driven by environmental and ethical considerations. While plant foods are often regarded as more sustainable, concerns persist regarding protein quality, especially after processing. With protein deficiencies being preval...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Springer
2024-12-01
|
| Series: | Discover Food |
| Subjects: | |
| Online Access: | https://doi.org/10.1007/s44187-024-00253-x |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850102151304445952 |
|---|---|
| author | K. A. Muthukumar Soumya Gupta Doli Saikia |
| author_facet | K. A. Muthukumar Soumya Gupta Doli Saikia |
| author_sort | K. A. Muthukumar |
| collection | DOAJ |
| description | Abstract The global shift towards plant-based diets, particularly in India, is driven by environmental and ethical considerations. While plant foods are often regarded as more sustainable, concerns persist regarding protein quality, especially after processing. With protein deficiencies being prevalent among Indians, it is crucial to understand the impact of food processing on nutrient retention. This research integrates machine learning with food science to develop a comprehensive AI framework for forecasting the protein content of various plant-based sources following both traditional and non-conventional processing methods. A robust database was compiled using sources such as Web of Science, Scopus, PubMed, and Google Scholar, covering a wide range of plant-based foods and their protein content before and after processing. After data preprocessing, two primary machine learning algorithms were employed: Support Vector Regression (SVR) and Random Forest (RF), both implemented using Scikit-learn. The SVR model was optimized to identify the best-fitting hyperplane in high-dimensional space, while the RF model utilized GridSearchCV for hyperparameter tuning and performed a “Feature Importance Analysis” to identify key factors influencing the outcomes. Model performance was evaluated using Normalized Mean Squared Error (NMSE) as the evaluation metric. The results indicated that the RF model achieved an NMSE of approximately 0.35, reflecting a moderate level of prediction error relative to data variance. In contrast, the SVR model significantly outperformed the RF model, with an NMSE of approximately 0.03, demonstrating superior accuracy and efficiency in predicting nutrient retention. This study leverages machine learning to bridge a critical gap in understanding nutrient retention in plant-based foods during processing. The findings reveal that the SVR model is particularly effective in predicting nutrient retention, outperforming the RF model. This novel approach holds significant potential to optimize nutrient retention in plant-based food products, offering important implications for public health and food quality. |
| format | Article |
| id | doaj-art-d90ec78769fb4a879cb78b521112e119 |
| institution | DOAJ |
| issn | 2731-4286 |
| language | English |
| publishDate | 2024-12-01 |
| publisher | Springer |
| record_format | Article |
| series | Discover Food |
| spelling | doaj-art-d90ec78769fb4a879cb78b521112e1192025-08-20T02:39:50ZengSpringerDiscover Food2731-42862024-12-014111710.1007/s44187-024-00253-xLeveraging machine learning techniques to analyze nutritional content in processed foodsK. A. Muthukumar0Soumya Gupta1Doli Saikia2School of Computer Science, UPES DehardunSchool of Health Science and Technology, UPES DehardunSchool of Health Science and Technology, UPES DehardunAbstract The global shift towards plant-based diets, particularly in India, is driven by environmental and ethical considerations. While plant foods are often regarded as more sustainable, concerns persist regarding protein quality, especially after processing. With protein deficiencies being prevalent among Indians, it is crucial to understand the impact of food processing on nutrient retention. This research integrates machine learning with food science to develop a comprehensive AI framework for forecasting the protein content of various plant-based sources following both traditional and non-conventional processing methods. A robust database was compiled using sources such as Web of Science, Scopus, PubMed, and Google Scholar, covering a wide range of plant-based foods and their protein content before and after processing. After data preprocessing, two primary machine learning algorithms were employed: Support Vector Regression (SVR) and Random Forest (RF), both implemented using Scikit-learn. The SVR model was optimized to identify the best-fitting hyperplane in high-dimensional space, while the RF model utilized GridSearchCV for hyperparameter tuning and performed a “Feature Importance Analysis” to identify key factors influencing the outcomes. Model performance was evaluated using Normalized Mean Squared Error (NMSE) as the evaluation metric. The results indicated that the RF model achieved an NMSE of approximately 0.35, reflecting a moderate level of prediction error relative to data variance. In contrast, the SVR model significantly outperformed the RF model, with an NMSE of approximately 0.03, demonstrating superior accuracy and efficiency in predicting nutrient retention. This study leverages machine learning to bridge a critical gap in understanding nutrient retention in plant-based foods during processing. The findings reveal that the SVR model is particularly effective in predicting nutrient retention, outperforming the RF model. This novel approach holds significant potential to optimize nutrient retention in plant-based food products, offering important implications for public health and food quality.https://doi.org/10.1007/s44187-024-00253-xPlant proteinTotal proteinProcessing techniquesMachine learningSupport vector regressionRandom forest |
| spellingShingle | K. A. Muthukumar Soumya Gupta Doli Saikia Leveraging machine learning techniques to analyze nutritional content in processed foods Discover Food Plant protein Total protein Processing techniques Machine learning Support vector regression Random forest |
| title | Leveraging machine learning techniques to analyze nutritional content in processed foods |
| title_full | Leveraging machine learning techniques to analyze nutritional content in processed foods |
| title_fullStr | Leveraging machine learning techniques to analyze nutritional content in processed foods |
| title_full_unstemmed | Leveraging machine learning techniques to analyze nutritional content in processed foods |
| title_short | Leveraging machine learning techniques to analyze nutritional content in processed foods |
| title_sort | leveraging machine learning techniques to analyze nutritional content in processed foods |
| topic | Plant protein Total protein Processing techniques Machine learning Support vector regression Random forest |
| url | https://doi.org/10.1007/s44187-024-00253-x |
| work_keys_str_mv | AT kamuthukumar leveragingmachinelearningtechniquestoanalyzenutritionalcontentinprocessedfoods AT soumyagupta leveragingmachinelearningtechniquestoanalyzenutritionalcontentinprocessedfoods AT dolisaikia leveragingmachinelearningtechniquestoanalyzenutritionalcontentinprocessedfoods |