Association Analysis of Food Risk Factors Based on Improved FP-growth Algorithm
In order to solve the problems of strong subjectivity and low targeting in sampling decision-making that exist in food safety surveillance sampling, this study proposed a correlation analysis method based on an improved Frequent Pattern-growth (FP-growth) algorithm for food risk factors. First, the...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
China Food Publishing Company
2024-12-01
|
Series: | Shipin Kexue |
Subjects: | |
Online Access: | https://www.spkx.net.cn/fileup/1002-6630/PDF/2024-45-23-028.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832539832132304896 |
---|---|
author | YU Jiabin, MA Xinyue, ZHAO Zhiyao, WANG Xiaoyi, ZHANG Xin, CUI Xiaoyu, BAI Yuting, CHEN Shuaixiang |
author_facet | YU Jiabin, MA Xinyue, ZHAO Zhiyao, WANG Xiaoyi, ZHANG Xin, CUI Xiaoyu, BAI Yuting, CHEN Shuaixiang |
author_sort | YU Jiabin, MA Xinyue, ZHAO Zhiyao, WANG Xiaoyi, ZHANG Xin, CUI Xiaoyu, BAI Yuting, CHEN Shuaixiang |
collection | DOAJ |
description | In order to solve the problems of strong subjectivity and low targeting in sampling decision-making that exist in food safety surveillance sampling, this study proposed a correlation analysis method based on an improved Frequent Pattern-growth (FP-growth) algorithm for food risk factors. First, the entropy weight method was used to assign weights to the risk indicators of food categories so as to calculate the risk indices of different food categories. Second, the risk index was used as a feature for risk clustering based on MiniBatchKmeans to obtain the risk level of food products. Finally, an improved FP-growth algorithm with constraints was used for association rule mining of food risk factors to excavate the association relationship between the risk level of food products and the information of food types, time, and geographic attributes, and the mined results were analyzed by correlation analysis so as to provide guidance for precise targeting to guide the decision making of sampling inspection. This study was based on food sampling data from certain regions of China in 2019, which were assigned with indicators to calculate the risk index. Afterwards, the risk was clustered into low (L), medium (M), and high risk (H). Finally, the data was imported into the improved FP-growth algorithm to obtain the association rules of food risk factors. For 17 214 pieces of sampling data, the improved FP-growth algorithm had a shorter running time when compared with the Apriori algorithm. Compared with the traditional one, the improved FP-growth algorithm removed invalid rules and improved the analysis efficiency of the association rules of food risk factors. Thus, it provides an accurate and efficient decision-making basis for the sampling work of food regulatory authorities. |
format | Article |
id | doaj-art-8b41754cdec344db88c1b70dd14b33ef |
institution | Kabale University |
issn | 1002-6630 |
language | English |
publishDate | 2024-12-01 |
publisher | China Food Publishing Company |
record_format | Article |
series | Shipin Kexue |
spelling | doaj-art-8b41754cdec344db88c1b70dd14b33ef2025-02-05T09:07:53ZengChina Food Publishing CompanyShipin Kexue1002-66302024-12-01452325025810.7506/spkx1002-6630-20240206-051Association Analysis of Food Risk Factors Based on Improved FP-growth AlgorithmYU Jiabin, MA Xinyue, ZHAO Zhiyao, WANG Xiaoyi, ZHANG Xin, CUI Xiaoyu, BAI Yuting, CHEN Shuaixiang0(1. School of Computer and Artificial Intelligence, Beijing Technology and Business University, Beijing 100048, China;2. Key Laboratory of Industrial Internet and Big Data, China National Light Industry, Beijing Technology and Business University, Beijing 100048, China; 3. School of Arts and Sciences, Beijing Institute of Fashion Technology, Beijing 100029, China)In order to solve the problems of strong subjectivity and low targeting in sampling decision-making that exist in food safety surveillance sampling, this study proposed a correlation analysis method based on an improved Frequent Pattern-growth (FP-growth) algorithm for food risk factors. First, the entropy weight method was used to assign weights to the risk indicators of food categories so as to calculate the risk indices of different food categories. Second, the risk index was used as a feature for risk clustering based on MiniBatchKmeans to obtain the risk level of food products. Finally, an improved FP-growth algorithm with constraints was used for association rule mining of food risk factors to excavate the association relationship between the risk level of food products and the information of food types, time, and geographic attributes, and the mined results were analyzed by correlation analysis so as to provide guidance for precise targeting to guide the decision making of sampling inspection. This study was based on food sampling data from certain regions of China in 2019, which were assigned with indicators to calculate the risk index. Afterwards, the risk was clustered into low (L), medium (M), and high risk (H). Finally, the data was imported into the improved FP-growth algorithm to obtain the association rules of food risk factors. For 17 214 pieces of sampling data, the improved FP-growth algorithm had a shorter running time when compared with the Apriori algorithm. Compared with the traditional one, the improved FP-growth algorithm removed invalid rules and improved the analysis efficiency of the association rules of food risk factors. Thus, it provides an accurate and efficient decision-making basis for the sampling work of food regulatory authorities.https://www.spkx.net.cn/fileup/1002-6630/PDF/2024-45-23-028.pdffood safety surveillance sampling; association analysis; entropy weight method; minibatchkmeans clustering; frequent pattern-growth algorithm |
spellingShingle | YU Jiabin, MA Xinyue, ZHAO Zhiyao, WANG Xiaoyi, ZHANG Xin, CUI Xiaoyu, BAI Yuting, CHEN Shuaixiang Association Analysis of Food Risk Factors Based on Improved FP-growth Algorithm Shipin Kexue food safety surveillance sampling; association analysis; entropy weight method; minibatchkmeans clustering; frequent pattern-growth algorithm |
title | Association Analysis of Food Risk Factors Based on Improved FP-growth Algorithm |
title_full | Association Analysis of Food Risk Factors Based on Improved FP-growth Algorithm |
title_fullStr | Association Analysis of Food Risk Factors Based on Improved FP-growth Algorithm |
title_full_unstemmed | Association Analysis of Food Risk Factors Based on Improved FP-growth Algorithm |
title_short | Association Analysis of Food Risk Factors Based on Improved FP-growth Algorithm |
title_sort | association analysis of food risk factors based on improved fp growth algorithm |
topic | food safety surveillance sampling; association analysis; entropy weight method; minibatchkmeans clustering; frequent pattern-growth algorithm |
url | https://www.spkx.net.cn/fileup/1002-6630/PDF/2024-45-23-028.pdf |
work_keys_str_mv | AT yujiabinmaxinyuezhaozhiyaowangxiaoyizhangxincuixiaoyubaiyutingchenshuaixiang associationanalysisoffoodriskfactorsbasedonimprovedfpgrowthalgorithm |