Association Analysis of Food Risk Factors Based on Improved FP-growth Algorithm

To address the strong subjectivity and poor targeting of sampling decisions in food safety surveillance, this study proposed an association analysis method for food risk factors based on an improved Frequent Pattern-growth (FP-growth) algorithm. First, the...

Bibliographic Details
Main Authors: YU Jiabin, MA Xinyue, ZHAO Zhiyao, WANG Xiaoyi, ZHANG Xin, CUI Xiaoyu, BAI Yuting, CHEN Shuaixiang
Format: Article
Language: English
Published: China Food Publishing Company 2024-12-01
Series: Shipin Kexue
Subjects:
Online Access: https://www.spkx.net.cn/fileup/1002-6630/PDF/2024-45-23-028.pdf
collection DOAJ
description To address the strong subjectivity and poor targeting of sampling decisions in food safety surveillance, this study proposed an association analysis method for food risk factors based on an improved Frequent Pattern-growth (FP-growth) algorithm. First, the entropy weight method was used to assign weights to the risk indicators of food categories and thereby calculate a risk index for each category. Second, the risk index was used as the feature for MiniBatchKmeans clustering to obtain the risk level of food products. Finally, an improved FP-growth algorithm with constraints was used to mine association rules linking the risk level of food products to food type, time, and geographic attributes, and the mined rules were analyzed through association analysis to guide precisely targeted sampling inspection decisions. The method was applied to food sampling data collected in certain regions of China in 2019: indicators were assigned and risk indices calculated, the risk was clustered into low (L), medium (M), and high (H) levels, and the data were then fed into the improved FP-growth algorithm to obtain association rules for food risk factors. On 17 214 sampling records, the improved FP-growth algorithm ran faster than the Apriori algorithm. Compared with the traditional FP-growth algorithm, the improved version removed invalid rules and made the analysis of association rules for food risk factors more efficient. It thus provides an accurate and efficient decision-making basis for the sampling work of food regulatory authorities.
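The first step in the description, weighting risk indicators by the entropy weight method and combining them into a risk index, can be sketched as follows. This is a minimal illustration of the textbook entropy weight method, not the authors' implementation: the record does not give their formulas, and the indicator values, the min-max normalization, and the names `entropy_weights` and `risk_index` are assumptions made here for illustration.

```python
import math


def entropy_weights(rows):
    """Return (weights, normalized_rows) for a list of indicator rows.

    Assumes every indicator is "larger means riskier" and uses min-max
    normalization; both are illustrative choices, not taken from the paper.
    """
    n = len(rows)
    m = len(rows[0])
    norm_cols = []
    for col in zip(*rows):
        lo, hi = min(col), max(col)
        span = hi - lo
        # A constant column normalizes to all zeros.
        norm_cols.append([(v - lo) / span if span else 0.0 for v in col])

    k = 1.0 / math.log(n)  # scales entropy into [0, 1]
    divergences = []
    for col in norm_cols:
        total = sum(col)
        if total == 0:
            # Constant indicator: carries no information, so no weight.
            divergences.append(0.0)
            continue
        entropy = -k * sum(
            (v / total) * math.log(v / total) for v in col if v > 0
        )
        divergences.append(1.0 - entropy)

    d_sum = sum(divergences)
    if d_sum == 0:
        weights = [1.0 / m] * m  # fall back to equal weights
    else:
        weights = [d / d_sum for d in divergences]
    return weights, [list(r) for r in zip(*norm_cols)]


def risk_index(rows):
    """Weighted sum of normalized indicators for each row."""
    weights, norm_rows = entropy_weights(rows)
    return [sum(w * v for w, v in zip(weights, r)) for r in norm_rows]


# Hypothetical indicator values for three food categories
# (columns might be, e.g., failure rate, a constant flag, severity score).
rows = [
    [0.1, 5.0, 1.0],
    [0.4, 5.0, 3.0],
    [0.9, 5.0, 2.0],
]
weights, _ = entropy_weights(rows)
risk = risk_index(rows)
print(weights)
print(risk)
```

In this sketch an indicator that takes the same value for every category receives zero weight, which is the property that motivates entropy weighting: indicators that vary more across categories contribute more to the index. The resulting one-dimensional risk index is what the description says is then clustered into low, medium, and high levels.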
id doaj-art-8b41754cdec344db88c1b70dd14b33ef
institution Kabale University
issn 1002-6630
spelling doaj-art-8b41754cdec344db88c1b70dd14b33ef | 2025-02-05T09:07:53Z | eng | China Food Publishing Company | Shipin Kexue | 1002-6630 | 2024-12-01 | 45 | 23 | 250 | 258 | 10.7506/spkx1002-6630-20240206-051 | Association Analysis of Food Risk Factors Based on Improved FP-growth Algorithm | YU Jiabin, MA Xinyue, ZHAO Zhiyao, WANG Xiaoyi, ZHANG Xin, CUI Xiaoyu, BAI Yuting, CHEN Shuaixiang | (1. School of Computer and Artificial Intelligence, Beijing Technology and Business University, Beijing 100048, China; 2. Key Laboratory of Industrial Internet and Big Data, China National Light Industry, Beijing Technology and Business University, Beijing 100048, China; 3. School of Arts and Sciences, Beijing Institute of Fashion Technology, Beijing 100029, China) | https://www.spkx.net.cn/fileup/1002-6630/PDF/2024-45-23-028.pdf | food safety surveillance sampling; association analysis; entropy weight method; minibatchkmeans clustering; frequent pattern-growth algorithm
topic food safety surveillance sampling; association analysis; entropy weight method; minibatchkmeans clustering; frequent pattern-growth algorithm