A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM

Frequent itemset mining (FIM) and high utility itemset mining (HUIM) are popular data mining techniques used in various real-world applications such as retail-market, bio-medicine, and click-stream analysis. However, these techniques have certain limitations. Support, defined as the frequency of an...

Full description

Saved in:
Bibliographic Details
Main Authors: Muhammad Waheed Ashraf, M. Asif Naeem, Heejeong Jasmine Lee
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10810425/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846099991041409024
author Muhammad Waheed Ashraf
M. Asif Naeem
Heejeong Jasmine Lee
author_facet Muhammad Waheed Ashraf
M. Asif Naeem
Heejeong Jasmine Lee
author_sort Muhammad Waheed Ashraf
collection DOAJ
description Frequent itemset mining (FIM) and high utility itemset mining (HUIM) are popular data mining techniques used in various real-world applications such as retail-market, bio-medicine, and click-stream analysis. However, these techniques have certain limitations. Support, defined as the frequency of an itemset in the database, is ignored in HUIM, leading to the omission of frequently occurring itemsets. Similarly, utility measure, which quantifies the importance or profit of an itemset, is overlooked in FIM, resulting in the inability to identify high utility itemsets. Additionally, current approaches often generate an extensive set of itemsets, resulting in redundancy and increased computational and memory demands. To address these challenges, this paper presents the Closed Frequent and High Utility Itemset Miner (Closed-FHUIM) algorithm, which concurrently performs both frequent and high utility itemset mining and produces a concise list of itemsets, reducing redundancy and optimizing efficiency. In Closed-FHUIM, we introduce a novel pruning technique that balances utility and support, and we adjust the sub-tree utility concept by incorporating the support measure. These techniques minimize computational resource use while ensuring that the itemsets meet both frequency and utility requirements. We evaluate our proposed approach on different sparse, dense, and very large datasets. Experimental results show that our algorithm outperforms existing closure-based state-of-the-art algorithms by up to two orders of magnitude while consuming significantly less memory.
format Article
id doaj-art-e60375d8ef8d45c88f2e991fc5e4e4ad
institution Kabale University
issn 2169-3536
language English
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-e60375d8ef8d45c88f2e991fc5e4e4ad2024-12-31T00:00:30ZengIEEEIEEE Access2169-35362024-01-011219651719653210.1109/ACCESS.2024.352061910810425A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIMMuhammad Waheed Ashraf0https://orcid.org/0000-0002-4762-7842M. Asif Naeem1https://orcid.org/0000-0001-6785-7875Heejeong Jasmine Lee2https://orcid.org/0000-0001-5153-756XDepartment of AI&Data Science, National University of Computer and Emerging Sciences, Islamabad, PakistanDepartment of AI&Data Science, National University of Computer and Emerging Sciences, Islamabad, PakistanCollege of Information and Communication Engineering, Sungkyunkwan University, Suwon, South KoreaFrequent itemset mining (FIM) and high utility itemset mining (HUIM) are popular data mining techniques used in various real-world applications such as retail-market, bio-medicine, and click-stream analysis. However, these techniques have certain limitations. Support, defined as the frequency of an itemset in the database, is ignored in HUIM, leading to the omission of frequently occurring itemsets. Similarly, utility measure, which quantifies the importance or profit of an itemset, is overlooked in FIM, resulting in the inability to identify high utility itemsets. Additionally, current approaches often generate an extensive set of itemsets, resulting in redundancy and increased computational and memory demands. To address these challenges, this paper presents the Closed Frequent and High Utility Itemset Miner (Closed-FHUIM) algorithm, which concurrently performs both frequent and high utility itemset mining and produces a concise list of itemsets, reducing redundancy and optimizing efficiency. In Closed-FHUIM, we introduce a novel pruning technique that balances utility and support, and we adjust the sub-tree utility concept by incorporating the support measure. These techniques minimize computational resource use while ensuring that the itemsets meet both frequency and utility requirements. We evaluate our proposed approach on different sparse, dense, and very large datasets. Experimental results show that our algorithm outperforms existing closure-based state-of-the-art algorithms by up to two orders of magnitude while consuming significantly less memory.https://ieeexplore.ieee.org/document/10810425/Closed itemsetsdata miningfrequent itemsetshigh utility itemsetsitemset mining
spellingShingle Muhammad Waheed Ashraf
M. Asif Naeem
Heejeong Jasmine Lee
A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM
IEEE Access
Closed itemsets
data mining
frequent itemsets
high utility itemsets
itemset mining
title A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM
title_full A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM
title_fullStr A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM
title_full_unstemmed A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM
title_short A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM
title_sort robust technique for closed frequent and high utility itemsets mining closed fhuim
topic Closed itemsets
data mining
frequent itemsets
high utility itemsets
itemset mining
url https://ieeexplore.ieee.org/document/10810425/
work_keys_str_mv AT muhammadwaheedashraf arobusttechniqueforclosedfrequentandhighutilityitemsetsminingclosedfhuim
AT masifnaeem arobusttechniqueforclosedfrequentandhighutilityitemsetsminingclosedfhuim
AT heejeongjasminelee arobusttechniqueforclosedfrequentandhighutilityitemsetsminingclosedfhuim
AT muhammadwaheedashraf robusttechniqueforclosedfrequentandhighutilityitemsetsminingclosedfhuim
AT masifnaeem robusttechniqueforclosedfrequentandhighutilityitemsetsminingclosedfhuim
AT heejeongjasminelee robusttechniqueforclosedfrequentandhighutilityitemsetsminingclosedfhuim