A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM
Frequent itemset mining (FIM) and high utility itemset mining (HUIM) are popular data mining techniques used in various real-world applications such as retail-market, bio-medicine, and click-stream analysis. However, these techniques have certain limitations. Support, defined as the frequency of an...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2024-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10810425/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1846099991041409024 |
---|---|
author | Muhammad Waheed Ashraf M. Asif Naeem Heejeong Jasmine Lee |
author_facet | Muhammad Waheed Ashraf M. Asif Naeem Heejeong Jasmine Lee |
author_sort | Muhammad Waheed Ashraf |
collection | DOAJ |
description | Frequent itemset mining (FIM) and high utility itemset mining (HUIM) are popular data mining techniques used in various real-world applications such as retail-market, bio-medicine, and click-stream analysis. However, these techniques have certain limitations. Support, defined as the frequency of an itemset in the database, is ignored in HUIM, leading to the omission of frequently occurring itemsets. Similarly, utility measure, which quantifies the importance or profit of an itemset, is overlooked in FIM, resulting in the inability to identify high utility itemsets. Additionally, current approaches often generate an extensive set of itemsets, resulting in redundancy and increased computational and memory demands. To address these challenges, this paper presents the Closed Frequent and High Utility Itemset Miner (Closed-FHUIM) algorithm, which concurrently performs both frequent and high utility itemset mining and produces a concise list of itemsets, reducing redundancy and optimizing efficiency. In Closed-FHUIM, we introduce a novel pruning technique that balances utility and support, and we adjust the sub-tree utility concept by incorporating the support measure. These techniques minimize computational resource use while ensuring that the itemsets meet both frequency and utility requirements. We evaluate our proposed approach on different sparse, dense, and very large datasets. Experimental results show that our algorithm outperforms existing closure-based state-of-the-art algorithms by up to two orders of magnitude while consuming significantly less memory. |
format | Article |
id | doaj-art-e60375d8ef8d45c88f2e991fc5e4e4ad |
institution | Kabale University |
issn | 2169-3536 |
language | English |
publishDate | 2024-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj-art-e60375d8ef8d45c88f2e991fc5e4e4ad2024-12-31T00:00:30ZengIEEEIEEE Access2169-35362024-01-011219651719653210.1109/ACCESS.2024.352061910810425A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIMMuhammad Waheed Ashraf0https://orcid.org/0000-0002-4762-7842M. Asif Naeem1https://orcid.org/0000-0001-6785-7875Heejeong Jasmine Lee2https://orcid.org/0000-0001-5153-756XDepartment of AI&Data Science, National University of Computer and Emerging Sciences, Islamabad, PakistanDepartment of AI&Data Science, National University of Computer and Emerging Sciences, Islamabad, PakistanCollege of Information and Communication Engineering, Sungkyunkwan University, Suwon, South KoreaFrequent itemset mining (FIM) and high utility itemset mining (HUIM) are popular data mining techniques used in various real-world applications such as retail-market, bio-medicine, and click-stream analysis. However, these techniques have certain limitations. Support, defined as the frequency of an itemset in the database, is ignored in HUIM, leading to the omission of frequently occurring itemsets. Similarly, utility measure, which quantifies the importance or profit of an itemset, is overlooked in FIM, resulting in the inability to identify high utility itemsets. Additionally, current approaches often generate an extensive set of itemsets, resulting in redundancy and increased computational and memory demands. To address these challenges, this paper presents the Closed Frequent and High Utility Itemset Miner (Closed-FHUIM) algorithm, which concurrently performs both frequent and high utility itemset mining and produces a concise list of itemsets, reducing redundancy and optimizing efficiency. In Closed-FHUIM, we introduce a novel pruning technique that balances utility and support, and we adjust the sub-tree utility concept by incorporating the support measure. These techniques minimize computational resource use while ensuring that the itemsets meet both frequency and utility requirements. We evaluate our proposed approach on different sparse, dense, and very large datasets. Experimental results show that our algorithm outperforms existing closure-based state-of-the-art algorithms by up to two orders of magnitude while consuming significantly less memory.https://ieeexplore.ieee.org/document/10810425/Closed itemsetsdata miningfrequent itemsetshigh utility itemsetsitemset mining |
spellingShingle | Muhammad Waheed Ashraf M. Asif Naeem Heejeong Jasmine Lee A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM IEEE Access Closed itemsets data mining frequent itemsets high utility itemsets itemset mining |
title | A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM |
title_full | A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM |
title_fullStr | A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM |
title_full_unstemmed | A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM |
title_short | A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM |
title_sort | robust technique for closed frequent and high utility itemsets mining closed fhuim |
topic | Closed itemsets data mining frequent itemsets high utility itemsets itemset mining |
url | https://ieeexplore.ieee.org/document/10810425/ |
work_keys_str_mv | AT muhammadwaheedashraf arobusttechniqueforclosedfrequentandhighutilityitemsetsminingclosedfhuim AT masifnaeem arobusttechniqueforclosedfrequentandhighutilityitemsetsminingclosedfhuim AT heejeongjasminelee arobusttechniqueforclosedfrequentandhighutilityitemsetsminingclosedfhuim AT muhammadwaheedashraf robusttechniqueforclosedfrequentandhighutilityitemsetsminingclosedfhuim AT masifnaeem robusttechniqueforclosedfrequentandhighutilityitemsetsminingclosedfhuim AT heejeongjasminelee robusttechniqueforclosedfrequentandhighutilityitemsetsminingclosedfhuim |