Bit-Table Based Biclustering and Frequent Closed Itemset Mining in High-Dimensional Binary Data

During the last decade various algorithms have been developed and proposed for discovering overlapping clusters in high-dimensional data. The two most prominent application fields in this research, proposed independently, are frequent itemset mining (developed for market basket data) and biclusterin...

Full description

Saved in:
Bibliographic Details
Main Authors: András Király, Attila Gyenesei, János Abonyi
Format: Article
Language:English
Published: Wiley 2014-01-01
Series:The Scientific World Journal
Online Access:http://dx.doi.org/10.1155/2014/870406
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832557007183282176
author András Király
Attila Gyenesei
János Abonyi
author_facet András Király
Attila Gyenesei
János Abonyi
author_sort András Király
collection DOAJ
description During the last decade various algorithms have been developed and proposed for discovering overlapping clusters in high-dimensional data. The two most prominent application fields in this research, proposed independently, are frequent itemset mining (developed for market basket data) and biclustering (applied to gene expression data analysis). The common limitation of both methodologies is the limited applicability for very large binary data sets. In this paper we propose a novel and efficient method to find both frequent closed itemsets and biclusters in high-dimensional binary data. The method is based on simple but very powerful matrix and vector multiplication approaches that ensure that all patterns can be discovered in a fast manner. The proposed algorithm has been implemented in the commonly used MATLAB environment and freely available for researchers.
format Article
id doaj-art-22452c17e5c14009acd3b2744f9f55ae
institution Kabale University
issn 2356-6140
1537-744X
language English
publishDate 2014-01-01
publisher Wiley
record_format Article
series The Scientific World Journal
spelling doaj-art-22452c17e5c14009acd3b2744f9f55ae2025-02-03T05:43:49ZengWileyThe Scientific World Journal2356-61401537-744X2014-01-01201410.1155/2014/870406870406Bit-Table Based Biclustering and Frequent Closed Itemset Mining in High-Dimensional Binary DataAndrás Király0Attila Gyenesei1János Abonyi2Department of Process Engineering, University of Pannonia, Veszprém 8200, HungaryBioinformatics & Scientific Computing Core, Campus Science Support Facilities, Vienna Biocenter, 1030 Vienna, AustriaDepartment of Process Engineering, University of Pannonia, Veszprém 8200, HungaryDuring the last decade various algorithms have been developed and proposed for discovering overlapping clusters in high-dimensional data. The two most prominent application fields in this research, proposed independently, are frequent itemset mining (developed for market basket data) and biclustering (applied to gene expression data analysis). The common limitation of both methodologies is the limited applicability for very large binary data sets. In this paper we propose a novel and efficient method to find both frequent closed itemsets and biclusters in high-dimensional binary data. The method is based on simple but very powerful matrix and vector multiplication approaches that ensure that all patterns can be discovered in a fast manner. The proposed algorithm has been implemented in the commonly used MATLAB environment and freely available for researchers.http://dx.doi.org/10.1155/2014/870406
spellingShingle András Király
Attila Gyenesei
János Abonyi
Bit-Table Based Biclustering and Frequent Closed Itemset Mining in High-Dimensional Binary Data
The Scientific World Journal
title Bit-Table Based Biclustering and Frequent Closed Itemset Mining in High-Dimensional Binary Data
title_full Bit-Table Based Biclustering and Frequent Closed Itemset Mining in High-Dimensional Binary Data
title_fullStr Bit-Table Based Biclustering and Frequent Closed Itemset Mining in High-Dimensional Binary Data
title_full_unstemmed Bit-Table Based Biclustering and Frequent Closed Itemset Mining in High-Dimensional Binary Data
title_short Bit-Table Based Biclustering and Frequent Closed Itemset Mining in High-Dimensional Binary Data
title_sort bit table based biclustering and frequent closed itemset mining in high dimensional binary data
url http://dx.doi.org/10.1155/2014/870406
work_keys_str_mv AT andraskiraly bittablebasedbiclusteringandfrequentcloseditemsetmininginhighdimensionalbinarydata
AT attilagyenesei bittablebasedbiclusteringandfrequentcloseditemsetmininginhighdimensionalbinarydata
AT janosabonyi bittablebasedbiclusteringandfrequentcloseditemsetmininginhighdimensionalbinarydata