A Semisupervised Cascade Classification Algorithm

Classification is one of the most important tasks of data mining techniques, which have been adopted by several modern applications. The shortage of enough labeled data in the majority of these applications has shifted the interest towards using semisupervised methods. Under such schemes, the use of...

Full description

Saved in:
Bibliographic Details
Main Authors: Stamatis Karlos, Nikos Fazakis, Sotiris Kotsiantis, Kyriakos Sgarbas
Format: Article
Language:English
Published: Wiley 2016-01-01
Series:Applied Computational Intelligence and Soft Computing
Online Access:http://dx.doi.org/10.1155/2016/5919717
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832562871151624192
author Stamatis Karlos
Nikos Fazakis
Sotiris Kotsiantis
Kyriakos Sgarbas
author_facet Stamatis Karlos
Nikos Fazakis
Sotiris Kotsiantis
Kyriakos Sgarbas
author_sort Stamatis Karlos
collection DOAJ
description Classification is one of the most important tasks of data mining techniques, which have been adopted by several modern applications. The shortage of enough labeled data in the majority of these applications has shifted the interest towards using semisupervised methods. Under such schemes, the use of collected unlabeled data combined with a clearly smaller set of labeled examples leads to similar or even better classification accuracy against supervised algorithms, which use labeled examples exclusively during the training phase. A novel approach for increasing semisupervised classification using Cascade Classifier technique is presented in this paper. The main characteristic of Cascade Classifier strategy is the use of a base classifier for increasing the feature space by adding either the predicted class or the probability class distribution of the initial data. The classifier of the second level is supplied with the new dataset and extracts the decision for each instance. In this work, a self-trained NB∇C4.5 classifier algorithm is presented, which combines the characteristics of Naive Bayes as a base classifier and the speed of C4.5 for final classification. We performed an in-depth comparison with other well-known semisupervised classification methods on standard benchmark datasets and we finally reached to the point that the presented technique has better accuracy in most cases.
format Article
id doaj-art-5bb745ad138348fe94023a8e538d28b2
institution Kabale University
issn 1687-9724
1687-9732
language English
publishDate 2016-01-01
publisher Wiley
record_format Article
series Applied Computational Intelligence and Soft Computing
spelling doaj-art-5bb745ad138348fe94023a8e538d28b22025-02-03T01:21:30ZengWileyApplied Computational Intelligence and Soft Computing1687-97241687-97322016-01-01201610.1155/2016/59197175919717A Semisupervised Cascade Classification AlgorithmStamatis Karlos0Nikos Fazakis1Sotiris Kotsiantis2Kyriakos Sgarbas3Department of Mathematics, University of Patras, 26504 Rio, GreeceDepartment of Electrical and Computer Engineering, University of Patras, 26504 Rio, GreeceDepartment of Mathematics, University of Patras, 26504 Rio, GreeceDepartment of Electrical and Computer Engineering, University of Patras, 26504 Rio, GreeceClassification is one of the most important tasks of data mining techniques, which have been adopted by several modern applications. The shortage of enough labeled data in the majority of these applications has shifted the interest towards using semisupervised methods. Under such schemes, the use of collected unlabeled data combined with a clearly smaller set of labeled examples leads to similar or even better classification accuracy against supervised algorithms, which use labeled examples exclusively during the training phase. A novel approach for increasing semisupervised classification using Cascade Classifier technique is presented in this paper. The main characteristic of Cascade Classifier strategy is the use of a base classifier for increasing the feature space by adding either the predicted class or the probability class distribution of the initial data. The classifier of the second level is supplied with the new dataset and extracts the decision for each instance. In this work, a self-trained NB∇C4.5 classifier algorithm is presented, which combines the characteristics of Naive Bayes as a base classifier and the speed of C4.5 for final classification. We performed an in-depth comparison with other well-known semisupervised classification methods on standard benchmark datasets and we finally reached to the point that the presented technique has better accuracy in most cases.http://dx.doi.org/10.1155/2016/5919717
spellingShingle Stamatis Karlos
Nikos Fazakis
Sotiris Kotsiantis
Kyriakos Sgarbas
A Semisupervised Cascade Classification Algorithm
Applied Computational Intelligence and Soft Computing
title A Semisupervised Cascade Classification Algorithm
title_full A Semisupervised Cascade Classification Algorithm
title_fullStr A Semisupervised Cascade Classification Algorithm
title_full_unstemmed A Semisupervised Cascade Classification Algorithm
title_short A Semisupervised Cascade Classification Algorithm
title_sort semisupervised cascade classification algorithm
url http://dx.doi.org/10.1155/2016/5919717
work_keys_str_mv AT stamatiskarlos asemisupervisedcascadeclassificationalgorithm
AT nikosfazakis asemisupervisedcascadeclassificationalgorithm
AT sotiriskotsiantis asemisupervisedcascadeclassificationalgorithm
AT kyriakossgarbas asemisupervisedcascadeclassificationalgorithm
AT stamatiskarlos semisupervisedcascadeclassificationalgorithm
AT nikosfazakis semisupervisedcascadeclassificationalgorithm
AT sotiriskotsiantis semisupervisedcascadeclassificationalgorithm
AT kyriakossgarbas semisupervisedcascadeclassificationalgorithm