Classification Performance Analysis of Decision Tree-Based Algorithms with Noisy Class Variable

Class noise is a common issue that affects the performance of classification techniques on real-world data sets. Class noise appears when a class variable in data sets has incorrect class labels. In the case of noisy data, the robustness of classification techniques against noise could be more impor...

Full description

Saved in:
Bibliographic Details
Main Author: Abdulmajeed Atiah Alharbi
Format: Article
Language:English
Published: Wiley 2024-01-01
Series:Discrete Dynamics in Nature and Society
Online Access:http://dx.doi.org/10.1155/2024/6671395
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832559575741497344
author Abdulmajeed Atiah Alharbi
author_facet Abdulmajeed Atiah Alharbi
author_sort Abdulmajeed Atiah Alharbi
collection DOAJ
description Class noise is a common issue that affects the performance of classification techniques on real-world data sets. Class noise appears when a class variable in data sets has incorrect class labels. In the case of noisy data, the robustness of classification techniques against noise could be more important than the performance results on noise-free data sets. The decision tree method is one of the most popular techniques for classification tasks. The C4.5, CART, and random forest (RF) algorithms are considered to be three of the most used algorithms in decision trees. The aim of this paper is to reach conclusions on which decision tree algorithm is better to use for building decision trees in terms of its performance and robustness against class noise. In order to achieve this aim, we study and compare the performance of the models when applied to class variables with noise. The results obtained indicate that the RF algorithm is more robust to data sets with noisy class variable than other algorithms.
format Article
id doaj-art-b094d33c81b149a9a03732f85519635f
institution Kabale University
issn 1607-887X
language English
publishDate 2024-01-01
publisher Wiley
record_format Article
series Discrete Dynamics in Nature and Society
spelling doaj-art-b094d33c81b149a9a03732f85519635f2025-02-03T01:29:48ZengWileyDiscrete Dynamics in Nature and Society1607-887X2024-01-01202410.1155/2024/6671395Classification Performance Analysis of Decision Tree-Based Algorithms with Noisy Class VariableAbdulmajeed Atiah Alharbi0Department of MathematicsClass noise is a common issue that affects the performance of classification techniques on real-world data sets. Class noise appears when a class variable in data sets has incorrect class labels. In the case of noisy data, the robustness of classification techniques against noise could be more important than the performance results on noise-free data sets. The decision tree method is one of the most popular techniques for classification tasks. The C4.5, CART, and random forest (RF) algorithms are considered to be three of the most used algorithms in decision trees. The aim of this paper is to reach conclusions on which decision tree algorithm is better to use for building decision trees in terms of its performance and robustness against class noise. In order to achieve this aim, we study and compare the performance of the models when applied to class variables with noise. The results obtained indicate that the RF algorithm is more robust to data sets with noisy class variable than other algorithms.http://dx.doi.org/10.1155/2024/6671395
spellingShingle Abdulmajeed Atiah Alharbi
Classification Performance Analysis of Decision Tree-Based Algorithms with Noisy Class Variable
Discrete Dynamics in Nature and Society
title Classification Performance Analysis of Decision Tree-Based Algorithms with Noisy Class Variable
title_full Classification Performance Analysis of Decision Tree-Based Algorithms with Noisy Class Variable
title_fullStr Classification Performance Analysis of Decision Tree-Based Algorithms with Noisy Class Variable
title_full_unstemmed Classification Performance Analysis of Decision Tree-Based Algorithms with Noisy Class Variable
title_short Classification Performance Analysis of Decision Tree-Based Algorithms with Noisy Class Variable
title_sort classification performance analysis of decision tree based algorithms with noisy class variable
url http://dx.doi.org/10.1155/2024/6671395
work_keys_str_mv AT abdulmajeedatiahalharbi classificationperformanceanalysisofdecisiontreebasedalgorithmswithnoisyclassvariable