IMCP: A Python package for imbalanced and multiclass data classifier performance comparison

Bibliographic Details
Main Authors: Jesus S. Aguilar-Ruiz, Marcin Michalak, Łukasz Wróbel
Format: Article
Language: English
Published: Elsevier 2024-12-01
Series: SoftwareX
Subjects: Classification; Multiclass data; Imbalanced data; Performance; ROC curve; Python
Online Access: http://www.sciencedirect.com/science/article/pii/S2352711024002474
author Jesus S. Aguilar-Ruiz
Marcin Michalak
Łukasz Wróbel
collection DOAJ
description The Multiclass Classification Performance (MCP) curve is a novel method for visualizing the performance of a classifier on multiclass datasets. The Imbalanced Multiclass Classification Performance (IMCP) curve, in turn, is a novel approach to visualizing classifier performance on multiclass datasets that exhibit class imbalance, i.e. datasets in which the proportions of two or more class labels are unequal. We have developed an open-source Python package that calculates and visualizes these two classification performance measures and also calculates the area under each curve. The MCP and IMCP curves offer advantages over the traditional Receiver Operating Characteristic (ROC) curve when dealing with multiclass and imbalanced datasets, respectively: they provide more informative insights into classifier behavior when there are multiple classes or an uneven class distribution.
format Article
id doaj-art-554899ffef0249c6bdc403d9939be97e
institution OA Journals
issn 2352-7110
language English
publishDate 2024-12-01
publisher Elsevier
record_format Article
series SoftwareX
doi 10.1016/j.softx.2024.101877
volume 28
article_number 101877
affiliation Jesus S. Aguilar-Ruiz: School of Engineering, Pablo de Olavide University, ES–41013 Seville, Spain
Marcin Michalak (corresponding author): Department of Computer Networks and Systems, Silesian University of Technology, ul. Akademicka 16, 44-100 Gliwice, Poland
Łukasz Wróbel: Department of Computer Networks and Systems, Silesian University of Technology, ul. Akademicka 16, 44-100 Gliwice, Poland
title IMCP: A Python package for imbalanced and multiclass data classifier performance comparison
topic Classification
Multiclass data
Imbalanced data
Performance
ROC curve
Python
url http://www.sciencedirect.com/science/article/pii/S2352711024002474
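
The description above characterizes class imbalance as unequal class-label proportions and mentions computing the area under a curve. The following is a minimal, generic sketch of those two ideas in plain Python. It does not use or reproduce the IMCP package's API, and it does not compute the MCP or IMCP curves themselves; the function names, the `tolerance` parameter, and the choice of the trapezoidal rule are assumptions made for illustration only.

```python
from collections import Counter


def class_proportions(labels):
    """Return {label: fraction of samples} for a sequence of class labels."""
    counts = Counter(labels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("labels must not be empty")
    return {label: count / total for label, count in counts.items()}


def is_imbalanced(labels, tolerance=0.0):
    """True if the largest and smallest class proportions differ by more
    than `tolerance` (a hypothetical threshold, not taken from the paper)."""
    props = class_proportions(labels).values()
    return max(props) - min(props) > tolerance


def area_under_curve(xs, ys):
    """Trapezoidal-rule area under a curve given as points sorted by x.
    This is a generic numerical technique, assumed here for illustration."""
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")
    return sum(
        (x1 - x0) * (y0 + y1) / 2
        for x0, x1, y0, y1 in zip(xs, xs[1:], ys, ys[1:])
    )


if __name__ == "__main__":
    labels = ["a"] * 8 + ["b"] + ["c"]
    print(class_proportions(labels))
    print(is_imbalanced(labels))
    print(area_under_curve([0.0, 0.5, 1.0], [0.0, 0.5, 1.0]))
```

In this sketch a dataset counts as imbalanced as soon as any two class proportions differ; raising `tolerance` would make the check less strict.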