How Resilient Are Kolmogorov–Arnold Networks in Classification Tasks? A Robustness Investigation
Kolmogorov–Arnold Networks (KANs) are a novel class of neural network architectures based on the Kolmogorov–Arnold representation theorem and have demonstrated potential advantages in accuracy and interpretability over Multilayer Perceptron (MLP) models. This paper comprehensively evaluates the robustness of various KAN architectures (KAN, KAN-Mixer, KANConv_KAN, and KANConv_MLP) against adversarial attacks, a critical aspect that remains underexplored in current research. We compare these models with MLP-based architectures such as MLP, MLP-Mixer, and ConvNet_MLP across three traffic sign classification datasets: GTSRB, BTSD, and CTSD. The models were subjected to various adversarial attacks (FGSM, PGD, CW, and BIM) at varying perturbation levels and were trained under different strategies, including standard training, adversarial training, and Randomized Smoothing. Our experimental results demonstrate that KAN-based models, particularly the KAN-Mixer, exhibit superior robustness to adversarial attacks compared to their MLP counterparts. Specifically, the KAN-Mixer consistently achieved lower Success Attack Rates (SARs) and Degrees of Change (DoCs) across most attack types and datasets while maintaining high accuracy on clean data. For instance, under FGSM attacks with ε = 0.01, the KAN-Mixer outperformed the MLP-Mixer by maintaining higher accuracy and lower SARs. Adversarial training and Randomized Smoothing further enhanced the robustness of KAN-based models, with t-SNE visualizations revealing more stable latent space representations under adversarial perturbations. These findings underscore the potential of KAN architectures to improve neural network security and reliability in adversarial settings.
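To make the evaluation protocol described above concrete, here is a minimal sketch of a one-step FGSM attack at ε = 0.01 together with one common way to compute a Success Attack Rate (SAR). This is not the authors' code: it assumes a generic PyTorch image classifier and data loader (the names `model` and `loader` are placeholders), assumes inputs scaled to [0, 1], and uses an illustrative SAR definition (the fraction of originally correct predictions that the attack flips) rather than the paper's exact formula.

```python
# Illustrative sketch only: FGSM perturbation and a simple SAR measurement.
# `model` and `loader` are hypothetical stand-ins for a trained classifier
# (e.g., a KAN-Mixer or MLP-Mixer) and a (image, label) data loader.
import torch
import torch.nn.functional as F

def fgsm_attack(model, x, y, epsilon=0.01):
    """One-step FGSM: x_adv = x + epsilon * sign(grad_x loss)."""
    x = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x), y)
    loss.backward()
    x_adv = x + epsilon * x.grad.sign()
    # Keep pixel values valid, assuming inputs are scaled to [0, 1].
    return x_adv.clamp(0.0, 1.0).detach()

@torch.no_grad()
def predict(model, x):
    return model(x).argmax(dim=1)

def success_attack_rate(model, loader, epsilon=0.01):
    """SAR here = fraction of originally correct predictions flipped by the attack."""
    model.eval()
    flipped, correct = 0, 0
    for x, y in loader:
        clean_pred = predict(model, x)
        mask = clean_pred == y              # only count samples classified correctly on clean data
        if mask.any():
            x_adv = fgsm_attack(model, x[mask], y[mask], epsilon)
            adv_pred = predict(model, x_adv)
            flipped += (adv_pred != y[mask]).sum().item()
            correct += mask.sum().item()
    return flipped / max(correct, 1)
```

Sweeping the epsilon value over the perturbation levels used in the study and repeating the measurement for KAN-based and MLP-based checkpoints would give the kind of SAR comparison summarized in the abstract.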
| Main Authors: | Ahmed Dawod Mohammed Ibrahum, Zhengyu Shang, Jang-Eui Hong |
|---|---|
| Affiliation: | Software Intelligence Engineering Lab, Department of Computer Science, Chungbuk National University, Cheongju 28644, Republic of Korea |
| Format: | Article |
| Language: | English |
| Published: | MDPI AG, 2024-11-01 |
| Series: | Applied Sciences |
| ISSN: | 2076-3417 |
| DOI: | 10.3390/app142210173 |
| Subjects: | adversarial attacks; robustness; Kolmogorov–Arnold networks (KANs); multilayer perceptrons (MLPs); convolutional-KANs; KAN-Mixer |
| Online Access: | https://www.mdpi.com/2076-3417/14/22/10173 |