Enhancing Image Classification using Graph Attention Networks

Excellent performance in artificial intelligence image classification leads to extensive applications throughout areas such as healthcare facilities, robotic systems and multimedia platforms. The research field has evolved through new developments in both Vision Transformers (ViTs) alongside Graph N...

Full description

Saved in:
Bibliographic Details
Main Author: Hasan Maher Ahmed
Format: Article
Language:Arabic
Published: University of Information Technology and Communications 2025-08-01
Series:Iraqi Journal for Computers and Informatics
Subjects:
Online Access:https://ijci.uoitc.edu.iq/index.php/ijci/article/view/548
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849224680851898368
author Hasan Maher Ahmed
author_facet Hasan Maher Ahmed
author_sort Hasan Maher Ahmed
collection DOAJ
description Excellent performance in artificial intelligence image classification leads to extensive applications throughout areas such as healthcare facilities, robotic systems and multimedia platforms. The research field has evolved through new developments in both Vision Transformers (ViTs) alongside Graph Neural Networks (GNNs). A new image classification method utilizes integrated Vision Transformers (ViTs) and Graph Attention Networks (GATs) to improve results for difficult dataset types. The hybrid architecture made possible by combining ViTs with GATs successfully captures complex relationships within visual data because ViTs deliver powerful global feature extraction while GATs establish strong patch-level dependencies. The implementation of GATs via their built-in attention mechanism allows dynamic region prioritization for both accurate recognition and better interpretability of images. The experiments using benchmark datasets CIFAR-10, CIFAR-100, ImageNet, Fashion-MNIST, and SVHN show that ViT + GAT outperforms Swin Transformer and ConvNeXt for state-of-the-art architectures. The proposed method delivers prominent improvements in all classification metrics including accuracy and both accuracy and resistance to noise interference and adversarial perturbations. Model reliability and task generalization capabilities are demonstrated through the precision, recall, F1-score and AUC-ROC metrics. This project integrates smartphone-level ViT technology with deep social modeling GAT components to redefine image classification methods. The method's outstanding performance proves itself as a promising solution for complex visual recognition challenges on multiple scale levels.
format Article
id doaj-art-ce3135c37bb44141bbfa64ca677dd270
institution Kabale University
issn 2313-190X
2520-4912
language Arabic
publishDate 2025-08-01
publisher University of Information Technology and Communications
record_format Article
series Iraqi Journal for Computers and Informatics
spelling doaj-art-ce3135c37bb44141bbfa64ca677dd2702025-08-25T07:15:00ZaraUniversity of Information Technology and CommunicationsIraqi Journal for Computers and Informatics2313-190X2520-49122025-08-01512334510.25195/ijci.v51i2.548511Enhancing Image Classification using Graph Attention NetworksHasan Maher Ahmed0University of MosulExcellent performance in artificial intelligence image classification leads to extensive applications throughout areas such as healthcare facilities, robotic systems and multimedia platforms. The research field has evolved through new developments in both Vision Transformers (ViTs) alongside Graph Neural Networks (GNNs). A new image classification method utilizes integrated Vision Transformers (ViTs) and Graph Attention Networks (GATs) to improve results for difficult dataset types. The hybrid architecture made possible by combining ViTs with GATs successfully captures complex relationships within visual data because ViTs deliver powerful global feature extraction while GATs establish strong patch-level dependencies. The implementation of GATs via their built-in attention mechanism allows dynamic region prioritization for both accurate recognition and better interpretability of images. The experiments using benchmark datasets CIFAR-10, CIFAR-100, ImageNet, Fashion-MNIST, and SVHN show that ViT + GAT outperforms Swin Transformer and ConvNeXt for state-of-the-art architectures. The proposed method delivers prominent improvements in all classification metrics including accuracy and both accuracy and resistance to noise interference and adversarial perturbations. Model reliability and task generalization capabilities are demonstrated through the precision, recall, F1-score and AUC-ROC metrics. This project integrates smartphone-level ViT technology with deep social modeling GAT components to redefine image classification methods. The method's outstanding performance proves itself as a promising solution for complex visual recognition challenges on multiple scale levels.https://ijci.uoitc.edu.iq/index.php/ijci/article/view/548image classification, graph attention networks, vision transformers, graph convolutional networks, artificial intelligence
spellingShingle Hasan Maher Ahmed
Enhancing Image Classification using Graph Attention Networks
Iraqi Journal for Computers and Informatics
image classification, graph attention networks, vision transformers, graph convolutional networks, artificial intelligence
title Enhancing Image Classification using Graph Attention Networks
title_full Enhancing Image Classification using Graph Attention Networks
title_fullStr Enhancing Image Classification using Graph Attention Networks
title_full_unstemmed Enhancing Image Classification using Graph Attention Networks
title_short Enhancing Image Classification using Graph Attention Networks
title_sort enhancing image classification using graph attention networks
topic image classification, graph attention networks, vision transformers, graph convolutional networks, artificial intelligence
url https://ijci.uoitc.edu.iq/index.php/ijci/article/view/548
work_keys_str_mv AT hasanmaherahmed enhancingimageclassificationusinggraphattentionnetworks