Graph-based vision transformer with sparsity for training on small datasets from scratch

Abstract: Vision Transformers (ViTs) have achieved impressive results in large-scale image classification. However, when training from scratch on small datasets, there is still a significant performance gap between ViTs and Convolutional Neural Networks (CNNs), which is attributed to the lack of indu...

Bibliographic Details
Main Authors:	Peng Li, Lu Huang, Jin Li, Haiyan Yan, Dongjing Shan
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-07-01
Series:	Scientific Reports
Subjects:	Vision Transformer; Graph convolution; Self-attention; Graph-pooling; Image classification
Online Access:	https://doi.org/10.1038/s41598-025-10408-0
