MoNetViT: an efficient fusion of CNN and transformer technologies for visual navigation assistance with multi query attention
Aruco markers are crucial for navigation in complex indoor environments, especially for those with visual impairments. Traditional CNNs handle image segmentation well, but transformers excel at capturing long-range dependencies, essential for machine vision tasks. Our study introduces MoNetViT (Mini...
Saved in:
| Main Authors: | Liliek Triyono, Rahmat Gernowo, Prayitno |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Frontiers Media S.A.
2025-02-01
|
| Series: | Frontiers in Computer Science |
| Subjects: | |
| Online Access: | https://www.frontiersin.org/articles/10.3389/fcomp.2025.1510252/full |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Visual Impairment Spatial Awareness System for Indoor Navigation and Daily Activities
by: Xinrui Yu, et al.
Published: (2025-01-01) -
Fusion of Visual Attention and Scene Descriptions With Deep Reinforcement Learning for AAV Indoor Autonomous Navigation
by: Hussein Samma, et al.
Published: (2025-01-01) -
Analysis and testing of an indoor navigation system
by: Aleksander Wędzonka, et al.
Published: (2024-06-01) -
Development of methodology for designing indoor cartographic visualizations for use in navigation for persons with special needs
by: Szewczuk Gabriela, et al.
Published: (2024-01-01) -
A Context-Aware Doorway Alignment and Depth Estimation Algorithm for Assistive Wheelchairs
by: Shanelle Tennekoon, et al.
Published: (2025-07-01)