Designing and Evaluating a Dual-Stream Transformer-Based Architecture for Visual Question Answering

Designing and Evaluating a Dual-Stream Transformer-Based Architecture for Visual Question Answering

In the realm of Visual Question Answering, accurate answers often hinge on the harmonious fusion of textual and visual elements. While these complex architectures are effective, they typically come with a hefty price tag: a large number of parameters that demand significant processing power and leng...

Full description

Saved in:

Bibliographic Details
Main Authors:	Faheem Shehzad, Aniello Minutolo, Massimo Esposito
Format:	Article
Language:	English
Published:	IEEE 2024-01-01
Series:	IEEE Access
Subjects:	Visual question answering (VQA) transformer models natural language processing dual-stream architecture multimodal question answering attention mechanisms
Online Access:	https://ieeexplore.ieee.org/document/10811881/
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Semantic Weight Adaptive Model Based on Visual Question Answering
by: Li Huimin, et al.
Published: (2025-01-01)

Envisioning Answers: Unleashing Deep Learning for Visual Question Answering in Artistic Images
by: Erfan Zolghadriha, et al.
Published: (2024-03-01)

Medical Knowledge-Based Differential Image Visual Question Answering
by: Fangpeng Lu, et al.
Published: (2025-01-01)

Adaptive Conditional Reasoning for Remote Sensing Visual Question Answering
by: Yiqun Gao, et al.
Published: (2025-04-01)

Seeing and Reasoning: A Simple Deep Learning Approach to Visual Question Answering
by: Rufai Yusuf Zakari, et al.
Published: (2025-04-01)

Improving Visual Question Answering by Image Captioning
by: Xiangjun Shao, et al.
Published: (2025-01-01)

Hierarchical Modeling for Medical Visual Question Answering with Cross-Attention Fusion
by: Junkai Zhang, et al.
Published: (2025-04-01)

Enhancing Visual Question Answering for Multiple Choice Questions
by: Rashi Goel, et al.
Published: (2025-01-01)

Multimodal representative answer extraction in community question answering
by: Ming Li, et al.
Published: (2023-10-01)

A Multi-Modal Attentive Framework That Can Interpret Text (MMAT)
by: Vijay Kumari, et al.
Published: (2025-01-01)

Cross-Encoder-Based Semantic Evaluation of Extractive and Generative Question Answering in Low-Resourced African Languages
by: Funebi Francis Ijebu, et al.
Published: (2025-03-01)

Generative Models for Multiple-Choice Question Answering in Portuguese: A Monolingual and Multilingual Experimental Study
by: Guilherme Dallmann Lima, et al.
Published: (2025-05-01)

Adapting an English Corpus and a Question Answering System for Slovene
by: Uroš Šmajdek, et al.
Published: (2023-09-01)

The role of answer content and length when preparing answers to questions
by: Ruth Elizabeth Corps, et al.
Published: (2024-07-01)

A lightweight knowledge graph-driven question answering system for field-based mineral resource survey
by: Mingguo Wang, et al.
Published: (2025-09-01)

MusiQAl: A Dataset for Music Question–Answering through Audio–Video Fusion
by: Anna-Maria Christodoulou, et al.
Published: (2025-07-01)

Visual Question Answering in Robotic Surgery: A Comprehensive Review
by: Di Ding, et al.
Published: (2025-01-01)

DEFINITION OF TYPOS IN ANSWER OF STUDENT IN KNOWN CORRECT ANSWER
by: Maria V. Biryukova, et al.
Published: (2016-05-01)

Methods of Asking and Answering Questions in Jadal Works Written by Fiqh Scholars
by: Abdurrahim Bilik
Published: (2021-10-01)

Automatic question-answering modeling in English by integrating TF-IDF and segmentation algorithms
by: Hainan Wang
Published: (2024-12-01)

Rhetorical questions as aggressive, friendly or sarcastic/ironical questions with imposed answers
by: Džemal Špago
Published: (2025-01-01)

Progress, challenges and research trends of reasoning in multi-hop knowledge graph based question answering
by: Huifang DU, et al.
Published: (2021-05-01)

Analysis of The Use of Discussion And Question And Answer Methods As an Effort to Improve Student Physics Learning Outcomes
by: Delia Sapitri, et al.
Published: (2023-06-01)

Profile of Junior High School Students' Critical Thinking Skills in Answering Questions Related to Biological Concepts
by: Ahmad Fauzi
Published: (2019-07-01)

MOODLE IN LANGUAGE TEACHING AND TESTING. THE EMBEDDED ANSWERS QUESTION TYPE
by: Ioana-Claudia Horea
Published: (2025-03-01)

HSM-QA: Question Answering System Based on Hierarchical Semantic Matching
by: Jinlu Zhang, et al.
Published: (2023-01-01)

Automatic generation of semantic network for question answering
by: V. V. Potaraev, et al.
Published: (2020-06-01)

Deep Memory Fusion Model for Long Video Question Answering
by: SUN Guanglu, et al.
Published: (2021-02-01)

SHIFA: SBERT-Based Healthcare Information Focused Arabic Question Answering
by: Rahaf Alruwaithi, et al.
Published: (2025-01-01)

Giving Questions and Getting Answers (GQGA) Strategy Improves Biology Learning Outcomes
by: Muhammad Eval Setiawan, et al.
Published: (2019-12-01)

Classifying the Clarity of Questions in CQA Networks: A Topic based Approach
by: Alireza Khabbazan, et al.
Published: (2023-03-01)

Visual explainable artificial intelligence for graph-based visual question answering and scene graph curation
by: Sebastian Künzel, et al.
Published: (2025-04-01)

Enhancing the performance of neurosurgery medical question-answering systems using a multi-task knowledge graph-augmented answer generation model
by: Ting Pan, et al.
Published: (2025-05-01)

Access to court interpreting as social inclusion for migrants in Australia: an analysis of courtroom examination questions and answers
by: Ran Yi
Published: (2024-12-01)

Rhetorical questions as aggressive, friendly or sarcastic/ironical questions with imposed answers
by: Špago Džemal
Published: (2020-12-01)

An Empirical Evaluation of Large Language Models on Consumer Health Questions
by: Moaiz Abrar, et al.
Published: (2025-02-01)

Expert Detection In Question Answer Communities
by: Hamed Salimian, et al.
Published: (2022-01-01)

The battle of question formats: a comparative study of retrieval practice using very short answer questions and multiple choice questions
by: Elise V. van Wijk, et al.
Published: (2024-12-01)

ZPVQA: Visual Question Answering of Images Based on Zero-Shot Prompt Learning
by: Naihao Hu, et al.
Published: (2025-01-01)

Hajj-FQA: A benchmark Arabic dataset for developing question-answering systems on Hajj fatwas
by: Hayfa A. Aleid, et al.
Published: (2025-07-01)