A multimodal transformer-based visual question answering method integrating local and global information.
Current visual question answering (VQA) models face limitations in multimodal feature fusion and often give insufficient consideration to local information. To address this, this study proposes a multimodal Transformer VQA network based on local and global information integration…
| Main Authors: | Cuiyang Huang, Zihan Hu |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Public Library of Science (PLoS), 2025-01-01 |
| Series: | PLoS ONE |
| Online Access: | https://doi.org/10.1371/journal.pone.0324757 |
Similar Items
- Multimodal representative answer extraction in community question answering
  by: Ming Li, et al.
  Published: (2023-10-01)
- Enhancing Visual Question Answering for Multiple Choice Questions
  by: Rashi Goel, et al.
  Published: (2025-01-01)
- Visual Question Answering Using Semantic Information from Image Descriptions
  by: Tasmia Tasmia, et al.
  Published: (2021-04-01)
- Informed-Learning-Guided Visual Question Answering Model of Crop Disease
  by: Yunpeng Zhao, et al.
  Published: (2024-01-01)
- Designing and Evaluating a Dual-Stream Transformer-Based Architecture for Visual Question Answering
  by: Faheem Shehzad, et al.
  Published: (2024-01-01)