Visual Question Answering Using Semantic Information from Image Descriptions
In this work, we propose a deep neural architecture that uses an attention mechanism which utilizes region based image features, the natural language question asked, and semantic knowledge extracted from the regions of an image to produce open-ended answers for questions asked in a visual question a...
Saved in:
| Main Authors: | Tasmia Tasmia, Md Sultan Al Nahian, Brent Harrison |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
LibraryPress@UF
2021-04-01
|
| Series: | Proceedings of the International Florida Artificial Intelligence Research Society Conference |
| Subjects: | |
| Online Access: | https://journals.flvc.org/FLAIRS/article/view/128460 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Envisioning Answers: Unleashing Deep Learning for Visual Question Answering in Artistic Images
by: Erfan Zolghadriha, et al.
Published: (2024-03-01) -
Enhancing Visual Question Answering for Multiple Choice Questions
by: Rashi Goel, et al.
Published: (2025-01-01) -
MineObserver: A Deep Learning Framework for Assessing Natural Language Descriptions of Minecraft Imagery
by: Jay Mahajan, et al.
Published: (2022-05-01) -
Influencing Reinforcement Learning through Natural Language Guidance
by: Tasmia Tasrin, et al.
Published: (2021-04-01) -
Seeing and Reasoning: A Simple Deep Learning Approach to Visual Question Answering
by: Rufai Yusuf Zakari, et al.
Published: (2025-04-01)