Prompting Large Language Models with Knowledge-Injection for Knowledge-Based Visual Question Answering

Previous works employ Large Language Models (LLMs) such as GPT-3 for knowledge-based Visual Question Answering (VQA). We argue that the inferential capacity of LLMs can be enhanced through knowledge injection. Although methods that utilize knowledge graphs to enhance LLMs have been explored in various...

Bibliographic Details
Main Authors:	Zhongjian Hu, Peng Yang, Fengyuan Liu, Yuan Meng, Xingyu Liu
Format:	Article
Language:	English
Published:	Tsinghua University Press 2024-09-01
Series:	Big Data Mining and Analytics
Subjects:	visual question answering; knowledge-based visual question answering; large language model; knowledge injection
Online Access:	https://www.sciopen.com/article/10.26599/BDMA.2024.9020026

Similar Items