Deep Memory Fusion Model for Long Video Question Answering

Long video question answering contains rich multimodal semantic information and inference information. At present, it is difficult for video question answering models based on recurrent neural networks to fully retain important memory information, to ignore irrelevant redundant information and to ac...

Full description

Saved in:

Bibliographic Details
Main Authors:	SUN Guanglu, WU Meng, QIU Jing, LIANG Lili
Format:	Article
Language:	zho
Published:	Harbin University of Science and Technology Publications 2021-02-01
Series:	Journal of Harbin University of Science and Technology
Subjects:	video question answering long video understanding memory network attention mechanism multimodal fusion
Online Access:	https://hlgxb.hrbust.edu.cn/#/digest?ArticleID=1911
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://hlgxb.hrbust.edu.cn/#/digest?ArticleID=1911

Deep Memory Fusion Model for Long Video Question Answering

Internet

Similar Items