MKNNet: Knowledge-aligned multimodal transformer for information retrieval

With the rapid advancement of artificial intelligence and the Internet of Things, data collected from multiple sensing modalities is growing rapidly in both volume and complexity. In this paper, we propose a novel deep learning framework called MKNNet, which combines modality alignment, Transformer-...

Bibliographic Details
Main Authors: Xiaoqin Lin, Chentao Han, Jian Yao, Yue Li, Xujun Wang, Shufeng Jia
Format: Article
Language: English
Published: Elsevier 2025-08-01
Series: Alexandria Engineering Journal
Online Access: http://www.sciencedirect.com/science/article/pii/S1110016825008051