Multimodal Retrieval Method for Images and Diagnostic Reports Using Cross-Attention
<b>Background:</b> Conventional medical image retrieval methods treat images and text as independent embeddings, limiting their ability to fully utilize the complementary information from both modalities. This separation often results in suboptimal retrieval performance, as the intricate...
Saved in:
| Main Authors: | Ikumi Sata, Motoki Amagasaki, Masato Kiyama |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-02-01
|
| Series: | AI |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2673-2688/6/2/38 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Exploring BiomedCLIP’s Capabilities in Medical Image Analysis: A Focus on Scoliosis Detection and Severity Assessment
by: Bartosz Polis, et al.
Published: (2025-01-01) -
DI-VTR: Dual inter-modal interaction model for video-text retrieval
by: Jie Guo, et al.
Published: (2024-09-01) -
Quick-MIMIC: A Multimodal Data Extraction Pipeline for MIMIC with Parallelization
by: Yutao Dou, et al.
Published: (2024-12-01) -
Assessing chest radiographic quality and the influence of COVID‐19 pathology: the Australian experience
by: Peter O'Reilly, et al.
Published: (2025-06-01) -
CCMusic: An Open and Diverse Database for Chinese Music Information Retrieval Research
by: Monan Zhou, et al.
Published: (2025-03-01)