Automated Ultrasound Diagnosis via CLIP-GPT Synergy: A Multimodal Framework for Image Classification and Report Generation
As a crucial non-invasive imaging modality in clinical diagnosis, ultrasound interpretation faces challenges of subjectivity and inefficiency. To address the limitations of traditional single-modal deep learning models in cross-modal alignment and structured text generation, this study proposes an i...
Saved in:
| Main Authors: | Li Yan, Xiaodong Zhou, Yaotian Wang, Xuan Chang, Qing Li, Gang Han |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/11029188/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Transferring CLIP for visual grounding in remote sensing images
by: Linlin Liang, et al.
Published: (2025-08-01) -
The CLIP - GPT Image Captioning Model Integrated with Global Semantics
by: TAO Rui, et al.
Published: (2024-04-01) -
Enhanced CLIP-GPT Framework for Cross-Lingual Remote Sensing Image Captioning
by: Rui Song, et al.
Published: (2025-01-01) -
A Dual-Enhanced Hierarchical Alignment Framework for Multimodal Named Entity Recognition
by: Jian Wang, et al.
Published: (2025-05-01) -
CLIP-BCA-Gated: A Dynamic Multimodal Framework for Real-Time Humanitarian Crisis Classification with Bi-Cross-Attention and Adaptive Gating
by: Shanshan Li, et al.
Published: (2025-08-01)