Dual modality prompt learning for visual question-grounded answering in robotic surgery
Abstract With recent advancements in robotic surgery, notable strides have been made in visual question answering (VQA). Existing VQA systems typically generate textual answers to questions but fail to indicate the location of the relevant content within the image. This limitation restricts the inte...
| Main Authors: | , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | SpringerOpen, 2024-04-01 |
| Series: | Visual Computing for Industry, Biomedicine, and Art |
| Online Access: | https://doi.org/10.1186/s42492-024-00160-z |