-
1
Advancements in Vision–Language Models for Remote Sensing: Datasets, Capabilities, and Enhancement Techniques
Published 2025-01-01Subjects: “…vision–language models…”
Get full text
Article -
2
Multimodal Framework for Long-Tailed Recognition
Published 2024-11-01Subjects: Get full text
Article -
3
KeyMPs: One-Shot Vision-Language Guided Motion Generation by Sequencing DMPs for Occlusion-Rich Tasks
Published 2025-01-01Subjects: Get full text
Article -
4
Training-Free VLM-Based Pseudo Label Generation for Video Anomaly Detection
Published 2025-01-01Subjects: “…Vision language models…”
Get full text
Article -
5
QUBVIS: query based multi-modal summarization system using CLIP based transformer and vision language models
Published 2025-09-01Subjects: Get full text
Article -
6
DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation Benchmark
Published 2025-02-01Subjects: Get full text
Article -
7
Exploring the Limits of Large Language Models’ Ability to Distinguish Between Objects
Published 2025-04-01Subjects: Get full text
Article -
8
VisGraphVar: A benchmark generator for Assessing Variability in Graph Analysis Using Large Vision-Language Models
Published 2025-01-01Subjects: Get full text
Article -
9
Pedestrian Vision Language Model for Intentions Prediction
Published 2025-01-01Subjects: Get full text
Article -
10
-
11
Detailed Image Captioning and Hashtag Generation
Published 2024-11-01Subjects: Get full text
Article -
12
AbVLM-Q: intelligent quality assessment for abdominal ultrasound standard planes via vision-language modeling
Published 2025-08-01Subjects: “…Vision-language models…”
Get full text
Article -
13
Dual Adapter Tuning of Vision–Language Models Using Large Language Models
Published 2025-05-01Subjects: Get full text
Article -
14
Text-Guided Distribution Calibration for Few-Shot Object Detection in Remote Sensing Images
Published 2025-01-01Subjects: Get full text
Article -
15
LAMARS: Large Language Model-Based Anticipation Mechanism Acceleration in Real-Time Robotic Systems
Published 2025-01-01Subjects: Get full text
Article -
16
Open-Vocabulary Action Localization With Iterative Visual Prompting
Published 2025-01-01Subjects: Get full text
Article -
17
Automated Skin Cancer Report Generation via a Knowledge-Distilled Vision-Language Model
Published 2025-01-01Subjects: Get full text
Article -
18
Evaluation of Thermal Comfort in Urban Commercial Space with Vision–Language-Model-Based Agent Model
Published 2025-04-01Subjects: “…vision–language models (VLMs)…”
Get full text
Article -
19
Optimizing Text Recognition in Mechanical Drawings: A Comprehensive Approach
Published 2025-03-01Subjects: Get full text
Article -
20
Multimodal AI and Large Language Models for Orthopantomography Radiology Report Generation and Q&A
Published 2025-03-01Subjects: Get full text
Article