Parameter-efficient weakly supervised referring video object segmentation via chain-of-thought reasoning

Abstract: Referring video object segmentation (RVOS) aims to segment the object corresponding to a language expression in a video. Most existing RVOS methods are trained using accurate per-pixel annotations, which are expensive and time-consuming to obtain. Moreover, they need to update the entire pa...

Bibliographic Details
Main Authors: Xing Wang, Zhe Xu, Yuanshi Zheng, Handing Wang
Format: Article
Language: English
Published: Springer 2025-05-01
Series: Complex & Intelligent Systems
Online Access: https://doi.org/10.1007/s40747-025-01900-1