Fine-Grained Length Controllable Video Captioning With Ordinal Embeddings
This paper proposes a method for video captioning that controls the length of generated captions. Previous work on length control often had few levels for expressing length. In this study, we propose two methods of length embedding for fine-grained length control. A traditional embedding method is l...
| Main Authors: | Tomoya Nitta, Takumi Fukuzawa, Toru Tamaki |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | IEEE, 2024-01-01 |
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10767711/ |
Similar Items
- Survey of Dense Video Captioning: Techniques, Resources, and Future Perspectives
  by: Zhandong Liu, et al.
  Published: (2025-04-01)
- DanceCaps: Pseudo-Captioning for Dance Videos Using Large Language Models
  by: Seohyun Kim, et al.
  Published: (2024-11-01)
- Listen or Read? The Impact of Proficiency and Visual Complexity on Learners’ Reliance on Captions
  by: Yan Li
  Published: (2025-04-01)
- Tiny TR-CAP: A novel small-scale benchmark dataset for general-purpose image captioning tasks
  by: Abbas Memiş, et al.
  Published: (2025-04-01)
- An Ensemble of Vision-Language Transformer-Based Captioning Model With Rotatory Positional Embeddings
  by: K. B. Sathyanarayana, et al.
  Published: (2025-01-01)