Fine-Grained Length Controllable Video Captioning With Ordinal Embeddings

This paper proposes a method for video captioning that controls the length of generated captions. Previous work on length control often had few levels for expressing length. In this study, we propose two methods of length embedding for fine-grained length control. A traditional embedding method is l...

Bibliographic Details
Main Authors:	Tomoya Nitta, Takumi Fukuzawa, Toru Tamaki
Format:	Article
Language:	English
Published:	IEEE 2024-01-01
Series:	IEEE Access
Subjects:	Video captioning; length controllable generation; ordinal embedding
Online Access:	https://ieeexplore.ieee.org/document/10767711/

Similar Items

Survey of Dense Video Captioning: Techniques, Resources, and Future Perspectives
by: Zhandong Liu, et al.
Published: (2025-04-01)

DanceCaps: Pseudo-Captioning for Dance Videos Using Large Language Models
by: Seohyun Kim, et al.
Published: (2024-11-01)

Listen or Read? The Impact of Proficiency and Visual Complexity on Learners’ Reliance on Captions
by: Yan Li
Published: (2025-04-01)

Tiny TR-CAP: A novel small-scale benchmark dataset for general-purpose image captioning tasks
by: Abbas Memiş, et al.
Published: (2025-04-01)

An Ensemble of Vision-Language Transformer-Based Captioning Model With Rotatory Positional Embeddings
by: K. B. Sathyanarayana, et al.
Published: (2025-01-01)

Fit for What Purpose? NER Certification of Automatic Captions in English and Spanish
by: Pablo Romero-Fresco, et al.
Published: (2025-01-01)

Ordered Choice Models: Ordinal Logit and Ordinal Probit
by: Öznur İşçi Güneri, et al.
Published: (2022-11-01)

Derived length for arbitrary topological spaces
by: A. J. Jayanthan
Published: (1992-01-01)

Integrating visual memory for image captioning
by: Jiahui Wei, et al.
Published: (2025-05-01)

Dual-Stream Spatially Aware Transformer for Remote Sensing Image Captioning
by: Haifeng Sima, et al.
Published: (2025-01-01)

Undergraduate students’ perceptions toward writing Instagram captions in English
by: Nahda Nafisah Hutasuhut, et al.
Published: (2024-05-01)

Remote Sensing Image Change Captioning Using Multi-Attentive Network with Diffusion Model
by: Yue Yang, et al.
Published: (2024-11-01)

Image Captioning Based on Semantic Scenes
by: Fengzhi Zhao, et al.
Published: (2024-10-01)

Unveiling the Ultimate Meme Recipe: Image Embeddings for Identifying Top Meme Templates from r/Memes
by: Jan Sawicki
Published: (2025-04-01)

AVCaps: An Audio-Visual Dataset With Modality-Specific Captions
by: Parthasaarathy Sudarsanam, et al.
Published: (2025-01-01)

Content moderation assistance through image caption generation
by: Liam Kearns
Published: (2025-03-01)

A Study on Generating Maritime Image Captions Based on Transformer Dual Information Flow
by: Zhenqiang Zhao, et al.
Published: (2025-06-01)

Enhanced group relation learning via aligned attention masking for fashion product captioning
by: Yuhao Tang, et al.
Published: (2025-08-01)

Affective Image Captioning for Visual Artworks Using Emotion-Based Cross-Attention Mechanisms
by: Shintaro Ishikawa, et al.
Published: (2023-01-01)

Structure preserved ordinal unsupervised domain adaptation
by: Qing Tian, et al.
Published: (2024-11-01)

Semantic-Guided Selective Representation for Image Captioning
by: Yinan Li, et al.
Published: (2023-01-01)

Improving Visual Question Answering by Image Captioning
by: Xiangjun Shao, et al.
Published: (2025-01-01)

HI4HC and AAAAD: Exploring a hierarchical method and dataset using hybrid intelligence for remote sensing scene captioning
by: Jiaxin Ren, et al.
Published: (2025-05-01)

NuCap: A Numerically Aware Captioning Framework for Improved Numerical Reasoning
by: Yuna Jeong, et al.
Published: (2025-05-01)

THE NECESSITY OF PRODUCING A HIGH-QUALITY TRANSLATION OF CAPTIONS IN RADYA PUSTAKA MUSEUM
by: Dyah Ayu Nila Khrisna, et al.
Published: (2021-04-01)

Contrastive learning based remote sensing text-to-image generation for few-shot remote sensing image captioning
by: Haonan Zhou, et al.
Published: (2025-08-01)

Detailed Image Captioning and Hashtag Generation
by: Nikshep Shetty, et al.
Published: (2024-11-01)

Improved IEC performance via emotional stimuli-aware captioning
by: Zibo Zhou, et al.
Published: (2025-07-01)

DISCURSIVE AND SOCIAL PRACTICES IN INSTAGRAM CAPTIONS: EVIDENCE FROM INDONESIA
by: Hidayana Putri, et al.
Published: (2022-04-01)

Combining Region-Guided Attention and Attribute Prediction for Thangka Image Captioning Method
by: Fujun Zhang, et al.
Published: (2025-01-01)

Preliminary Study on Image Captioning for Construction Hazards
by: Wen-Ta Hsiao, et al.
Published: (2024-08-01)

ORDINAL LOGISTIC REGRESSION MODEL AND CLASSIFICATION TREE ON ORDINAL RESPONSE DATA
by: Jajang Jajang, et al.
Published: (2022-03-01)

CONDITIONS OF ECONOMIC ACTIVITY CO-ORDINATION AS A FACTOR OF SHAPING ORGANIZATIONAL STRUCTURES
by: Viktor E. Dementiev
Published: (2017-09-01)

Rapid video copy detection on compressed domain
by: ZHANG Yong-dong, et al.
Published: (2009-01-01)

Ordinal Random Processes
by: Christoph Bandt
Published: (2025-06-01)

Chinese Image Captioning Based on Deep Fusion Feature and Multi-Layer Feature Filtering Block
by: Xi Yang, et al.
Published: (2025-01-01)

Enhanced CLIP-GPT Framework for Cross-Lingual Remote Sensing Image Captioning
by: Rui Song, et al.
Published: (2025-01-01)

Thangka image captioning model with Salient Attention and Local Interaction Aggregator
by: Wenjin Hu, et al.
Published: (2024-11-01)

ENSEMBLE BAGGING WITH ORDINAL LOGISTIC REGRESSION TO CLASSIFY TODDLER NUTRITIONAL STATUS
by: Luthfia Hanun Yuli Arini, et al.
Published: (2025-01-01)

The ordinal of dynamical degrees of birational maps of the projective plane
by: Anna Bot
Published: (2024-03-01)