Explainable Video Topics for Content Taxonomy: A Multimodal Retrieval Approach to Industry-Compliant Contextual Advertising

Owing to the increased video content consumption in recent years, the need for advanced contextual advertising methods that leverage increasing user engagement and relevance on advertisement-based video-on-demand platforms has increased. Traditional behavior-based advertisement targeting is waning,...

Full description

Saved in:
Bibliographic Details
Main Authors: Waruna de Silva, Anil Fernando
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10890950/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850189915733622784
author Waruna de Silva
Anil Fernando
author_facet Waruna de Silva
Anil Fernando
author_sort Waruna de Silva
collection DOAJ
description Owing to the increased video content consumption in recent years, the need for advanced contextual advertising methods that leverage increasing user engagement and relevance on advertisement-based video-on-demand platforms has increased. Traditional behavior-based advertisement targeting is waning, particularly owing to the recent strict privacy policies that favor user consent and privacy. This study proposes an innovative approach for integrating advanced natural language processing with multimodal analysis for video contextual advertising. To this end, transformer-based architectures, specifically BERTopic, computer vision techniques, and large language models were used to extract sets of topics from visual and textual video data automatically and systematically. The proposed framework decodes the taxonomy of content efficiently through videos in different levels of noise and languages. Empirical analysis of the YouTube-8M dataset shows the potential for the approach to change the paradigm in video advertising. Built to be scalable and easily adaptable, this solution can handle multifarious and complex user-generated content well, suited for a wide range of applications across various media platforms.
format Article
id doaj-art-c4eaf3cfafb8423284b88377ec6a9df7
institution OA Journals
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-c4eaf3cfafb8423284b88377ec6a9df72025-08-20T02:15:28ZengIEEEIEEE Access2169-35362025-01-0113305973061210.1109/ACCESS.2025.354256210890950Explainable Video Topics for Content Taxonomy: A Multimodal Retrieval Approach to Industry-Compliant Contextual AdvertisingWaruna de Silva0https://orcid.org/0009-0003-4707-3901Anil Fernando1Department of Computer and Information Sciences, University of Strathclyde, Glasgow, U.K.Department of Computer and Information Sciences, University of Strathclyde, Glasgow, U.K.Owing to the increased video content consumption in recent years, the need for advanced contextual advertising methods that leverage increasing user engagement and relevance on advertisement-based video-on-demand platforms has increased. Traditional behavior-based advertisement targeting is waning, particularly owing to the recent strict privacy policies that favor user consent and privacy. This study proposes an innovative approach for integrating advanced natural language processing with multimodal analysis for video contextual advertising. To this end, transformer-based architectures, specifically BERTopic, computer vision techniques, and large language models were used to extract sets of topics from visual and textual video data automatically and systematically. The proposed framework decodes the taxonomy of content efficiently through videos in different levels of noise and languages. Empirical analysis of the YouTube-8M dataset shows the potential for the approach to change the paradigm in video advertising. Built to be scalable and easily adaptable, this solution can handle multifarious and complex user-generated content well, suited for a wide range of applications across various media platforms.https://ieeexplore.ieee.org/document/10890950/Natural language processingvideo contextual advertisementsmultimodal fusiontopic modelingBERTopiccontextual taxonomy standards
spellingShingle Waruna de Silva
Anil Fernando
Explainable Video Topics for Content Taxonomy: A Multimodal Retrieval Approach to Industry-Compliant Contextual Advertising
IEEE Access
Natural language processing
video contextual advertisements
multimodal fusion
topic modeling
BERTopic
contextual taxonomy standards
title Explainable Video Topics for Content Taxonomy: A Multimodal Retrieval Approach to Industry-Compliant Contextual Advertising
title_full Explainable Video Topics for Content Taxonomy: A Multimodal Retrieval Approach to Industry-Compliant Contextual Advertising
title_fullStr Explainable Video Topics for Content Taxonomy: A Multimodal Retrieval Approach to Industry-Compliant Contextual Advertising
title_full_unstemmed Explainable Video Topics for Content Taxonomy: A Multimodal Retrieval Approach to Industry-Compliant Contextual Advertising
title_short Explainable Video Topics for Content Taxonomy: A Multimodal Retrieval Approach to Industry-Compliant Contextual Advertising
title_sort explainable video topics for content taxonomy a multimodal retrieval approach to industry compliant contextual advertising
topic Natural language processing
video contextual advertisements
multimodal fusion
topic modeling
BERTopic
contextual taxonomy standards
url https://ieeexplore.ieee.org/document/10890950/
work_keys_str_mv AT warunadesilva explainablevideotopicsforcontenttaxonomyamultimodalretrievalapproachtoindustrycompliantcontextualadvertising
AT anilfernando explainablevideotopicsforcontenttaxonomyamultimodalretrievalapproachtoindustrycompliantcontextualadvertising