OVALYTICS: Enhancing Offensive Video Detection with YouTube Transcriptions and Advanced Language Models

The exponential growth of offensive content online underscores the need for robust content moderation. In response, this work presents OVALYTICS (Offensive Video Analysis Leveraging YouTube Transcriptions with Intelligent Classification System), a comprehensive framework that introduces novel integr...

Full description

Saved in:
Bibliographic Details
Main Authors: Sneha Chinivar, Roopa M.S., Arunalatha J.S., Venugopal K.R.
Format: Article
Language:English
Published: Elsevier 2025-06-01
Series:Natural Language Processing Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2949719125000238
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849423302600163328
author Sneha Chinivar
Roopa M.S.
Arunalatha J.S.
Venugopal K.R.
author_facet Sneha Chinivar
Roopa M.S.
Arunalatha J.S.
Venugopal K.R.
author_sort Sneha Chinivar
collection DOAJ
description The exponential growth of offensive content online underscores the need for robust content moderation. In response, this work presents OVALYTICS (Offensive Video Analysis Leveraging YouTube Transcriptions with Intelligent Classification System), a comprehensive framework that introduces novel integrations of advanced technologies for offensive video detection. Unlike existing approaches, OVALYTICS uniquely combines Whisper AI for accurate audio-to-text transcription with state-of-the-art large language models (LLMs) such as BERT, ALBERT, XLM-R, MPNet, and T5 for semantic analysis. The framework also features a newly curated dataset tailored for fine-grained evaluation, achieving significant improvements in accuracy and F1-scores over traditional methods and advancing the state of automated content moderation.
format Article
id doaj-art-0642116cc6ac4fd8977dddd9fb30e05a
institution Kabale University
issn 2949-7191
language English
publishDate 2025-06-01
publisher Elsevier
record_format Article
series Natural Language Processing Journal
spelling doaj-art-0642116cc6ac4fd8977dddd9fb30e05a2025-08-20T03:30:39ZengElsevierNatural Language Processing Journal2949-71912025-06-011110014710.1016/j.nlp.2025.100147OVALYTICS: Enhancing Offensive Video Detection with YouTube Transcriptions and Advanced Language ModelsSneha Chinivar0Roopa M.S.1Arunalatha J.S.2Venugopal K.R.3Department of Computer Science, University Visvesvaraya College of Engineering, Bengaluru, India; Corresponding author.Department of Computer Science, Nitte Meenakshi Institute of Technology, Bengaluru, IndiaDepartment of Computer Science, University Visvesvaraya College of Engineering, Bengaluru, IndiaDepartment of Computer Science, University Visvesvaraya College of Engineering, Bengaluru, IndiaThe exponential growth of offensive content online underscores the need for robust content moderation. In response, this work presents OVALYTICS (Offensive Video Analysis Leveraging YouTube Transcriptions with Intelligent Classification System), a comprehensive framework that introduces novel integrations of advanced technologies for offensive video detection. Unlike existing approaches, OVALYTICS uniquely combines Whisper AI for accurate audio-to-text transcription with state-of-the-art large language models (LLMs) such as BERT, ALBERT, XLM-R, MPNet, and T5 for semantic analysis. The framework also features a newly curated dataset tailored for fine-grained evaluation, achieving significant improvements in accuracy and F1-scores over traditional methods and advancing the state of automated content moderation.http://www.sciencedirect.com/science/article/pii/S2949719125000238Classification headLarge language modelsOffensive video detectionText transcriptionWhisper AI
spellingShingle Sneha Chinivar
Roopa M.S.
Arunalatha J.S.
Venugopal K.R.
OVALYTICS: Enhancing Offensive Video Detection with YouTube Transcriptions and Advanced Language Models
Natural Language Processing Journal
Classification head
Large language models
Offensive video detection
Text transcription
Whisper AI
title OVALYTICS: Enhancing Offensive Video Detection with YouTube Transcriptions and Advanced Language Models
title_full OVALYTICS: Enhancing Offensive Video Detection with YouTube Transcriptions and Advanced Language Models
title_fullStr OVALYTICS: Enhancing Offensive Video Detection with YouTube Transcriptions and Advanced Language Models
title_full_unstemmed OVALYTICS: Enhancing Offensive Video Detection with YouTube Transcriptions and Advanced Language Models
title_short OVALYTICS: Enhancing Offensive Video Detection with YouTube Transcriptions and Advanced Language Models
title_sort ovalytics enhancing offensive video detection with youtube transcriptions and advanced language models
topic Classification head
Large language models
Offensive video detection
Text transcription
Whisper AI
url http://www.sciencedirect.com/science/article/pii/S2949719125000238
work_keys_str_mv AT snehachinivar ovalyticsenhancingoffensivevideodetectionwithyoutubetranscriptionsandadvancedlanguagemodels
AT roopams ovalyticsenhancingoffensivevideodetectionwithyoutubetranscriptionsandadvancedlanguagemodels
AT arunalathajs ovalyticsenhancingoffensivevideodetectionwithyoutubetranscriptionsandadvancedlanguagemodels
AT venugopalkr ovalyticsenhancingoffensivevideodetectionwithyoutubetranscriptionsandadvancedlanguagemodels