Performance benchmarking of multimodal data-driven approaches in industrial settingsZenedoZenedoZenedo

Data-driven solutions are increasingly transforming the industrial sector, yet collecting large-scale, multimodal datasets remains costly and challenging. This paper presents three synthetic multimodal datasets that replicate real-world industrial conditions across varying levels of complexity, desi...

Full description

Saved in:
Bibliographic Details
Main Authors: Diyar Altinses, Andreas Schwung
Format: Article
Language:English
Published: Elsevier 2025-09-01
Series:Machine Learning with Applications
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S266682702500074X
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Data-driven solutions are increasingly transforming the industrial sector, yet collecting large-scale, multimodal datasets remains costly and challenging. This paper presents three synthetic multimodal datasets that replicate real-world industrial conditions across varying levels of complexity, designed to benchmark multimodal machine learning models. We validate their utility through a series of experiments. Cross-modal prediction and domain adaptation demonstrate that the datasets effectively capture strong multimodal correlations. Multimodal reconstruction experiments confirm the internal consistency and richness of the fused representations, indicating that the modalities complement each other in capturing underlying structure. Additionally, multimodal regression significantly outperforms unimodal baselines, underscoring the predictive strength gained through multimodal integration. Together, these results demonstrate the utility of our datasets, establishing a solid baseline for future research and encouraging further advancements in industrial data-driven solutions.
ISSN:2666-8270