Non-parametric correlation structures and their respective embeddings in predictive analysis

Abstract In data analysis, there is a natural connection between correlation and visual analysis. Correlation analysis enables the identification of relationships between attribute pairs in multidimensional datasets, while visual data analysis focuses on representing the data and the models that pro...

Full description

Saved in:
Bibliographic Details
Main Author: Adam Dudáš
Format: Article
Language:English
Published: Springer 2025-05-01
Series:Discover Applied Sciences
Subjects:
Online Access:https://doi.org/10.1007/s42452-025-07022-0
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849734048195280896
author Adam Dudáš
author_facet Adam Dudáš
author_sort Adam Dudáš
collection DOAJ
description Abstract In data analysis, there is a natural connection between correlation and visual analysis. Correlation analysis enables the identification of relationships between attribute pairs in multidimensional datasets, while visual data analysis focuses on representing the data and the models that process it in a comprehensible visual form. A notable model that integrates these two approaches is the correlation structure—a graphical and visual model based on correlation graphs and correlation chains. However, traditional correlation structures are parametric, requiring an attribute of interest as an input to guide the analysis. This presents a major challenge when analyzing unfamiliar datasets or domains, where the attributes of interest are not known in advance. To address this limitation, this study proposes the design and implementation of non-parametric correlation structures—specifically, non-parametric correlation graphs and non-parametric correlation chains—that eliminate the dependency on predefined input attributes. These models are implemented in Python language to ensure broad accessibility and integration into existing data analysis pipelines. Once implemented, the concept of embedding non-parametric correlation chains within non-parametric correlation graphs is explored to enhance the interpretability of correlation structures of a dataset. The proposed approach is evaluated through case studies on five open-access datasets, varying in size from 6 to 149 attributes, and tested using five regression analysis models. Finally, the advantages and disadvantages of the proposed model are determined.
format Article
id doaj-art-73355f6b87884b198c225eacf7fdbfed
institution DOAJ
issn 3004-9261
language English
publishDate 2025-05-01
publisher Springer
record_format Article
series Discover Applied Sciences
spelling doaj-art-73355f6b87884b198c225eacf7fdbfed2025-08-20T03:07:54ZengSpringerDiscover Applied Sciences3004-92612025-05-017512110.1007/s42452-025-07022-0Non-parametric correlation structures and their respective embeddings in predictive analysisAdam Dudáš0Department of Computer Science, Faculty of Natural Sciences, Matej Bel UniversityAbstract In data analysis, there is a natural connection between correlation and visual analysis. Correlation analysis enables the identification of relationships between attribute pairs in multidimensional datasets, while visual data analysis focuses on representing the data and the models that process it in a comprehensible visual form. A notable model that integrates these two approaches is the correlation structure—a graphical and visual model based on correlation graphs and correlation chains. However, traditional correlation structures are parametric, requiring an attribute of interest as an input to guide the analysis. This presents a major challenge when analyzing unfamiliar datasets or domains, where the attributes of interest are not known in advance. To address this limitation, this study proposes the design and implementation of non-parametric correlation structures—specifically, non-parametric correlation graphs and non-parametric correlation chains—that eliminate the dependency on predefined input attributes. These models are implemented in Python language to ensure broad accessibility and integration into existing data analysis pipelines. Once implemented, the concept of embedding non-parametric correlation chains within non-parametric correlation graphs is explored to enhance the interpretability of correlation structures of a dataset. The proposed approach is evaluated through case studies on five open-access datasets, varying in size from 6 to 149 attributes, and tested using five regression analysis models. Finally, the advantages and disadvantages of the proposed model are determined.https://doi.org/10.1007/s42452-025-07022-0Correlation structuresVisual analysisCorrelation analysisPattern recognitionRegression analysis
spellingShingle Adam Dudáš
Non-parametric correlation structures and their respective embeddings in predictive analysis
Discover Applied Sciences
Correlation structures
Visual analysis
Correlation analysis
Pattern recognition
Regression analysis
title Non-parametric correlation structures and their respective embeddings in predictive analysis
title_full Non-parametric correlation structures and their respective embeddings in predictive analysis
title_fullStr Non-parametric correlation structures and their respective embeddings in predictive analysis
title_full_unstemmed Non-parametric correlation structures and their respective embeddings in predictive analysis
title_short Non-parametric correlation structures and their respective embeddings in predictive analysis
title_sort non parametric correlation structures and their respective embeddings in predictive analysis
topic Correlation structures
Visual analysis
Correlation analysis
Pattern recognition
Regression analysis
url https://doi.org/10.1007/s42452-025-07022-0
work_keys_str_mv AT adamdudas nonparametriccorrelationstructuresandtheirrespectiveembeddingsinpredictiveanalysis