Non-parametric correlation structures and their respective embeddings in predictive analysis
Abstract In data analysis, there is a natural connection between correlation and visual analysis. Correlation analysis enables the identification of relationships between attribute pairs in multidimensional datasets, while visual data analysis focuses on representing the data and the models that pro...
Saved in:
| Main Author: | |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Springer
2025-05-01
|
| Series: | Discover Applied Sciences |
| Subjects: | |
| Online Access: | https://doi.org/10.1007/s42452-025-07022-0 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849734048195280896 |
|---|---|
| author | Adam Dudáš |
| author_facet | Adam Dudáš |
| author_sort | Adam Dudáš |
| collection | DOAJ |
| description | Abstract In data analysis, there is a natural connection between correlation and visual analysis. Correlation analysis enables the identification of relationships between attribute pairs in multidimensional datasets, while visual data analysis focuses on representing the data and the models that process it in a comprehensible visual form. A notable model that integrates these two approaches is the correlation structure—a graphical and visual model based on correlation graphs and correlation chains. However, traditional correlation structures are parametric, requiring an attribute of interest as an input to guide the analysis. This presents a major challenge when analyzing unfamiliar datasets or domains, where the attributes of interest are not known in advance. To address this limitation, this study proposes the design and implementation of non-parametric correlation structures—specifically, non-parametric correlation graphs and non-parametric correlation chains—that eliminate the dependency on predefined input attributes. These models are implemented in Python language to ensure broad accessibility and integration into existing data analysis pipelines. Once implemented, the concept of embedding non-parametric correlation chains within non-parametric correlation graphs is explored to enhance the interpretability of correlation structures of a dataset. The proposed approach is evaluated through case studies on five open-access datasets, varying in size from 6 to 149 attributes, and tested using five regression analysis models. Finally, the advantages and disadvantages of the proposed model are determined. |
| format | Article |
| id | doaj-art-73355f6b87884b198c225eacf7fdbfed |
| institution | DOAJ |
| issn | 3004-9261 |
| language | English |
| publishDate | 2025-05-01 |
| publisher | Springer |
| record_format | Article |
| series | Discover Applied Sciences |
| spelling | doaj-art-73355f6b87884b198c225eacf7fdbfed2025-08-20T03:07:54ZengSpringerDiscover Applied Sciences3004-92612025-05-017512110.1007/s42452-025-07022-0Non-parametric correlation structures and their respective embeddings in predictive analysisAdam Dudáš0Department of Computer Science, Faculty of Natural Sciences, Matej Bel UniversityAbstract In data analysis, there is a natural connection between correlation and visual analysis. Correlation analysis enables the identification of relationships between attribute pairs in multidimensional datasets, while visual data analysis focuses on representing the data and the models that process it in a comprehensible visual form. A notable model that integrates these two approaches is the correlation structure—a graphical and visual model based on correlation graphs and correlation chains. However, traditional correlation structures are parametric, requiring an attribute of interest as an input to guide the analysis. This presents a major challenge when analyzing unfamiliar datasets or domains, where the attributes of interest are not known in advance. To address this limitation, this study proposes the design and implementation of non-parametric correlation structures—specifically, non-parametric correlation graphs and non-parametric correlation chains—that eliminate the dependency on predefined input attributes. These models are implemented in Python language to ensure broad accessibility and integration into existing data analysis pipelines. Once implemented, the concept of embedding non-parametric correlation chains within non-parametric correlation graphs is explored to enhance the interpretability of correlation structures of a dataset. The proposed approach is evaluated through case studies on five open-access datasets, varying in size from 6 to 149 attributes, and tested using five regression analysis models. Finally, the advantages and disadvantages of the proposed model are determined.https://doi.org/10.1007/s42452-025-07022-0Correlation structuresVisual analysisCorrelation analysisPattern recognitionRegression analysis |
| spellingShingle | Adam Dudáš Non-parametric correlation structures and their respective embeddings in predictive analysis Discover Applied Sciences Correlation structures Visual analysis Correlation analysis Pattern recognition Regression analysis |
| title | Non-parametric correlation structures and their respective embeddings in predictive analysis |
| title_full | Non-parametric correlation structures and their respective embeddings in predictive analysis |
| title_fullStr | Non-parametric correlation structures and their respective embeddings in predictive analysis |
| title_full_unstemmed | Non-parametric correlation structures and their respective embeddings in predictive analysis |
| title_short | Non-parametric correlation structures and their respective embeddings in predictive analysis |
| title_sort | non parametric correlation structures and their respective embeddings in predictive analysis |
| topic | Correlation structures Visual analysis Correlation analysis Pattern recognition Regression analysis |
| url | https://doi.org/10.1007/s42452-025-07022-0 |
| work_keys_str_mv | AT adamdudas nonparametriccorrelationstructuresandtheirrespectiveembeddingsinpredictiveanalysis |