Evaluating Directed Acyclic Graphs with DAGMetrics: Insights from Tuber and Soil Microbiome Data

Understanding and evaluating directed acyclic graphs (DAGs) is crucial for causal discovery, particularly in high-dimensional and small-sample datasets such as microbial abundance data. This study introduces DAGMetrics, an R package designed to comprehensively evaluate and compare DAGs. The package...

Full description

Saved in:
Bibliographic Details
Main Authors: Pavel Averin, Ifigeneia Mellidou, Maria Ganopoulou, Aliki Xanthopoulou, Theodoros Moysiadis
Format: Article
Language:English
Published: MDPI AG 2025-04-01
Series:Agronomy
Subjects:
Online Access:https://www.mdpi.com/2073-4395/15/4/987
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850144149718695936
author Pavel Averin
Ifigeneia Mellidou
Maria Ganopoulou
Aliki Xanthopoulou
Theodoros Moysiadis
author_facet Pavel Averin
Ifigeneia Mellidou
Maria Ganopoulou
Aliki Xanthopoulou
Theodoros Moysiadis
author_sort Pavel Averin
collection DOAJ
description Understanding and evaluating directed acyclic graphs (DAGs) is crucial for causal discovery, particularly in high-dimensional and small-sample datasets such as microbial abundance data. This study introduces DAGMetrics, an R package designed to comprehensively evaluate and compare DAGs. The package provides descriptive and comparative metrics, streamlining the assessment of outputs from various structure learning algorithms. It was applied to datasets generated for potato tubers and soils from different terroirs (continental and island) and stages (at harvest and post-harvest). Using a comprehensive set of descriptive and comparative metrics, DAGMetrics facilitated model selection by identifying balanced and robust DAGs. The PC algorithm with Spearman correlation produced DAGs with moderate complexity and high stability across scaling and transformation setups. Additionally, the package enabled detailed exploration of the Markov blanket space, revealing small Markov blankets (up to seven nodes) and numerous isolated nodes. Identified matching edges between Markov blankets across different terroirs and stages aligned with known microbial interactions, highlighting the package’s utility in facilitating the discovery of biologically meaningful relationships. This study illustrates the utility of DAGMetrics in providing objective and reproducible tools for DAG evaluation along with its potential application in agronomic and other domains involving complex structured data.
format Article
id doaj-art-84d2ebf6ceac4c968e34fda6964165e2
institution OA Journals
issn 2073-4395
language English
publishDate 2025-04-01
publisher MDPI AG
record_format Article
series Agronomy
spelling doaj-art-84d2ebf6ceac4c968e34fda6964165e22025-08-20T02:28:27ZengMDPI AGAgronomy2073-43952025-04-0115498710.3390/agronomy15040987Evaluating Directed Acyclic Graphs with DAGMetrics: Insights from Tuber and Soil Microbiome DataPavel Averin0Ifigeneia Mellidou1Maria Ganopoulou2Aliki Xanthopoulou3Theodoros Moysiadis4Department of Computer Science, School of Sciences and Engineering, University of Nicosia, Nicosia 2417, CyprusInstitute of Plant Breeding and Genetic Resources, ELGO-DIMITRA, 57001 Thessaloniki, GreeceSchool of Informatics, Aristotle University of Thessaloniki, 54124 Thessaloniki, GreeceInstitute of Plant Breeding and Genetic Resources, ELGO-DIMITRA, 57001 Thessaloniki, GreeceDepartment of Computer Science, School of Sciences and Engineering, University of Nicosia, Nicosia 2417, CyprusUnderstanding and evaluating directed acyclic graphs (DAGs) is crucial for causal discovery, particularly in high-dimensional and small-sample datasets such as microbial abundance data. This study introduces DAGMetrics, an R package designed to comprehensively evaluate and compare DAGs. The package provides descriptive and comparative metrics, streamlining the assessment of outputs from various structure learning algorithms. It was applied to datasets generated for potato tubers and soils from different terroirs (continental and island) and stages (at harvest and post-harvest). Using a comprehensive set of descriptive and comparative metrics, DAGMetrics facilitated model selection by identifying balanced and robust DAGs. The PC algorithm with Spearman correlation produced DAGs with moderate complexity and high stability across scaling and transformation setups. Additionally, the package enabled detailed exploration of the Markov blanket space, revealing small Markov blankets (up to seven nodes) and numerous isolated nodes. Identified matching edges between Markov blankets across different terroirs and stages aligned with known microbial interactions, highlighting the package’s utility in facilitating the discovery of biologically meaningful relationships. This study illustrates the utility of DAGMetrics in providing objective and reproducible tools for DAG evaluation along with its potential application in agronomic and other domains involving complex structured data.https://www.mdpi.com/2073-4395/15/4/987agronomic data analysisBayesian networkscausal discoverydirected acyclic graphsMarkov blanketmetagenomics
spellingShingle Pavel Averin
Ifigeneia Mellidou
Maria Ganopoulou
Aliki Xanthopoulou
Theodoros Moysiadis
Evaluating Directed Acyclic Graphs with DAGMetrics: Insights from Tuber and Soil Microbiome Data
Agronomy
agronomic data analysis
Bayesian networks
causal discovery
directed acyclic graphs
Markov blanket
metagenomics
title Evaluating Directed Acyclic Graphs with DAGMetrics: Insights from Tuber and Soil Microbiome Data
title_full Evaluating Directed Acyclic Graphs with DAGMetrics: Insights from Tuber and Soil Microbiome Data
title_fullStr Evaluating Directed Acyclic Graphs with DAGMetrics: Insights from Tuber and Soil Microbiome Data
title_full_unstemmed Evaluating Directed Acyclic Graphs with DAGMetrics: Insights from Tuber and Soil Microbiome Data
title_short Evaluating Directed Acyclic Graphs with DAGMetrics: Insights from Tuber and Soil Microbiome Data
title_sort evaluating directed acyclic graphs with dagmetrics insights from tuber and soil microbiome data
topic agronomic data analysis
Bayesian networks
causal discovery
directed acyclic graphs
Markov blanket
metagenomics
url https://www.mdpi.com/2073-4395/15/4/987
work_keys_str_mv AT pavelaverin evaluatingdirectedacyclicgraphswithdagmetricsinsightsfromtuberandsoilmicrobiomedata
AT ifigeneiamellidou evaluatingdirectedacyclicgraphswithdagmetricsinsightsfromtuberandsoilmicrobiomedata
AT mariaganopoulou evaluatingdirectedacyclicgraphswithdagmetricsinsightsfromtuberandsoilmicrobiomedata
AT alikixanthopoulou evaluatingdirectedacyclicgraphswithdagmetricsinsightsfromtuberandsoilmicrobiomedata
AT theodorosmoysiadis evaluatingdirectedacyclicgraphswithdagmetricsinsightsfromtuberandsoilmicrobiomedata