ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome data

BackgroundIdentifying bacterial taxa associated with disease phenotypes or clinical treatments over time is critical for understanding the underlying biological mechanism. Association testing for microbiome data is already challenging due to its complex distribution that involves sparsity, over-disp...

Full description

Saved in:
Bibliographic Details
Main Authors: Shuai Li, Runzhe Li, John R. Lee, Ni Zhao, Wodan Ling
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-01-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fgene.2024.1494401/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832582901730902016
author Shuai Li
Runzhe Li
John R. Lee
John R. Lee
Ni Zhao
Wodan Ling
author_facet Shuai Li
Runzhe Li
John R. Lee
John R. Lee
Ni Zhao
Wodan Ling
author_sort Shuai Li
collection DOAJ
description BackgroundIdentifying bacterial taxa associated with disease phenotypes or clinical treatments over time is critical for understanding the underlying biological mechanism. Association testing for microbiome data is already challenging due to its complex distribution that involves sparsity, over-dispersion, heavy tails, etc. The longitudinal nature of the data adds another layer of complexity - one needs to account for the within-subject correlations to avoid biased results. Existing longitudinal differential abundance approaches usually depend on strong parametric assumptions, such as zero-inflated normal or negative binomial. However, the complex microbiome data frequently violate these distributional assumptions, leading to inflated false discovery rates. In addition, the existing methods are mostly mean-based, unable to identify heterogeneous associations such as tail events or subgroup effects, which could be important biomedical signals.MethodsWe propose a zero-inflated quantile approach for longitudinal (ZINQ-L) microbiome differential abundance test. A mixed-effects quantile rank-score-based test was proposed for hypothesis testing, which consists of a test in mixed-effects logistic model for the presence-absence status of the investigated taxon, and a series of mixed-effects quantile rank-score tests adjusted for zero inflation given its presence. As a regression method with minimal distributional assumptions, it is robust to the complex microbiome data, controlling false discovery rate, and is flexible to adjust for important covariates. Its comprehensive examination of the abundance distribution enables the identification of heterogeneous associations, improving the testing power.ResultsExtensive simulation studies and an application to a real kidney transplant microbiome study demonstrate the improved power of ZINQ-L in detecting true signals while controlling false discovery rates.ConclusionZINQ-L is a zero-inflated quantile-based approach for detecting individual taxa associated with outcomes or exposures in longitudinal microbiome studies, providing a robust and powerful option to improve and complement the existing methods in the field.
format Article
id doaj-art-5b8ec2e5d94947ca85668def19db9d90
institution Kabale University
issn 1664-8021
language English
publishDate 2025-01-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Genetics
spelling doaj-art-5b8ec2e5d94947ca85668def19db9d902025-01-29T06:45:59ZengFrontiers Media S.A.Frontiers in Genetics1664-80212025-01-011510.3389/fgene.2024.14944011494401ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome dataShuai Li0Runzhe Li1John R. Lee2John R. Lee3Ni Zhao4Wodan Ling5Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United StatesDepartment of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United StatesDivision of Nephrology and Hypertension, Department of Medicine, Weill Medical College of Cornell University, New York, NY, United StatesDepartment of Transplantation Medicine, New York Presbyterian Hospital–Weill Cornell Medical Center, New York, NY, United StatesDepartment of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United StatesDivision of Biostatistics, Department of Population Health Sciences, Weill Medical College of Cornell University, New York, NY, United StatesBackgroundIdentifying bacterial taxa associated with disease phenotypes or clinical treatments over time is critical for understanding the underlying biological mechanism. Association testing for microbiome data is already challenging due to its complex distribution that involves sparsity, over-dispersion, heavy tails, etc. The longitudinal nature of the data adds another layer of complexity - one needs to account for the within-subject correlations to avoid biased results. Existing longitudinal differential abundance approaches usually depend on strong parametric assumptions, such as zero-inflated normal or negative binomial. However, the complex microbiome data frequently violate these distributional assumptions, leading to inflated false discovery rates. In addition, the existing methods are mostly mean-based, unable to identify heterogeneous associations such as tail events or subgroup effects, which could be important biomedical signals.MethodsWe propose a zero-inflated quantile approach for longitudinal (ZINQ-L) microbiome differential abundance test. A mixed-effects quantile rank-score-based test was proposed for hypothesis testing, which consists of a test in mixed-effects logistic model for the presence-absence status of the investigated taxon, and a series of mixed-effects quantile rank-score tests adjusted for zero inflation given its presence. As a regression method with minimal distributional assumptions, it is robust to the complex microbiome data, controlling false discovery rate, and is flexible to adjust for important covariates. Its comprehensive examination of the abundance distribution enables the identification of heterogeneous associations, improving the testing power.ResultsExtensive simulation studies and an application to a real kidney transplant microbiome study demonstrate the improved power of ZINQ-L in detecting true signals while controlling false discovery rates.ConclusionZINQ-L is a zero-inflated quantile-based approach for detecting individual taxa associated with outcomes or exposures in longitudinal microbiome studies, providing a robust and powerful option to improve and complement the existing methods in the field.https://www.frontiersin.org/articles/10.3389/fgene.2024.1494401/fulllongitudinal microbiome studieszero inflation and dispersonmixed-effects modelsquantile rank-score testheterogeneous associations
spellingShingle Shuai Li
Runzhe Li
John R. Lee
John R. Lee
Ni Zhao
Wodan Ling
ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome data
Frontiers in Genetics
longitudinal microbiome studies
zero inflation and disperson
mixed-effects models
quantile rank-score test
heterogeneous associations
title ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome data
title_full ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome data
title_fullStr ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome data
title_full_unstemmed ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome data
title_short ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome data
title_sort zinq l a zero inflated quantile approach for differential abundance analysis of longitudinal microbiome data
topic longitudinal microbiome studies
zero inflation and disperson
mixed-effects models
quantile rank-score test
heterogeneous associations
url https://www.frontiersin.org/articles/10.3389/fgene.2024.1494401/full
work_keys_str_mv AT shuaili zinqlazeroinflatedquantileapproachfordifferentialabundanceanalysisoflongitudinalmicrobiomedata
AT runzheli zinqlazeroinflatedquantileapproachfordifferentialabundanceanalysisoflongitudinalmicrobiomedata
AT johnrlee zinqlazeroinflatedquantileapproachfordifferentialabundanceanalysisoflongitudinalmicrobiomedata
AT johnrlee zinqlazeroinflatedquantileapproachfordifferentialabundanceanalysisoflongitudinalmicrobiomedata
AT nizhao zinqlazeroinflatedquantileapproachfordifferentialabundanceanalysisoflongitudinalmicrobiomedata
AT wodanling zinqlazeroinflatedquantileapproachfordifferentialabundanceanalysisoflongitudinalmicrobiomedata