ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome data
BackgroundIdentifying bacterial taxa associated with disease phenotypes or clinical treatments over time is critical for understanding the underlying biological mechanism. Association testing for microbiome data is already challenging due to its complex distribution that involves sparsity, over-disp...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2025-01-01
|
Series: | Frontiers in Genetics |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fgene.2024.1494401/full |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832582901730902016 |
---|---|
author | Shuai Li Runzhe Li John R. Lee John R. Lee Ni Zhao Wodan Ling |
author_facet | Shuai Li Runzhe Li John R. Lee John R. Lee Ni Zhao Wodan Ling |
author_sort | Shuai Li |
collection | DOAJ |
description | BackgroundIdentifying bacterial taxa associated with disease phenotypes or clinical treatments over time is critical for understanding the underlying biological mechanism. Association testing for microbiome data is already challenging due to its complex distribution that involves sparsity, over-dispersion, heavy tails, etc. The longitudinal nature of the data adds another layer of complexity - one needs to account for the within-subject correlations to avoid biased results. Existing longitudinal differential abundance approaches usually depend on strong parametric assumptions, such as zero-inflated normal or negative binomial. However, the complex microbiome data frequently violate these distributional assumptions, leading to inflated false discovery rates. In addition, the existing methods are mostly mean-based, unable to identify heterogeneous associations such as tail events or subgroup effects, which could be important biomedical signals.MethodsWe propose a zero-inflated quantile approach for longitudinal (ZINQ-L) microbiome differential abundance test. A mixed-effects quantile rank-score-based test was proposed for hypothesis testing, which consists of a test in mixed-effects logistic model for the presence-absence status of the investigated taxon, and a series of mixed-effects quantile rank-score tests adjusted for zero inflation given its presence. As a regression method with minimal distributional assumptions, it is robust to the complex microbiome data, controlling false discovery rate, and is flexible to adjust for important covariates. Its comprehensive examination of the abundance distribution enables the identification of heterogeneous associations, improving the testing power.ResultsExtensive simulation studies and an application to a real kidney transplant microbiome study demonstrate the improved power of ZINQ-L in detecting true signals while controlling false discovery rates.ConclusionZINQ-L is a zero-inflated quantile-based approach for detecting individual taxa associated with outcomes or exposures in longitudinal microbiome studies, providing a robust and powerful option to improve and complement the existing methods in the field. |
format | Article |
id | doaj-art-5b8ec2e5d94947ca85668def19db9d90 |
institution | Kabale University |
issn | 1664-8021 |
language | English |
publishDate | 2025-01-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Genetics |
spelling | doaj-art-5b8ec2e5d94947ca85668def19db9d902025-01-29T06:45:59ZengFrontiers Media S.A.Frontiers in Genetics1664-80212025-01-011510.3389/fgene.2024.14944011494401ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome dataShuai Li0Runzhe Li1John R. Lee2John R. Lee3Ni Zhao4Wodan Ling5Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United StatesDepartment of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United StatesDivision of Nephrology and Hypertension, Department of Medicine, Weill Medical College of Cornell University, New York, NY, United StatesDepartment of Transplantation Medicine, New York Presbyterian Hospital–Weill Cornell Medical Center, New York, NY, United StatesDepartment of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United StatesDivision of Biostatistics, Department of Population Health Sciences, Weill Medical College of Cornell University, New York, NY, United StatesBackgroundIdentifying bacterial taxa associated with disease phenotypes or clinical treatments over time is critical for understanding the underlying biological mechanism. Association testing for microbiome data is already challenging due to its complex distribution that involves sparsity, over-dispersion, heavy tails, etc. The longitudinal nature of the data adds another layer of complexity - one needs to account for the within-subject correlations to avoid biased results. Existing longitudinal differential abundance approaches usually depend on strong parametric assumptions, such as zero-inflated normal or negative binomial. However, the complex microbiome data frequently violate these distributional assumptions, leading to inflated false discovery rates. In addition, the existing methods are mostly mean-based, unable to identify heterogeneous associations such as tail events or subgroup effects, which could be important biomedical signals.MethodsWe propose a zero-inflated quantile approach for longitudinal (ZINQ-L) microbiome differential abundance test. A mixed-effects quantile rank-score-based test was proposed for hypothesis testing, which consists of a test in mixed-effects logistic model for the presence-absence status of the investigated taxon, and a series of mixed-effects quantile rank-score tests adjusted for zero inflation given its presence. As a regression method with minimal distributional assumptions, it is robust to the complex microbiome data, controlling false discovery rate, and is flexible to adjust for important covariates. Its comprehensive examination of the abundance distribution enables the identification of heterogeneous associations, improving the testing power.ResultsExtensive simulation studies and an application to a real kidney transplant microbiome study demonstrate the improved power of ZINQ-L in detecting true signals while controlling false discovery rates.ConclusionZINQ-L is a zero-inflated quantile-based approach for detecting individual taxa associated with outcomes or exposures in longitudinal microbiome studies, providing a robust and powerful option to improve and complement the existing methods in the field.https://www.frontiersin.org/articles/10.3389/fgene.2024.1494401/fulllongitudinal microbiome studieszero inflation and dispersonmixed-effects modelsquantile rank-score testheterogeneous associations |
spellingShingle | Shuai Li Runzhe Li John R. Lee John R. Lee Ni Zhao Wodan Ling ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome data Frontiers in Genetics longitudinal microbiome studies zero inflation and disperson mixed-effects models quantile rank-score test heterogeneous associations |
title | ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome data |
title_full | ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome data |
title_fullStr | ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome data |
title_full_unstemmed | ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome data |
title_short | ZINQ-L: a zero-inflated quantile approach for differential abundance analysis of longitudinal microbiome data |
title_sort | zinq l a zero inflated quantile approach for differential abundance analysis of longitudinal microbiome data |
topic | longitudinal microbiome studies zero inflation and disperson mixed-effects models quantile rank-score test heterogeneous associations |
url | https://www.frontiersin.org/articles/10.3389/fgene.2024.1494401/full |
work_keys_str_mv | AT shuaili zinqlazeroinflatedquantileapproachfordifferentialabundanceanalysisoflongitudinalmicrobiomedata AT runzheli zinqlazeroinflatedquantileapproachfordifferentialabundanceanalysisoflongitudinalmicrobiomedata AT johnrlee zinqlazeroinflatedquantileapproachfordifferentialabundanceanalysisoflongitudinalmicrobiomedata AT johnrlee zinqlazeroinflatedquantileapproachfordifferentialabundanceanalysisoflongitudinalmicrobiomedata AT nizhao zinqlazeroinflatedquantileapproachfordifferentialabundanceanalysisoflongitudinalmicrobiomedata AT wodanling zinqlazeroinflatedquantileapproachfordifferentialabundanceanalysisoflongitudinalmicrobiomedata |