Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information

Abstract Background Whole-genome sequencing (WGS) data has facilitated genome-wide identification of rare noncoding variants. However, elucidating these variants’ associations with complex diseases remains challenging. A previous study utilized a deep-learning-based framework and reported a signific...

Full description

Saved in:
Bibliographic Details
Main Authors: Runjia Li, Jason Ernst
Format: Article
Language:English
Published: BMC 2025-06-01
Series:Genome Biology
Online Access:https://doi.org/10.1186/s13059-025-03619-1
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849434175574114304
author Runjia Li
Jason Ernst
author_facet Runjia Li
Jason Ernst
author_sort Runjia Li
collection DOAJ
description Abstract Background Whole-genome sequencing (WGS) data has facilitated genome-wide identification of rare noncoding variants. However, elucidating these variants’ associations with complex diseases remains challenging. A previous study utilized a deep-learning-based framework and reported a significant brain-related association signal of autism spectrum disorder (ASD) detected from de novo noncoding variants in the Simons Simplex Collection (SSC) WGS cohort. Results We revisit the reported significant brain-related ASD association signal attributed to deep-learning and show that local GC content can capture similar association signals. We further show that the association signal appears driven by variants from male proband-female sibling pairs that are upstream of assigned genes. We then develop Expression Neighborhood Sequence Association Study (ENSAS), which utilizes gene expression correlations and sequence information, to more systematically identify phenotype-associated variant sets. Applying ENSAS to the same set of de novo variants, we identify gene expression-based neighborhoods showing significant ASD association signal, enriched for synapse-related gene ontology terms. For these top neighborhoods, we also identify chromatin state annotations of variants that are predictive of the proband-sibling local GC content differences. Conclusions Overall, our work simplifies a previously reported ASD signal and provides new insights into associations of noncoding de novo mutations in ASD. We also present a new analytical framework for understanding disease impact of de novo mutations, applicable to other phenotypes.
format Article
id doaj-art-1373fca1d92a49d9a3c85bc0d09391d7
institution Kabale University
issn 1474-760X
language English
publishDate 2025-06-01
publisher BMC
record_format Article
series Genome Biology
spelling doaj-art-1373fca1d92a49d9a3c85bc0d09391d72025-08-20T03:26:44ZengBMCGenome Biology1474-760X2025-06-0126112510.1186/s13059-025-03619-1Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex informationRunjia Li0Jason Ernst1Bioinformatics Interdepartmental Program, University of California, Los AngelesBioinformatics Interdepartmental Program, University of California, Los AngelesAbstract Background Whole-genome sequencing (WGS) data has facilitated genome-wide identification of rare noncoding variants. However, elucidating these variants’ associations with complex diseases remains challenging. A previous study utilized a deep-learning-based framework and reported a significant brain-related association signal of autism spectrum disorder (ASD) detected from de novo noncoding variants in the Simons Simplex Collection (SSC) WGS cohort. Results We revisit the reported significant brain-related ASD association signal attributed to deep-learning and show that local GC content can capture similar association signals. We further show that the association signal appears driven by variants from male proband-female sibling pairs that are upstream of assigned genes. We then develop Expression Neighborhood Sequence Association Study (ENSAS), which utilizes gene expression correlations and sequence information, to more systematically identify phenotype-associated variant sets. Applying ENSAS to the same set of de novo variants, we identify gene expression-based neighborhoods showing significant ASD association signal, enriched for synapse-related gene ontology terms. For these top neighborhoods, we also identify chromatin state annotations of variants that are predictive of the proband-sibling local GC content differences. Conclusions Overall, our work simplifies a previously reported ASD signal and provides new insights into associations of noncoding de novo mutations in ASD. We also present a new analytical framework for understanding disease impact of de novo mutations, applicable to other phenotypes.https://doi.org/10.1186/s13059-025-03619-1
spellingShingle Runjia Li
Jason Ernst
Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information
Genome Biology
title Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information
title_full Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information
title_fullStr Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information
title_full_unstemmed Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information
title_short Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information
title_sort identifying associations of de novo noncoding variants with autism through integration of gene expression sequence and sex information
url https://doi.org/10.1186/s13059-025-03619-1
work_keys_str_mv AT runjiali identifyingassociationsofdenovononcodingvariantswithautismthroughintegrationofgeneexpressionsequenceandsexinformation
AT jasonernst identifyingassociationsofdenovononcodingvariantswithautismthroughintegrationofgeneexpressionsequenceandsexinformation