Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information
Abstract Background Whole-genome sequencing (WGS) data has facilitated genome-wide identification of rare noncoding variants. However, elucidating these variants’ associations with complex diseases remains challenging. A previous study utilized a deep-learning-based framework and reported a signific...
Saved in:
| Main Authors: | , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
BMC
2025-06-01
|
| Series: | Genome Biology |
| Online Access: | https://doi.org/10.1186/s13059-025-03619-1 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849434175574114304 |
|---|---|
| author | Runjia Li Jason Ernst |
| author_facet | Runjia Li Jason Ernst |
| author_sort | Runjia Li |
| collection | DOAJ |
| description | Abstract Background Whole-genome sequencing (WGS) data has facilitated genome-wide identification of rare noncoding variants. However, elucidating these variants’ associations with complex diseases remains challenging. A previous study utilized a deep-learning-based framework and reported a significant brain-related association signal of autism spectrum disorder (ASD) detected from de novo noncoding variants in the Simons Simplex Collection (SSC) WGS cohort. Results We revisit the reported significant brain-related ASD association signal attributed to deep-learning and show that local GC content can capture similar association signals. We further show that the association signal appears driven by variants from male proband-female sibling pairs that are upstream of assigned genes. We then develop Expression Neighborhood Sequence Association Study (ENSAS), which utilizes gene expression correlations and sequence information, to more systematically identify phenotype-associated variant sets. Applying ENSAS to the same set of de novo variants, we identify gene expression-based neighborhoods showing significant ASD association signal, enriched for synapse-related gene ontology terms. For these top neighborhoods, we also identify chromatin state annotations of variants that are predictive of the proband-sibling local GC content differences. Conclusions Overall, our work simplifies a previously reported ASD signal and provides new insights into associations of noncoding de novo mutations in ASD. We also present a new analytical framework for understanding disease impact of de novo mutations, applicable to other phenotypes. |
| format | Article |
| id | doaj-art-1373fca1d92a49d9a3c85bc0d09391d7 |
| institution | Kabale University |
| issn | 1474-760X |
| language | English |
| publishDate | 2025-06-01 |
| publisher | BMC |
| record_format | Article |
| series | Genome Biology |
| spelling | doaj-art-1373fca1d92a49d9a3c85bc0d09391d72025-08-20T03:26:44ZengBMCGenome Biology1474-760X2025-06-0126112510.1186/s13059-025-03619-1Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex informationRunjia Li0Jason Ernst1Bioinformatics Interdepartmental Program, University of California, Los AngelesBioinformatics Interdepartmental Program, University of California, Los AngelesAbstract Background Whole-genome sequencing (WGS) data has facilitated genome-wide identification of rare noncoding variants. However, elucidating these variants’ associations with complex diseases remains challenging. A previous study utilized a deep-learning-based framework and reported a significant brain-related association signal of autism spectrum disorder (ASD) detected from de novo noncoding variants in the Simons Simplex Collection (SSC) WGS cohort. Results We revisit the reported significant brain-related ASD association signal attributed to deep-learning and show that local GC content can capture similar association signals. We further show that the association signal appears driven by variants from male proband-female sibling pairs that are upstream of assigned genes. We then develop Expression Neighborhood Sequence Association Study (ENSAS), which utilizes gene expression correlations and sequence information, to more systematically identify phenotype-associated variant sets. Applying ENSAS to the same set of de novo variants, we identify gene expression-based neighborhoods showing significant ASD association signal, enriched for synapse-related gene ontology terms. For these top neighborhoods, we also identify chromatin state annotations of variants that are predictive of the proband-sibling local GC content differences. Conclusions Overall, our work simplifies a previously reported ASD signal and provides new insights into associations of noncoding de novo mutations in ASD. We also present a new analytical framework for understanding disease impact of de novo mutations, applicable to other phenotypes.https://doi.org/10.1186/s13059-025-03619-1 |
| spellingShingle | Runjia Li Jason Ernst Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information Genome Biology |
| title | Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information |
| title_full | Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information |
| title_fullStr | Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information |
| title_full_unstemmed | Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information |
| title_short | Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information |
| title_sort | identifying associations of de novo noncoding variants with autism through integration of gene expression sequence and sex information |
| url | https://doi.org/10.1186/s13059-025-03619-1 |
| work_keys_str_mv | AT runjiali identifyingassociationsofdenovononcodingvariantswithautismthroughintegrationofgeneexpressionsequenceandsexinformation AT jasonernst identifyingassociationsofdenovononcodingvariantswithautismthroughintegrationofgeneexpressionsequenceandsexinformation |