Alternative splicing induces sample-level variation in gene–gene correlations

Abstract Background The vast majority of genes in the genome are multi-exonic, and are alternatively spliced during transcription, resulting in multiple isoforms for each gene. For some genes, different mRNA isoforms may have differential expression levels or be involved in different pathways. Bulk...

Full description

Saved in:
Bibliographic Details
Main Authors: Yihao Lu, Brandon L. Pierce, Pei Wang, Fan Yang, Lin S. Chen
Format: Article
Language:English
Published: BMC 2024-12-01
Series:BMC Genomics
Subjects:
Online Access:https://doi.org/10.1186/s12864-024-11118-z
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850118916037148672
author Yihao Lu
Brandon L. Pierce
Pei Wang
Fan Yang
Lin S. Chen
author_facet Yihao Lu
Brandon L. Pierce
Pei Wang
Fan Yang
Lin S. Chen
author_sort Yihao Lu
collection DOAJ
description Abstract Background The vast majority of genes in the genome are multi-exonic, and are alternatively spliced during transcription, resulting in multiple isoforms for each gene. For some genes, different mRNA isoforms may have differential expression levels or be involved in different pathways. Bulk tissue RNA-seq, as a widely used technology for transcriptome quantification, measures the total expression (TE) levels of each gene across multiple isoforms in multiple cell types for each tissue sample. With recent developments in precise quantification of alternative splicing events for each gene, we propose to study the effects of alternative splicing variation on gene–gene correlation effects. We adopted a variance-component model for testing the TE–TE correlations of one gene with a co-expressed gene, accounting for the effects of splicing variation and splicing-by-TE interaction of one gene on the other. Results We analyzed data from the Genotype-Tissue Expression (GTEx) project (V8). At the 5% FDR level, 38,146 pairs of genes out of ∼10 M examined pairs from GTEx lung tissue showed significant TE-splicing interaction effects, implying isoform-specific and/or sample-specific TE–TE correlations. Additional analysis across 13 GTEx brain tissues revealed strong tissue-specificity of TE-splicing interaction effects. Moreover, we showed that accounting for splicing variation across samples could improve the reproducibility of results and could reduce potential confounding effects in studying co-expressed gene pairs with bulk tissue data. Many of those gene pairs had correlation effects specific to only certain isoforms and would otherwise be undetected. By analyzing gene–gene co-expression variation within functional pathways accounting for splicing, we characterized the patterns of the “hub” genes with isoform-specific regulatory effects on multiple other genes. Conclusions We showed that splicing variation of a gene may interact with TE of the gene and affect the TE of co-expressed genes, resulting in substantial tissue-specific inter-sample variability in gene–gene correlation effects. Accounting for TE-splicing interaction effects could reduce potential confounding effects and improve the robustness of estimation when estimating gene–gene correlations from bulk tissue expression data.
format Article
id doaj-art-571a304fcb84424eba0bf7b3599d7730
institution OA Journals
issn 1471-2164
language English
publishDate 2024-12-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj-art-571a304fcb84424eba0bf7b3599d77302025-08-20T02:35:45ZengBMCBMC Genomics1471-21642024-12-0123S411310.1186/s12864-024-11118-zAlternative splicing induces sample-level variation in gene–gene correlationsYihao Lu0Brandon L. Pierce1Pei Wang2Fan Yang3Lin S. Chen4Department of Public Health Sciences, University of ChicagoDepartment of Public Health Sciences, University of ChicagoDepartment of Genetics and Genomic Sciences, Icahn School of Medicine at Mount SinaiDepartment of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado DenverDepartment of Public Health Sciences, University of ChicagoAbstract Background The vast majority of genes in the genome are multi-exonic, and are alternatively spliced during transcription, resulting in multiple isoforms for each gene. For some genes, different mRNA isoforms may have differential expression levels or be involved in different pathways. Bulk tissue RNA-seq, as a widely used technology for transcriptome quantification, measures the total expression (TE) levels of each gene across multiple isoforms in multiple cell types for each tissue sample. With recent developments in precise quantification of alternative splicing events for each gene, we propose to study the effects of alternative splicing variation on gene–gene correlation effects. We adopted a variance-component model for testing the TE–TE correlations of one gene with a co-expressed gene, accounting for the effects of splicing variation and splicing-by-TE interaction of one gene on the other. Results We analyzed data from the Genotype-Tissue Expression (GTEx) project (V8). At the 5% FDR level, 38,146 pairs of genes out of ∼10 M examined pairs from GTEx lung tissue showed significant TE-splicing interaction effects, implying isoform-specific and/or sample-specific TE–TE correlations. Additional analysis across 13 GTEx brain tissues revealed strong tissue-specificity of TE-splicing interaction effects. Moreover, we showed that accounting for splicing variation across samples could improve the reproducibility of results and could reduce potential confounding effects in studying co-expressed gene pairs with bulk tissue data. Many of those gene pairs had correlation effects specific to only certain isoforms and would otherwise be undetected. By analyzing gene–gene co-expression variation within functional pathways accounting for splicing, we characterized the patterns of the “hub” genes with isoform-specific regulatory effects on multiple other genes. Conclusions We showed that splicing variation of a gene may interact with TE of the gene and affect the TE of co-expressed genes, resulting in substantial tissue-specific inter-sample variability in gene–gene correlation effects. Accounting for TE-splicing interaction effects could reduce potential confounding effects and improve the robustness of estimation when estimating gene–gene correlations from bulk tissue expression data.https://doi.org/10.1186/s12864-024-11118-zAlternative splicingTotal expressionIsoformGene–gene correlationSample-level variation
spellingShingle Yihao Lu
Brandon L. Pierce
Pei Wang
Fan Yang
Lin S. Chen
Alternative splicing induces sample-level variation in gene–gene correlations
BMC Genomics
Alternative splicing
Total expression
Isoform
Gene–gene correlation
Sample-level variation
title Alternative splicing induces sample-level variation in gene–gene correlations
title_full Alternative splicing induces sample-level variation in gene–gene correlations
title_fullStr Alternative splicing induces sample-level variation in gene–gene correlations
title_full_unstemmed Alternative splicing induces sample-level variation in gene–gene correlations
title_short Alternative splicing induces sample-level variation in gene–gene correlations
title_sort alternative splicing induces sample level variation in gene gene correlations
topic Alternative splicing
Total expression
Isoform
Gene–gene correlation
Sample-level variation
url https://doi.org/10.1186/s12864-024-11118-z
work_keys_str_mv AT yihaolu alternativesplicinginducessamplelevelvariationingenegenecorrelations
AT brandonlpierce alternativesplicinginducessamplelevelvariationingenegenecorrelations
AT peiwang alternativesplicinginducessamplelevelvariationingenegenecorrelations
AT fanyang alternativesplicinginducessamplelevelvariationingenegenecorrelations
AT linschen alternativesplicinginducessamplelevelvariationingenegenecorrelations