TempO-seq and RNA-seq gene expression levels are highly correlated for most genes: A comparison using 39 human cell lines.

Recent advances in transcriptomics technologies allow for whole transcriptome gene expression profiling using targeted sequencing techniques, which is becoming increasingly popular due to logistical ease of data acquisition and analysis. As data from these targeted sequencing platforms accumulates,...

Full description

Saved in:
Bibliographic Details
Main Authors: Laura J Word, Clinton M Willis, Richard S Judson, Logan J Everett, Sarah E Davidson-Fritz, Derik E Haggard, Bryant A Chambers, Jesse D Rogers, Joseph L Bundy, Imran Shah, Nisha S Sipes, Joshua A Harrill
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0320862
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849327801465831424
author Laura J Word
Clinton M Willis
Richard S Judson
Logan J Everett
Sarah E Davidson-Fritz
Derik E Haggard
Bryant A Chambers
Jesse D Rogers
Joseph L Bundy
Imran Shah
Nisha S Sipes
Joshua A Harrill
author_facet Laura J Word
Clinton M Willis
Richard S Judson
Logan J Everett
Sarah E Davidson-Fritz
Derik E Haggard
Bryant A Chambers
Jesse D Rogers
Joseph L Bundy
Imran Shah
Nisha S Sipes
Joshua A Harrill
author_sort Laura J Word
collection DOAJ
description Recent advances in transcriptomics technologies allow for whole transcriptome gene expression profiling using targeted sequencing techniques, which is becoming increasingly popular due to logistical ease of data acquisition and analysis. As data from these targeted sequencing platforms accumulates, it is important to evaluate their similarity to traditional whole transcriptome RNA-seq. Thus, we evaluated the comparability of TempO-seq data from cell lysates to traditional RNA-Seq from purified RNA using baseline gene expression profiles. First, two TempO-seq data sets that were generated several months apart at different read depths were compared for six human cell lines. The average Pearson correlation was 0.93 (95% CI: 0.90-0.96) and principal component analysis (PCA) showed that these two TempO-seq data sets were highly reproducible and could be combined. Next, TempO-seq data was compared to RNA-Seq data for 39 human cell lines. The log2 normalized expression data for 19,290 genes within both platforms were well correlated between TempO-seq and RNA-seq (Pearson correlation 0.77, 95% CI: 0.76-0.78), and the majority of genes (15,480 genes, 80%) had concordant gene expression levels. PCA showed a platform divergence, but this was readily resolved by calculating relative log2 expression (RLE) of genes compared to the average expression across cell lines in each platform. Application of gene ontology analysis revealed that ontologies associated with histone and ribosomal functions were enriched for the 20% of genes with non-concordant expression levels (3,810 genes). On the other hand, gene ontologies annotated to cellular structure functions were enriched for genes with concordant expression levels between the platforms. In conclusion, we found TempO-seq baseline expression data to be reproducible at different read depths and found TempO-seq RLE data from lysed cells to be comparable to RNA-seq RLE data from purified RNA across 39 cell lines, even though the datasets were generated by different laboratories using different cell stocks.
format Article
id doaj-art-2e4e3689e25444f38d7a9aa9c6f4a890
institution Kabale University
issn 1932-6203
language English
publishDate 2025-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-2e4e3689e25444f38d7a9aa9c6f4a8902025-08-20T03:47:45ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01205e032086210.1371/journal.pone.0320862TempO-seq and RNA-seq gene expression levels are highly correlated for most genes: A comparison using 39 human cell lines.Laura J WordClinton M WillisRichard S JudsonLogan J EverettSarah E Davidson-FritzDerik E HaggardBryant A ChambersJesse D RogersJoseph L BundyImran ShahNisha S SipesJoshua A HarrillRecent advances in transcriptomics technologies allow for whole transcriptome gene expression profiling using targeted sequencing techniques, which is becoming increasingly popular due to logistical ease of data acquisition and analysis. As data from these targeted sequencing platforms accumulates, it is important to evaluate their similarity to traditional whole transcriptome RNA-seq. Thus, we evaluated the comparability of TempO-seq data from cell lysates to traditional RNA-Seq from purified RNA using baseline gene expression profiles. First, two TempO-seq data sets that were generated several months apart at different read depths were compared for six human cell lines. The average Pearson correlation was 0.93 (95% CI: 0.90-0.96) and principal component analysis (PCA) showed that these two TempO-seq data sets were highly reproducible and could be combined. Next, TempO-seq data was compared to RNA-Seq data for 39 human cell lines. The log2 normalized expression data for 19,290 genes within both platforms were well correlated between TempO-seq and RNA-seq (Pearson correlation 0.77, 95% CI: 0.76-0.78), and the majority of genes (15,480 genes, 80%) had concordant gene expression levels. PCA showed a platform divergence, but this was readily resolved by calculating relative log2 expression (RLE) of genes compared to the average expression across cell lines in each platform. Application of gene ontology analysis revealed that ontologies associated with histone and ribosomal functions were enriched for the 20% of genes with non-concordant expression levels (3,810 genes). On the other hand, gene ontologies annotated to cellular structure functions were enriched for genes with concordant expression levels between the platforms. In conclusion, we found TempO-seq baseline expression data to be reproducible at different read depths and found TempO-seq RLE data from lysed cells to be comparable to RNA-seq RLE data from purified RNA across 39 cell lines, even though the datasets were generated by different laboratories using different cell stocks.https://doi.org/10.1371/journal.pone.0320862
spellingShingle Laura J Word
Clinton M Willis
Richard S Judson
Logan J Everett
Sarah E Davidson-Fritz
Derik E Haggard
Bryant A Chambers
Jesse D Rogers
Joseph L Bundy
Imran Shah
Nisha S Sipes
Joshua A Harrill
TempO-seq and RNA-seq gene expression levels are highly correlated for most genes: A comparison using 39 human cell lines.
PLoS ONE
title TempO-seq and RNA-seq gene expression levels are highly correlated for most genes: A comparison using 39 human cell lines.
title_full TempO-seq and RNA-seq gene expression levels are highly correlated for most genes: A comparison using 39 human cell lines.
title_fullStr TempO-seq and RNA-seq gene expression levels are highly correlated for most genes: A comparison using 39 human cell lines.
title_full_unstemmed TempO-seq and RNA-seq gene expression levels are highly correlated for most genes: A comparison using 39 human cell lines.
title_short TempO-seq and RNA-seq gene expression levels are highly correlated for most genes: A comparison using 39 human cell lines.
title_sort tempo seq and rna seq gene expression levels are highly correlated for most genes a comparison using 39 human cell lines
url https://doi.org/10.1371/journal.pone.0320862
work_keys_str_mv AT laurajword temposeqandrnaseqgeneexpressionlevelsarehighlycorrelatedformostgenesacomparisonusing39humancelllines
AT clintonmwillis temposeqandrnaseqgeneexpressionlevelsarehighlycorrelatedformostgenesacomparisonusing39humancelllines
AT richardsjudson temposeqandrnaseqgeneexpressionlevelsarehighlycorrelatedformostgenesacomparisonusing39humancelllines
AT loganjeverett temposeqandrnaseqgeneexpressionlevelsarehighlycorrelatedformostgenesacomparisonusing39humancelllines
AT sarahedavidsonfritz temposeqandrnaseqgeneexpressionlevelsarehighlycorrelatedformostgenesacomparisonusing39humancelllines
AT derikehaggard temposeqandrnaseqgeneexpressionlevelsarehighlycorrelatedformostgenesacomparisonusing39humancelllines
AT bryantachambers temposeqandrnaseqgeneexpressionlevelsarehighlycorrelatedformostgenesacomparisonusing39humancelllines
AT jessedrogers temposeqandrnaseqgeneexpressionlevelsarehighlycorrelatedformostgenesacomparisonusing39humancelllines
AT josephlbundy temposeqandrnaseqgeneexpressionlevelsarehighlycorrelatedformostgenesacomparisonusing39humancelllines
AT imranshah temposeqandrnaseqgeneexpressionlevelsarehighlycorrelatedformostgenesacomparisonusing39humancelllines
AT nishassipes temposeqandrnaseqgeneexpressionlevelsarehighlycorrelatedformostgenesacomparisonusing39humancelllines
AT joshuaaharrill temposeqandrnaseqgeneexpressionlevelsarehighlycorrelatedformostgenesacomparisonusing39humancelllines