Identification and quantification of transposable element transcripts using Long-Read RNA-seq in Drosophila germline tissues

Transposable elements (TEs) are repeated DNA sequences potentially able to move throughout the genome. In addition to their inherent mutagenic effects, TEs can disrupt nearby genes by donating their intrinsic regulatory sequences, for instance, promoting the ectopic expression of a cellular gene. TE...

Full description

Saved in:
Bibliographic Details
Main Authors: Rebollo, Rita, Gerenton, Pierre, Cumunel, Eric, Mary, Arnaud, Sabot, François, Burlet, Nelly, Gillet, Benjamin, Hughes, Sandrine, S. Oliveira, Daniel, Goubert, Clément, Fablet, Marie, Vieira, Cristina, Lacroix, Vincent
Format: Article
Language:English
Published: Peer Community In 2024-09-01
Series:Peer Community Journal
Subjects:
Online Access:https://peercommunityjournal.org/articles/10.24072/pcjournal.457/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1825206434863775744
author Rebollo, Rita
Gerenton, Pierre
Cumunel, Eric
Mary, Arnaud
Sabot, François
Burlet, Nelly
Gillet, Benjamin
Hughes, Sandrine
S. Oliveira, Daniel
Goubert, Clément
Fablet, Marie
Vieira, Cristina
Lacroix, Vincent
author_facet Rebollo, Rita
Gerenton, Pierre
Cumunel, Eric
Mary, Arnaud
Sabot, François
Burlet, Nelly
Gillet, Benjamin
Hughes, Sandrine
S. Oliveira, Daniel
Goubert, Clément
Fablet, Marie
Vieira, Cristina
Lacroix, Vincent
author_sort Rebollo, Rita
collection DOAJ
description Transposable elements (TEs) are repeated DNA sequences potentially able to move throughout the genome. In addition to their inherent mutagenic effects, TEs can disrupt nearby genes by donating their intrinsic regulatory sequences, for instance, promoting the ectopic expression of a cellular gene. TE transcription is therefore not only necessary for TE transposition per se but can also be associated with TE-gene fusion transcripts, and in some cases, be the product of pervasive transcription. Hence, correctly determining the transcription state of a TE copy is essential to apprehend the impact of the TE in the host genome. Methods to identify and quantify TE transcription have mostly relied on short RNA-seq reads to estimate TE expression at the family level while using specific algorithms to discriminate copy-specific transcription. However, assigning short reads to their correct genomic location, and genomic feature is not trivial. Here we retrieved full-length cDNA  (TeloPrime, Lexogen) of Drosophila melanogaster gonads and sequenced them using Oxford Nanopore Technologies. We show that long-read RNA-seq can be used to identify and quantify transcribed TEs at the copy level. In particular, TE insertions overlapping annotated genes are better estimated using long reads than short reads. Nevertheless, long TE transcripts (> 4.5 kb)  are not well captured. Most expressed TE insertions correspond to copies that have lost their ability to transpose, and within a family, only a few copies are indeed expressed. Long-read sequencing also allowed the identification of spliced transcripts for around 107 TE copies. Overall, this first comparison of TEs between testes and ovaries uncovers differences in their transcriptional landscape, at the subclass and insertion level.
format Article
id doaj-art-c5e46ea8cc4643d29f5655ed672d825f
institution Kabale University
issn 2804-3871
language English
publishDate 2024-09-01
publisher Peer Community In
record_format Article
series Peer Community Journal
spelling doaj-art-c5e46ea8cc4643d29f5655ed672d825f2025-02-07T10:17:17ZengPeer Community InPeer Community Journal2804-38712024-09-01410.24072/pcjournal.45710.24072/pcjournal.457Identification and quantification of transposable element transcripts using Long-Read RNA-seq in Drosophila germline tissues Rebollo, Rita0https://orcid.org/0000-0002-8138-5082Gerenton, Pierre1Cumunel, Eric2Mary, Arnaud3Sabot, François4https://orcid.org/0000-0002-8522-7583Burlet, Nelly5Gillet, Benjamin6Hughes, Sandrine7https://orcid.org/0000-0003-3932-9180S. Oliveira, Daniel8https://orcid.org/0000-0001-5468-6640Goubert, Clément9https://orcid.org/0000-0001-8034-5559Fablet, Marie10Vieira, Cristina11https://orcid.org/0000-0003-3414-3993Lacroix, Vincent12INRAE, INSA Lyon, BF2I, UMR203, Villeurbanne, FranceUniversité Claude Bernard Lyon 1, Laboratoire de Biométrie et Biologie Evolutive, CNRS, UMR5558, Villeurbanne, France.; ERABLE team, Inria, Lyon Rhone-Alpes, Villeurbanne, FranceUniversité Claude Bernard Lyon 1, Laboratoire de Biométrie et Biologie Evolutive, CNRS, UMR5558, Villeurbanne, France.; ERABLE team, Inria, Lyon Rhone-Alpes, Villeurbanne, FranceUniversité Claude Bernard Lyon 1, Laboratoire de Biométrie et Biologie Evolutive, CNRS, UMR5558, Villeurbanne, France.; ERABLE team, Inria, Lyon Rhone-Alpes, Villeurbanne, FranceDIADE unit, Univ Montpellier, Cirad, IRD, Montpellier, FranceUniversité Claude Bernard Lyon 1, Laboratoire de Biométrie et Biologie Evolutive, CNRS, UMR5558, Villeurbanne, France.Institut de Génomique Fonctionnelle de Lyon (IGFL), CNRS UMR 5242 , Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1 , Lyon, France.Institut de Génomique Fonctionnelle de Lyon (IGFL), CNRS UMR 5242 , Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1 , Lyon, France.Université Claude Bernard Lyon 1, Laboratoire de Biométrie et Biologie Evolutive, CNRS, UMR5558, Villeurbanne, France.; São Paulo State University (Unesp), Institute of Biosciences, Humanities and Exact Sciences, São José do Rio Preto, SP, Brazil.Human Genetics, McGill University, Montreal, QC, CanadaUniversité Claude Bernard Lyon 1, Laboratoire de Biométrie et Biologie Evolutive, CNRS, UMR5558, Villeurbanne, France.; Institut Universitaire de France (IUF), Paris, France.Université Claude Bernard Lyon 1, Laboratoire de Biométrie et Biologie Evolutive, CNRS, UMR5558, Villeurbanne, France.; ERABLE team, Inria, Lyon Rhone-Alpes, Villeurbanne, FranceUniversité Claude Bernard Lyon 1, Laboratoire de Biométrie et Biologie Evolutive, CNRS, UMR5558, Villeurbanne, France.; ERABLE team, Inria, Lyon Rhone-Alpes, Villeurbanne, FranceTransposable elements (TEs) are repeated DNA sequences potentially able to move throughout the genome. In addition to their inherent mutagenic effects, TEs can disrupt nearby genes by donating their intrinsic regulatory sequences, for instance, promoting the ectopic expression of a cellular gene. TE transcription is therefore not only necessary for TE transposition per se but can also be associated with TE-gene fusion transcripts, and in some cases, be the product of pervasive transcription. Hence, correctly determining the transcription state of a TE copy is essential to apprehend the impact of the TE in the host genome. Methods to identify and quantify TE transcription have mostly relied on short RNA-seq reads to estimate TE expression at the family level while using specific algorithms to discriminate copy-specific transcription. However, assigning short reads to their correct genomic location, and genomic feature is not trivial. Here we retrieved full-length cDNA  (TeloPrime, Lexogen) of Drosophila melanogaster gonads and sequenced them using Oxford Nanopore Technologies. We show that long-read RNA-seq can be used to identify and quantify transcribed TEs at the copy level. In particular, TE insertions overlapping annotated genes are better estimated using long reads than short reads. Nevertheless, long TE transcripts (> 4.5 kb)  are not well captured. Most expressed TE insertions correspond to copies that have lost their ability to transpose, and within a family, only a few copies are indeed expressed. Long-read sequencing also allowed the identification of spliced transcripts for around 107 TE copies. Overall, this first comparison of TEs between testes and ovaries uncovers differences in their transcriptional landscape, at the subclass and insertion level.https://peercommunityjournal.org/articles/10.24072/pcjournal.457/long-read sequencing, ONT, transposable elements, regulation, RNA-seq, full-length cDNA
spellingShingle Rebollo, Rita
Gerenton, Pierre
Cumunel, Eric
Mary, Arnaud
Sabot, François
Burlet, Nelly
Gillet, Benjamin
Hughes, Sandrine
S. Oliveira, Daniel
Goubert, Clément
Fablet, Marie
Vieira, Cristina
Lacroix, Vincent
Identification and quantification of transposable element transcripts using Long-Read RNA-seq in Drosophila germline tissues
Peer Community Journal
long-read sequencing, ONT, transposable elements, regulation, RNA-seq, full-length cDNA
title Identification and quantification of transposable element transcripts using Long-Read RNA-seq in Drosophila germline tissues
title_full Identification and quantification of transposable element transcripts using Long-Read RNA-seq in Drosophila germline tissues
title_fullStr Identification and quantification of transposable element transcripts using Long-Read RNA-seq in Drosophila germline tissues
title_full_unstemmed Identification and quantification of transposable element transcripts using Long-Read RNA-seq in Drosophila germline tissues
title_short Identification and quantification of transposable element transcripts using Long-Read RNA-seq in Drosophila germline tissues
title_sort identification and quantification of transposable element transcripts using long read rna seq in drosophila germline tissues
topic long-read sequencing, ONT, transposable elements, regulation, RNA-seq, full-length cDNA
url https://peercommunityjournal.org/articles/10.24072/pcjournal.457/
work_keys_str_mv AT rebollorita identificationandquantificationoftransposableelementtranscriptsusinglongreadrnaseqindrosophilagermlinetissues
AT gerentonpierre identificationandquantificationoftransposableelementtranscriptsusinglongreadrnaseqindrosophilagermlinetissues
AT cumuneleric identificationandquantificationoftransposableelementtranscriptsusinglongreadrnaseqindrosophilagermlinetissues
AT maryarnaud identificationandquantificationoftransposableelementtranscriptsusinglongreadrnaseqindrosophilagermlinetissues
AT sabotfrancois identificationandquantificationoftransposableelementtranscriptsusinglongreadrnaseqindrosophilagermlinetissues
AT burletnelly identificationandquantificationoftransposableelementtranscriptsusinglongreadrnaseqindrosophilagermlinetissues
AT gilletbenjamin identificationandquantificationoftransposableelementtranscriptsusinglongreadrnaseqindrosophilagermlinetissues
AT hughessandrine identificationandquantificationoftransposableelementtranscriptsusinglongreadrnaseqindrosophilagermlinetissues
AT soliveiradaniel identificationandquantificationoftransposableelementtranscriptsusinglongreadrnaseqindrosophilagermlinetissues
AT goubertclement identificationandquantificationoftransposableelementtranscriptsusinglongreadrnaseqindrosophilagermlinetissues
AT fabletmarie identificationandquantificationoftransposableelementtranscriptsusinglongreadrnaseqindrosophilagermlinetissues
AT vieiracristina identificationandquantificationoftransposableelementtranscriptsusinglongreadrnaseqindrosophilagermlinetissues
AT lacroixvincent identificationandquantificationoftransposableelementtranscriptsusinglongreadrnaseqindrosophilagermlinetissues