A BAC-guided haplotype assembly pipeline increases the resolution of the virus resistance locus CMD2 in cassava

Abstract Background Cassava is an important crop for food security in the tropics where its production is jeopardized by several viral diseases, including the cassava mosaic disease (CMD) which is endemic in Sub-Saharan Africa and the Indian subcontinent. Resistance to CMD is linked to a single domi...

Full description

Saved in:
Bibliographic Details
Main Authors: Luc Cornet, Syed Shan-e-Ali Zaidi, Jia Li, Yvan Ngapout, Sara Shakir, Loic Meunier, Caroline Callot, William Marande, Marc Hanikenne, Stephane Rombauts, Yves Van de Peer, Hervé Vanderschuren
Format: Article
Language:English
Published: BMC 2025-06-01
Series:Genome Biology
Online Access:https://doi.org/10.1186/s13059-025-03620-8
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849432596515127296
author Luc Cornet
Syed Shan-e-Ali Zaidi
Jia Li
Yvan Ngapout
Sara Shakir
Loic Meunier
Caroline Callot
William Marande
Marc Hanikenne
Stephane Rombauts
Yves Van de Peer
Hervé Vanderschuren
author_facet Luc Cornet
Syed Shan-e-Ali Zaidi
Jia Li
Yvan Ngapout
Sara Shakir
Loic Meunier
Caroline Callot
William Marande
Marc Hanikenne
Stephane Rombauts
Yves Van de Peer
Hervé Vanderschuren
author_sort Luc Cornet
collection DOAJ
description Abstract Background Cassava is an important crop for food security in the tropics where its production is jeopardized by several viral diseases, including the cassava mosaic disease (CMD) which is endemic in Sub-Saharan Africa and the Indian subcontinent. Resistance to CMD is linked to a single dominant locus, namely CMD2. The cassava genome contains highly repetitive regions making the accurate assembly of a reference genome challenging. Results In the present study, we generate BAC libraries of the CMD-susceptible cassava cultivar (cv.) 60444 and the CMD-resistant landrace TME3. We subsequently identify and sequence BACs belonging to the CMD2 region in both cultivars using high-accuracy long-read PacBio circular consensus sequencing (ccs) reads. We then sequence and assemble the complete genomes of cv. 60444 and TME3 using a combination of ONT ultra-long reads and optical mapping. Anchoring the assemblies on cassava genetic maps reveals discrepancies in our, as well as in previously released, CMD2 regions of the cv. 60444 and TME3 genomes. A BAC-guided approach to assess cassava genome assemblies significantly improves the synteny between the assembled CMD2 regions of cv. 60444 and TME3 and the CMD2 genetic maps. We then performed repeat-unmasked gene annotation on CMD2 assemblies and identify 81 stress resistance proteins present in the CMD2 region, among which 31 were previously not reported in publicly available CMD2 sequences. Conclusions The BAC-assessed approach improved CMD2 region accuracy and revealed new sequences linked to virus resistance, advancing our understanding of cassava mosaic disease resistance.
format Article
id doaj-art-def930c1ed0e49a7a6b15f375e8667d2
institution Kabale University
issn 1474-760X
language English
publishDate 2025-06-01
publisher BMC
record_format Article
series Genome Biology
spelling doaj-art-def930c1ed0e49a7a6b15f375e8667d22025-08-20T03:27:18ZengBMCGenome Biology1474-760X2025-06-0126111510.1186/s13059-025-03620-8A BAC-guided haplotype assembly pipeline increases the resolution of the virus resistance locus CMD2 in cassavaLuc Cornet0Syed Shan-e-Ali Zaidi1Jia Li2Yvan Ngapout3Sara Shakir4Loic Meunier5Caroline Callot6William Marande7Marc Hanikenne8Stephane Rombauts9Yves Van de Peer10Hervé Vanderschuren11Plant Genetics and Rhizosphere Processes Laboratory, TERRA Teaching and Research Center, Gembloux Agro-Bio Tech, University of LiègePlant Genetics and Rhizosphere Processes Laboratory, TERRA Teaching and Research Center, Gembloux Agro-Bio Tech, University of LiègeDepartment of Plant Biotechnology and Bioinformatics, Ghent University and VIB Center for Plant Systems BiologyLaboratory of Tropical Crop Improvement, Division of Crop Biotechnics, Biosystems Department, KU LeuvenPlant Genetics and Rhizosphere Processes Laboratory, TERRA Teaching and Research Center, Gembloux Agro-Bio Tech, University of LiègeInBioS, PhytoSYSTEMS, Eukaryotic Phylogenomics, University of LiègeCNRGV, Centre National de Ressources Génomiques VégétalesCNRGV, Centre National de Ressources Génomiques VégétalesInBioS, PhytoSYSTEMS, Translational Plant Biology, University of LiègeDepartment of Plant Biotechnology and Bioinformatics, Ghent University and VIB Center for Plant Systems BiologyDepartment of Plant Biotechnology and Bioinformatics, Ghent University and VIB Center for Plant Systems BiologyPlant Genetics and Rhizosphere Processes Laboratory, TERRA Teaching and Research Center, Gembloux Agro-Bio Tech, University of LiègeAbstract Background Cassava is an important crop for food security in the tropics where its production is jeopardized by several viral diseases, including the cassava mosaic disease (CMD) which is endemic in Sub-Saharan Africa and the Indian subcontinent. Resistance to CMD is linked to a single dominant locus, namely CMD2. The cassava genome contains highly repetitive regions making the accurate assembly of a reference genome challenging. Results In the present study, we generate BAC libraries of the CMD-susceptible cassava cultivar (cv.) 60444 and the CMD-resistant landrace TME3. We subsequently identify and sequence BACs belonging to the CMD2 region in both cultivars using high-accuracy long-read PacBio circular consensus sequencing (ccs) reads. We then sequence and assemble the complete genomes of cv. 60444 and TME3 using a combination of ONT ultra-long reads and optical mapping. Anchoring the assemblies on cassava genetic maps reveals discrepancies in our, as well as in previously released, CMD2 regions of the cv. 60444 and TME3 genomes. A BAC-guided approach to assess cassava genome assemblies significantly improves the synteny between the assembled CMD2 regions of cv. 60444 and TME3 and the CMD2 genetic maps. We then performed repeat-unmasked gene annotation on CMD2 assemblies and identify 81 stress resistance proteins present in the CMD2 region, among which 31 were previously not reported in publicly available CMD2 sequences. Conclusions The BAC-assessed approach improved CMD2 region accuracy and revealed new sequences linked to virus resistance, advancing our understanding of cassava mosaic disease resistance.https://doi.org/10.1186/s13059-025-03620-8
spellingShingle Luc Cornet
Syed Shan-e-Ali Zaidi
Jia Li
Yvan Ngapout
Sara Shakir
Loic Meunier
Caroline Callot
William Marande
Marc Hanikenne
Stephane Rombauts
Yves Van de Peer
Hervé Vanderschuren
A BAC-guided haplotype assembly pipeline increases the resolution of the virus resistance locus CMD2 in cassava
Genome Biology
title A BAC-guided haplotype assembly pipeline increases the resolution of the virus resistance locus CMD2 in cassava
title_full A BAC-guided haplotype assembly pipeline increases the resolution of the virus resistance locus CMD2 in cassava
title_fullStr A BAC-guided haplotype assembly pipeline increases the resolution of the virus resistance locus CMD2 in cassava
title_full_unstemmed A BAC-guided haplotype assembly pipeline increases the resolution of the virus resistance locus CMD2 in cassava
title_short A BAC-guided haplotype assembly pipeline increases the resolution of the virus resistance locus CMD2 in cassava
title_sort bac guided haplotype assembly pipeline increases the resolution of the virus resistance locus cmd2 in cassava
url https://doi.org/10.1186/s13059-025-03620-8
work_keys_str_mv AT luccornet abacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT syedshanealizaidi abacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT jiali abacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT yvanngapout abacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT sarashakir abacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT loicmeunier abacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT carolinecallot abacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT williammarande abacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT marchanikenne abacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT stephanerombauts abacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT yvesvandepeer abacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT hervevanderschuren abacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT luccornet bacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT syedshanealizaidi bacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT jiali bacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT yvanngapout bacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT sarashakir bacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT loicmeunier bacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT carolinecallot bacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT williammarande bacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT marchanikenne bacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT stephanerombauts bacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT yvesvandepeer bacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava
AT hervevanderschuren bacguidedhaplotypeassemblypipelineincreasestheresolutionofthevirusresistancelocuscmd2incassava