Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum

Abstract Argan (Sideroxylon spinosum L., formerly Argania spinosa) is a tree endemic to Morocco, primarily valued for its seed oil. Growing interest in its biology and in genes linked to oil quality and stress resistance highlights the need for high-quality genome and transcriptome models. We integr...

Full description

Saved in:
Bibliographic Details
Main Authors: Ivan D. Mateus, Abdellatif Essahibi, Pamela Nicholson, Mohamed Hijri, Ahmed Qaddoury, Laurent Falquet, Didier Reinhardt
Format: Article
Language:English
Published: Nature Portfolio 2025-08-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-025-05768-1
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849767141015814144
author Ivan D. Mateus
Abdellatif Essahibi
Pamela Nicholson
Mohamed Hijri
Ahmed Qaddoury
Laurent Falquet
Didier Reinhardt
author_facet Ivan D. Mateus
Abdellatif Essahibi
Pamela Nicholson
Mohamed Hijri
Ahmed Qaddoury
Laurent Falquet
Didier Reinhardt
author_sort Ivan D. Mateus
collection DOAJ
description Abstract Argan (Sideroxylon spinosum L., formerly Argania spinosa) is a tree endemic to Morocco, primarily valued for its seed oil. Growing interest in its biology and in genes linked to oil quality and stress resistance highlights the need for high-quality genome and transcriptome models. We integrated PacBio HiFi long-read and Illumina Hi-C sequencing data to generate independently assembled, phased genome models for both parental haplotypes, measuring 636 Mb and 655 Mb, respectively, with BUSCO completeness scores exceeding 97.8%. Each haplotype consists of 11 fully resolved telomere-to-telomere chromosomes, consistent with chromosome numbers in other Sapotaceae species (n = 10–13), and contains approximately 60% repetitive sequences. Annotation predicted ~28,720 protein-coding genes per haplotype. Comparative analyses with other Sapotaceae genomes indicate overall chromosome conservation within the family, alongside repeat expansion and fusion events in the two largest chromosomes (chr1 and chr2). We also independently assembled the complete chloroplast genome. This high-quality assembly provides a valuable resource for future research on argan biology, genetic diversity, and traits relevant to adaptation and oil biosynthesis.
format Article
id doaj-art-6a254dc274374982ad16632c1542525d
institution DOAJ
issn 2052-4463
language English
publishDate 2025-08-01
publisher Nature Portfolio
record_format Article
series Scientific Data
spelling doaj-art-6a254dc274374982ad16632c1542525d2025-08-20T03:04:18ZengNature PortfolioScientific Data2052-44632025-08-011211810.1038/s41597-025-05768-1Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosumIvan D. Mateus0Abdellatif Essahibi1Pamela Nicholson2Mohamed Hijri3Ahmed Qaddoury4Laurent Falquet5Didier Reinhardt6Department of Biology, University of FribourgDepartment of Biology, University of FribourgNext Generation Sequencing Platform, University of BernAfrican Genome Center, University Mohammed VI Polytechnic (UM6P)Department of Biology, Faculty of Sciences and Techniques, University of Cadi AyyadDepartment of Biology, University of FribourgDepartment of Biology, University of FribourgAbstract Argan (Sideroxylon spinosum L., formerly Argania spinosa) is a tree endemic to Morocco, primarily valued for its seed oil. Growing interest in its biology and in genes linked to oil quality and stress resistance highlights the need for high-quality genome and transcriptome models. We integrated PacBio HiFi long-read and Illumina Hi-C sequencing data to generate independently assembled, phased genome models for both parental haplotypes, measuring 636 Mb and 655 Mb, respectively, with BUSCO completeness scores exceeding 97.8%. Each haplotype consists of 11 fully resolved telomere-to-telomere chromosomes, consistent with chromosome numbers in other Sapotaceae species (n = 10–13), and contains approximately 60% repetitive sequences. Annotation predicted ~28,720 protein-coding genes per haplotype. Comparative analyses with other Sapotaceae genomes indicate overall chromosome conservation within the family, alongside repeat expansion and fusion events in the two largest chromosomes (chr1 and chr2). We also independently assembled the complete chloroplast genome. This high-quality assembly provides a valuable resource for future research on argan biology, genetic diversity, and traits relevant to adaptation and oil biosynthesis.https://doi.org/10.1038/s41597-025-05768-1
spellingShingle Ivan D. Mateus
Abdellatif Essahibi
Pamela Nicholson
Mohamed Hijri
Ahmed Qaddoury
Laurent Falquet
Didier Reinhardt
Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum
Scientific Data
title Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum
title_full Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum
title_fullStr Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum
title_full_unstemmed Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum
title_short Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum
title_sort chromosome level phased genome assembly of the argan tree sideroxylon spinosum
url https://doi.org/10.1038/s41597-025-05768-1
work_keys_str_mv AT ivandmateus chromosomelevelphasedgenomeassemblyoftheargantreesideroxylonspinosum
AT abdellatifessahibi chromosomelevelphasedgenomeassemblyoftheargantreesideroxylonspinosum
AT pamelanicholson chromosomelevelphasedgenomeassemblyoftheargantreesideroxylonspinosum
AT mohamedhijri chromosomelevelphasedgenomeassemblyoftheargantreesideroxylonspinosum
AT ahmedqaddoury chromosomelevelphasedgenomeassemblyoftheargantreesideroxylonspinosum
AT laurentfalquet chromosomelevelphasedgenomeassemblyoftheargantreesideroxylonspinosum
AT didierreinhardt chromosomelevelphasedgenomeassemblyoftheargantreesideroxylonspinosum