Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum

Abstract Argan (Sideroxylon spinosum L., formerly Argania spinosa) is a tree endemic to Morocco, primarily valued for its seed oil. Growing interest in its biology and in genes linked to oil quality and stress resistance highlights the need for high-quality genome and transcriptome models. We integr...

Full description

Saved in:
Bibliographic Details
Main Authors: Ivan D. Mateus, Abdellatif Essahibi, Pamela Nicholson, Mohamed Hijri, Ahmed Qaddoury, Laurent Falquet, Didier Reinhardt
Format: Article
Language:English
Published: Nature Portfolio 2025-08-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-025-05768-1
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Argan (Sideroxylon spinosum L., formerly Argania spinosa) is a tree endemic to Morocco, primarily valued for its seed oil. Growing interest in its biology and in genes linked to oil quality and stress resistance highlights the need for high-quality genome and transcriptome models. We integrated PacBio HiFi long-read and Illumina Hi-C sequencing data to generate independently assembled, phased genome models for both parental haplotypes, measuring 636 Mb and 655 Mb, respectively, with BUSCO completeness scores exceeding 97.8%. Each haplotype consists of 11 fully resolved telomere-to-telomere chromosomes, consistent with chromosome numbers in other Sapotaceae species (n = 10–13), and contains approximately 60% repetitive sequences. Annotation predicted ~28,720 protein-coding genes per haplotype. Comparative analyses with other Sapotaceae genomes indicate overall chromosome conservation within the family, alongside repeat expansion and fusion events in the two largest chromosomes (chr1 and chr2). We also independently assembled the complete chloroplast genome. This high-quality assembly provides a valuable resource for future research on argan biology, genetic diversity, and traits relevant to adaptation and oil biosynthesis.
ISSN:2052-4463