Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum
Abstract Argan (Sideroxylon spinosum L., formerly Argania spinosa) is a tree endemic to Morocco, primarily valued for its seed oil. Growing interest in its biology and in genes linked to oil quality and stress resistance highlights the need for high-quality genome and transcriptome models. We integr...
Saved in:
| Main Authors: | , , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-08-01
|
| Series: | Scientific Data |
| Online Access: | https://doi.org/10.1038/s41597-025-05768-1 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849767141015814144 |
|---|---|
| author | Ivan D. Mateus Abdellatif Essahibi Pamela Nicholson Mohamed Hijri Ahmed Qaddoury Laurent Falquet Didier Reinhardt |
| author_facet | Ivan D. Mateus Abdellatif Essahibi Pamela Nicholson Mohamed Hijri Ahmed Qaddoury Laurent Falquet Didier Reinhardt |
| author_sort | Ivan D. Mateus |
| collection | DOAJ |
| description | Abstract Argan (Sideroxylon spinosum L., formerly Argania spinosa) is a tree endemic to Morocco, primarily valued for its seed oil. Growing interest in its biology and in genes linked to oil quality and stress resistance highlights the need for high-quality genome and transcriptome models. We integrated PacBio HiFi long-read and Illumina Hi-C sequencing data to generate independently assembled, phased genome models for both parental haplotypes, measuring 636 Mb and 655 Mb, respectively, with BUSCO completeness scores exceeding 97.8%. Each haplotype consists of 11 fully resolved telomere-to-telomere chromosomes, consistent with chromosome numbers in other Sapotaceae species (n = 10–13), and contains approximately 60% repetitive sequences. Annotation predicted ~28,720 protein-coding genes per haplotype. Comparative analyses with other Sapotaceae genomes indicate overall chromosome conservation within the family, alongside repeat expansion and fusion events in the two largest chromosomes (chr1 and chr2). We also independently assembled the complete chloroplast genome. This high-quality assembly provides a valuable resource for future research on argan biology, genetic diversity, and traits relevant to adaptation and oil biosynthesis. |
| format | Article |
| id | doaj-art-6a254dc274374982ad16632c1542525d |
| institution | DOAJ |
| issn | 2052-4463 |
| language | English |
| publishDate | 2025-08-01 |
| publisher | Nature Portfolio |
| record_format | Article |
| series | Scientific Data |
| spelling | doaj-art-6a254dc274374982ad16632c1542525d2025-08-20T03:04:18ZengNature PortfolioScientific Data2052-44632025-08-011211810.1038/s41597-025-05768-1Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosumIvan D. Mateus0Abdellatif Essahibi1Pamela Nicholson2Mohamed Hijri3Ahmed Qaddoury4Laurent Falquet5Didier Reinhardt6Department of Biology, University of FribourgDepartment of Biology, University of FribourgNext Generation Sequencing Platform, University of BernAfrican Genome Center, University Mohammed VI Polytechnic (UM6P)Department of Biology, Faculty of Sciences and Techniques, University of Cadi AyyadDepartment of Biology, University of FribourgDepartment of Biology, University of FribourgAbstract Argan (Sideroxylon spinosum L., formerly Argania spinosa) is a tree endemic to Morocco, primarily valued for its seed oil. Growing interest in its biology and in genes linked to oil quality and stress resistance highlights the need for high-quality genome and transcriptome models. We integrated PacBio HiFi long-read and Illumina Hi-C sequencing data to generate independently assembled, phased genome models for both parental haplotypes, measuring 636 Mb and 655 Mb, respectively, with BUSCO completeness scores exceeding 97.8%. Each haplotype consists of 11 fully resolved telomere-to-telomere chromosomes, consistent with chromosome numbers in other Sapotaceae species (n = 10–13), and contains approximately 60% repetitive sequences. Annotation predicted ~28,720 protein-coding genes per haplotype. Comparative analyses with other Sapotaceae genomes indicate overall chromosome conservation within the family, alongside repeat expansion and fusion events in the two largest chromosomes (chr1 and chr2). We also independently assembled the complete chloroplast genome. This high-quality assembly provides a valuable resource for future research on argan biology, genetic diversity, and traits relevant to adaptation and oil biosynthesis.https://doi.org/10.1038/s41597-025-05768-1 |
| spellingShingle | Ivan D. Mateus Abdellatif Essahibi Pamela Nicholson Mohamed Hijri Ahmed Qaddoury Laurent Falquet Didier Reinhardt Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum Scientific Data |
| title | Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum |
| title_full | Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum |
| title_fullStr | Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum |
| title_full_unstemmed | Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum |
| title_short | Chromosome-level phased genome assembly of the argan tree Sideroxylon spinosum |
| title_sort | chromosome level phased genome assembly of the argan tree sideroxylon spinosum |
| url | https://doi.org/10.1038/s41597-025-05768-1 |
| work_keys_str_mv | AT ivandmateus chromosomelevelphasedgenomeassemblyoftheargantreesideroxylonspinosum AT abdellatifessahibi chromosomelevelphasedgenomeassemblyoftheargantreesideroxylonspinosum AT pamelanicholson chromosomelevelphasedgenomeassemblyoftheargantreesideroxylonspinosum AT mohamedhijri chromosomelevelphasedgenomeassemblyoftheargantreesideroxylonspinosum AT ahmedqaddoury chromosomelevelphasedgenomeassemblyoftheargantreesideroxylonspinosum AT laurentfalquet chromosomelevelphasedgenomeassemblyoftheargantreesideroxylonspinosum AT didierreinhardt chromosomelevelphasedgenomeassemblyoftheargantreesideroxylonspinosum |