Comparative analysis of RAD-seq methods for SNP discovery and genetic diversity assessment in oil seed crop safflower

Abstract Safflower (Carthamus tinctorius L.) is an important oilseed crop with diverse uses and the potential for genetic improvement. This study aimed to optimize genotyping-by-sequencing (GBS) for safflower via in silico and in vitro methods with two restriction site-associated DNA sequencing (RAD...

Full description

Saved in:
Bibliographic Details
Main Authors: Pooja Pathania, Gaddam Prasanna Kumar, Nishu Gupta, R. Parimalan, J. Radhamani, Rajesh Kumar, Sunil Shriram Gomashe, Palchamy Kadirvel, S. Rajkumar
Format: Article
Language:English
Published: Nature Portfolio 2025-07-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-06706-2
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849769448006746112
author Pooja Pathania
Gaddam Prasanna Kumar
Nishu Gupta
R. Parimalan
J. Radhamani
Rajesh Kumar
Sunil Shriram Gomashe
Palchamy Kadirvel
S. Rajkumar
author_facet Pooja Pathania
Gaddam Prasanna Kumar
Nishu Gupta
R. Parimalan
J. Radhamani
Rajesh Kumar
Sunil Shriram Gomashe
Palchamy Kadirvel
S. Rajkumar
author_sort Pooja Pathania
collection DOAJ
description Abstract Safflower (Carthamus tinctorius L.) is an important oilseed crop with diverse uses and the potential for genetic improvement. This study aimed to optimize genotyping-by-sequencing (GBS) for safflower via in silico and in vitro methods with two restriction site-associated DNA sequencing (RAD-seq) approaches, i.e., single restriction site-associated DNA sequencing (sdRAD-seq) and double-digest RAD sequencing (ddRAD-seq) and three restriction enzyme combinations (ApeKI, NlaIII_Msel, and EcoRI_Msel). Forty-two safflower accessions were selected for this study. In silico testing revealed that NlaIII_Msel generated the largest number of DNA fragments, followed by ApeKI and EcoRI_Msel. The in vitro results showed that ddRAD-seq outperformed sdRAD-seq in terms of raw read count, alignment rate, depth and breadth of coverage, and SNP detection. An alignment-free analysis using k-mer counting and sketching based on genetic distance further confirmed the superiority of ddRAD-seq. Gene-level k-mer validation identified more core genes in the ddRAD-seq data. Variant calling resulted in 6,721, 173,212, and 221,805 single nucleotide polymorphic sites (SNPs) for ApeKI, NlaIII_Msel, and EcoRI_Msel, respectively. SNP annotation and distribution analysis revealed that EcoRI_Msel captured more SNPs with fewer missing observations. Principal component analysis via ddRAD-seq data explained 30.29% and 33.98% of the total genetic variation in NlaIII_Msel and EcoRI_Msel, respectively. This study demonstrated that ddRAD-seq with the EcoRI_Msel enzyme combination is the most suitable GBS approach for genome sampling and SNP genotyping in safflower.
format Article
id doaj-art-c24d016676d24ca99b13c7fc21eafbcb
institution DOAJ
issn 2045-2322
language English
publishDate 2025-07-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-c24d016676d24ca99b13c7fc21eafbcb2025-08-20T03:03:24ZengNature PortfolioScientific Reports2045-23222025-07-0115111010.1038/s41598-025-06706-2Comparative analysis of RAD-seq methods for SNP discovery and genetic diversity assessment in oil seed crop safflowerPooja Pathania0Gaddam Prasanna Kumar1Nishu Gupta2R. Parimalan3J. Radhamani4Rajesh Kumar5Sunil Shriram Gomashe6Palchamy Kadirvel7S. Rajkumar8ICAR- National Bureau of Plant Genetic ResourcesICAR- National Bureau of Plant Genetic ResourcesICAR- National Bureau of Plant Genetic ResourcesICAR- National Bureau of Plant Genetic ResourcesICAR- National Bureau of Plant Genetic ResourcesICAR- National Bureau of Plant Genetic ResourcesICAR- National Bureau of Plant Genetic Resources, RS-AkolaICAR-Indian Institute of Oilseeds ResearchICAR- National Bureau of Plant Genetic ResourcesAbstract Safflower (Carthamus tinctorius L.) is an important oilseed crop with diverse uses and the potential for genetic improvement. This study aimed to optimize genotyping-by-sequencing (GBS) for safflower via in silico and in vitro methods with two restriction site-associated DNA sequencing (RAD-seq) approaches, i.e., single restriction site-associated DNA sequencing (sdRAD-seq) and double-digest RAD sequencing (ddRAD-seq) and three restriction enzyme combinations (ApeKI, NlaIII_Msel, and EcoRI_Msel). Forty-two safflower accessions were selected for this study. In silico testing revealed that NlaIII_Msel generated the largest number of DNA fragments, followed by ApeKI and EcoRI_Msel. The in vitro results showed that ddRAD-seq outperformed sdRAD-seq in terms of raw read count, alignment rate, depth and breadth of coverage, and SNP detection. An alignment-free analysis using k-mer counting and sketching based on genetic distance further confirmed the superiority of ddRAD-seq. Gene-level k-mer validation identified more core genes in the ddRAD-seq data. Variant calling resulted in 6,721, 173,212, and 221,805 single nucleotide polymorphic sites (SNPs) for ApeKI, NlaIII_Msel, and EcoRI_Msel, respectively. SNP annotation and distribution analysis revealed that EcoRI_Msel captured more SNPs with fewer missing observations. Principal component analysis via ddRAD-seq data explained 30.29% and 33.98% of the total genetic variation in NlaIII_Msel and EcoRI_Msel, respectively. This study demonstrated that ddRAD-seq with the EcoRI_Msel enzyme combination is the most suitable GBS approach for genome sampling and SNP genotyping in safflower.https://doi.org/10.1038/s41598-025-06706-2Carthamus tinctoriusGenetic diversityGenome samplingRAD-seqRestriction enzymesSNP discovery
spellingShingle Pooja Pathania
Gaddam Prasanna Kumar
Nishu Gupta
R. Parimalan
J. Radhamani
Rajesh Kumar
Sunil Shriram Gomashe
Palchamy Kadirvel
S. Rajkumar
Comparative analysis of RAD-seq methods for SNP discovery and genetic diversity assessment in oil seed crop safflower
Scientific Reports
Carthamus tinctorius
Genetic diversity
Genome sampling
RAD-seq
Restriction enzymes
SNP discovery
title Comparative analysis of RAD-seq methods for SNP discovery and genetic diversity assessment in oil seed crop safflower
title_full Comparative analysis of RAD-seq methods for SNP discovery and genetic diversity assessment in oil seed crop safflower
title_fullStr Comparative analysis of RAD-seq methods for SNP discovery and genetic diversity assessment in oil seed crop safflower
title_full_unstemmed Comparative analysis of RAD-seq methods for SNP discovery and genetic diversity assessment in oil seed crop safflower
title_short Comparative analysis of RAD-seq methods for SNP discovery and genetic diversity assessment in oil seed crop safflower
title_sort comparative analysis of rad seq methods for snp discovery and genetic diversity assessment in oil seed crop safflower
topic Carthamus tinctorius
Genetic diversity
Genome sampling
RAD-seq
Restriction enzymes
SNP discovery
url https://doi.org/10.1038/s41598-025-06706-2
work_keys_str_mv AT poojapathania comparativeanalysisofradseqmethodsforsnpdiscoveryandgeneticdiversityassessmentinoilseedcropsafflower
AT gaddamprasannakumar comparativeanalysisofradseqmethodsforsnpdiscoveryandgeneticdiversityassessmentinoilseedcropsafflower
AT nishugupta comparativeanalysisofradseqmethodsforsnpdiscoveryandgeneticdiversityassessmentinoilseedcropsafflower
AT rparimalan comparativeanalysisofradseqmethodsforsnpdiscoveryandgeneticdiversityassessmentinoilseedcropsafflower
AT jradhamani comparativeanalysisofradseqmethodsforsnpdiscoveryandgeneticdiversityassessmentinoilseedcropsafflower
AT rajeshkumar comparativeanalysisofradseqmethodsforsnpdiscoveryandgeneticdiversityassessmentinoilseedcropsafflower
AT sunilshriramgomashe comparativeanalysisofradseqmethodsforsnpdiscoveryandgeneticdiversityassessmentinoilseedcropsafflower
AT palchamykadirvel comparativeanalysisofradseqmethodsforsnpdiscoveryandgeneticdiversityassessmentinoilseedcropsafflower
AT srajkumar comparativeanalysisofradseqmethodsforsnpdiscoveryandgeneticdiversityassessmentinoilseedcropsafflower