Comparative study of tools for copy number variation detection using next-generation sequencing data

Abstract Copy number variation (CNV) plays an important role in disease susceptibility as a type of intermediate-scale structural variation (SV). Accurate CNV detection is crucial for understanding human genetic diversity, elucidating disease mechanisms, and advancing cancer genomics. A variety of C...

Full description

Saved in:
Bibliographic Details
Main Authors: Ruchao Du, Jinxin Dong, Hua Jiang, Minyong Qi, Zuyao Zhao
Format: Article
Language:English
Published: Nature Portfolio 2025-07-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-06527-3
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849402645030109184
author Ruchao Du
Jinxin Dong
Hua Jiang
Minyong Qi
Zuyao Zhao
author_facet Ruchao Du
Jinxin Dong
Hua Jiang
Minyong Qi
Zuyao Zhao
author_sort Ruchao Du
collection DOAJ
description Abstract Copy number variation (CNV) plays an important role in disease susceptibility as a type of intermediate-scale structural variation (SV). Accurate CNV detection is crucial for understanding human genetic diversity, elucidating disease mechanisms, and advancing cancer genomics. A variety of CNV detection tools based on short sequencing reads from next-generation sequencing (NGS) have been developed. Although many researchers have conducted extensive comparisons of the detection performance of various tools, these studies have not fully considered the comprehensive impact of factors such as variant length, sequencing depth, tumor purity, and CNV types on tools performance. Therefore, we selected 12 widely used and representative detection tools to comprehensively compare their performance on both simulated and real data. For the simulated data, we compared their performance across six variant types under 36 configurations, including three variant lengths, four sequencing depths, and three tumor purities. For the real data, we used the overlapping density score (ODS) to evaluate the performance of the 12 detection tools. Additionally, we compared their time and space complexities. In this study, we analyzed the impact of each configuration on the tools and recommended the most suitable detection tools for each scenario. This study provides important guidance for researchers in selecting the appropriate variant detection tools for complex situations.
format Article
id doaj-art-5038c343d2ae4d6bb2a74a742e6f6ef5
institution Kabale University
issn 2045-2322
language English
publishDate 2025-07-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-5038c343d2ae4d6bb2a74a742e6f6ef52025-08-20T03:37:30ZengNature PortfolioScientific Reports2045-23222025-07-0115111710.1038/s41598-025-06527-3Comparative study of tools for copy number variation detection using next-generation sequencing dataRuchao Du0Jinxin Dong1Hua Jiang2Minyong Qi3Zuyao Zhao4School of Computer Science and Technology, Liaocheng UniversitySchool of Computer Science and Technology, Liaocheng UniversitySchool of Computer Science and Technology, Liaocheng UniversitySchool of Computer Science and Technology, Liaocheng UniversityOrthopedics Department, Liaocheng People’s HospitalAbstract Copy number variation (CNV) plays an important role in disease susceptibility as a type of intermediate-scale structural variation (SV). Accurate CNV detection is crucial for understanding human genetic diversity, elucidating disease mechanisms, and advancing cancer genomics. A variety of CNV detection tools based on short sequencing reads from next-generation sequencing (NGS) have been developed. Although many researchers have conducted extensive comparisons of the detection performance of various tools, these studies have not fully considered the comprehensive impact of factors such as variant length, sequencing depth, tumor purity, and CNV types on tools performance. Therefore, we selected 12 widely used and representative detection tools to comprehensively compare their performance on both simulated and real data. For the simulated data, we compared their performance across six variant types under 36 configurations, including three variant lengths, four sequencing depths, and three tumor purities. For the real data, we used the overlapping density score (ODS) to evaluate the performance of the 12 detection tools. Additionally, we compared their time and space complexities. In this study, we analyzed the impact of each configuration on the tools and recommended the most suitable detection tools for each scenario. This study provides important guidance for researchers in selecting the appropriate variant detection tools for complex situations.https://doi.org/10.1038/s41598-025-06527-3Copy number variationNext-generation sequencingVariant lengthSequencing depthTumor purityRecommendation
spellingShingle Ruchao Du
Jinxin Dong
Hua Jiang
Minyong Qi
Zuyao Zhao
Comparative study of tools for copy number variation detection using next-generation sequencing data
Scientific Reports
Copy number variation
Next-generation sequencing
Variant length
Sequencing depth
Tumor purity
Recommendation
title Comparative study of tools for copy number variation detection using next-generation sequencing data
title_full Comparative study of tools for copy number variation detection using next-generation sequencing data
title_fullStr Comparative study of tools for copy number variation detection using next-generation sequencing data
title_full_unstemmed Comparative study of tools for copy number variation detection using next-generation sequencing data
title_short Comparative study of tools for copy number variation detection using next-generation sequencing data
title_sort comparative study of tools for copy number variation detection using next generation sequencing data
topic Copy number variation
Next-generation sequencing
Variant length
Sequencing depth
Tumor purity
Recommendation
url https://doi.org/10.1038/s41598-025-06527-3
work_keys_str_mv AT ruchaodu comparativestudyoftoolsforcopynumbervariationdetectionusingnextgenerationsequencingdata
AT jinxindong comparativestudyoftoolsforcopynumbervariationdetectionusingnextgenerationsequencingdata
AT huajiang comparativestudyoftoolsforcopynumbervariationdetectionusingnextgenerationsequencingdata
AT minyongqi comparativestudyoftoolsforcopynumbervariationdetectionusingnextgenerationsequencingdata
AT zuyaozhao comparativestudyoftoolsforcopynumbervariationdetectionusingnextgenerationsequencingdata