Genotyping from targeted NGS data based on a small set of SNPs correctly matches patient samples

Abstract Objective Mislabelling and swapping of laboratory samples are handling errors that can lead to erroneous interpretation of data and/or patient harm. Sequenced samples can be traced back to the respective donors by matching of single nucleotide polymorphisms (SNPs). Frameworks and software t...

Full description

Saved in:
Bibliographic Details
Main Authors: Deyan Yordanov Yosifov, Christof Schneider, Stephan Stilgenbauer, Daniel Mertens, Eugen Tausch
Format: Article
Language:English
Published: BMC 2025-07-01
Series:BMC Research Notes
Subjects:
Online Access:https://doi.org/10.1186/s13104-025-07348-3
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849335105605074944
author Deyan Yordanov Yosifov
Christof Schneider
Stephan Stilgenbauer
Daniel Mertens
Eugen Tausch
author_facet Deyan Yordanov Yosifov
Christof Schneider
Stephan Stilgenbauer
Daniel Mertens
Eugen Tausch
author_sort Deyan Yordanov Yosifov
collection DOAJ
description Abstract Objective Mislabelling and swapping of laboratory samples are handling errors that can lead to erroneous interpretation of data and/or patient harm. Sequenced samples can be traced back to the respective donors by matching of single nucleotide polymorphisms (SNPs). Frameworks and software to do this have been developed for use with whole genome/exome sequencing data but not for targeted next-generation sequencing (tNGS), possibly due to the limited genomic coverage with tNGS and the need for individualization of the set of interrogated SNPs. We decided to adapt a popular tool for use with tNGS data, to demonstrate the possibility of selecting informative SNPs from a typical tNGS panel and to create an automated workflow for detection of sample handling errors. Results We compiled a custom list of 28 SNPs and with its help we demonstrated the practicability of using only tNGS data to cost-effectively detect mislabelled samples. In two cohorts of totally 1441 patients with sequential samples, we could identify 3 sample swaps, 7 mislabelled samples (3 externally and 4 internally) and 1 mistake of unknown origin. We provide an R function for automated detection of sample swaps and mislabelling to the community as a free and open-source tool.
format Article
id doaj-art-859c93fe082e477eb97c77af35985d89
institution Kabale University
issn 1756-0500
language English
publishDate 2025-07-01
publisher BMC
record_format Article
series BMC Research Notes
spelling doaj-art-859c93fe082e477eb97c77af35985d892025-08-20T03:45:24ZengBMCBMC Research Notes1756-05002025-07-011811710.1186/s13104-025-07348-3Genotyping from targeted NGS data based on a small set of SNPs correctly matches patient samplesDeyan Yordanov Yosifov0Christof Schneider1Stephan Stilgenbauer2Daniel Mertens3Eugen Tausch4Division of CLL, Department of Internal Medicine III, Ulm University HospitalDivision of CLL, Department of Internal Medicine III, Ulm University HospitalDivision of CLL, Department of Internal Medicine III, Ulm University HospitalDivision of CLL, Department of Internal Medicine III, Ulm University HospitalDivision of CLL, Department of Internal Medicine III, Ulm University HospitalAbstract Objective Mislabelling and swapping of laboratory samples are handling errors that can lead to erroneous interpretation of data and/or patient harm. Sequenced samples can be traced back to the respective donors by matching of single nucleotide polymorphisms (SNPs). Frameworks and software to do this have been developed for use with whole genome/exome sequencing data but not for targeted next-generation sequencing (tNGS), possibly due to the limited genomic coverage with tNGS and the need for individualization of the set of interrogated SNPs. We decided to adapt a popular tool for use with tNGS data, to demonstrate the possibility of selecting informative SNPs from a typical tNGS panel and to create an automated workflow for detection of sample handling errors. Results We compiled a custom list of 28 SNPs and with its help we demonstrated the practicability of using only tNGS data to cost-effectively detect mislabelled samples. In two cohorts of totally 1441 patients with sequential samples, we could identify 3 sample swaps, 7 mislabelled samples (3 externally and 4 internally) and 1 mistake of unknown origin. We provide an R function for automated detection of sample swaps and mislabelling to the community as a free and open-source tool.https://doi.org/10.1186/s13104-025-07348-3Sample misidentificationSample mislabellingSample swapsSingle nucleotide polymorphisms (SNPs)GenotypingTargeted NGS
spellingShingle Deyan Yordanov Yosifov
Christof Schneider
Stephan Stilgenbauer
Daniel Mertens
Eugen Tausch
Genotyping from targeted NGS data based on a small set of SNPs correctly matches patient samples
BMC Research Notes
Sample misidentification
Sample mislabelling
Sample swaps
Single nucleotide polymorphisms (SNPs)
Genotyping
Targeted NGS
title Genotyping from targeted NGS data based on a small set of SNPs correctly matches patient samples
title_full Genotyping from targeted NGS data based on a small set of SNPs correctly matches patient samples
title_fullStr Genotyping from targeted NGS data based on a small set of SNPs correctly matches patient samples
title_full_unstemmed Genotyping from targeted NGS data based on a small set of SNPs correctly matches patient samples
title_short Genotyping from targeted NGS data based on a small set of SNPs correctly matches patient samples
title_sort genotyping from targeted ngs data based on a small set of snps correctly matches patient samples
topic Sample misidentification
Sample mislabelling
Sample swaps
Single nucleotide polymorphisms (SNPs)
Genotyping
Targeted NGS
url https://doi.org/10.1186/s13104-025-07348-3
work_keys_str_mv AT deyanyordanovyosifov genotypingfromtargetedngsdatabasedonasmallsetofsnpscorrectlymatchespatientsamples
AT christofschneider genotypingfromtargetedngsdatabasedonasmallsetofsnpscorrectlymatchespatientsamples
AT stephanstilgenbauer genotypingfromtargetedngsdatabasedonasmallsetofsnpscorrectlymatchespatientsamples
AT danielmertens genotypingfromtargetedngsdatabasedonasmallsetofsnpscorrectlymatchespatientsamples
AT eugentausch genotypingfromtargetedngsdatabasedonasmallsetofsnpscorrectlymatchespatientsamples