Validation of Core and Whole-Genome Multi-Locus Sequence Typing Schemes for Shiga-Toxin-Producing <i>E. coli</i> (STEC) Outbreak Detection in a National Surveillance Network, PulseNet 2.0, USA

Shiga-toxin-producing <i>E. coli</i> (STEC) is a leading causing of bacterial foodborne and zoonotic illnesses in the USA. Whole-genome sequencing (WGS) is a powerful tool used in public health and microbiology for the detection, surveillance, and outbreak investigation of STEC. In this...

Full description

Saved in:
Bibliographic Details
Main Authors: Molly M. Leeper, Morgan N. Schroeder, Taylor Griswold, Mohit Thakur, Krittika Krishnan, Lee S. Katz, Kelley B. Hise, Grant M. Williams, Steven G. Stroika, Sung B. Im, Rebecca L. Lindsey, Peyton A. Smith, Jasmine Huffman, Alyssa Kelley, Sara Cleland, Alan J. Collins, Shruti Gautam, Eishita Tyagi, Subin Park, João A. Carriço, Miguel P. Machado, Hannes Pouseele, Dolf Michielsen, Heather A. Carleton
Format: Article
Language:English
Published: MDPI AG 2025-06-01
Series:Microorganisms
Subjects:
Online Access:https://www.mdpi.com/2076-2607/13/6/1310
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Shiga-toxin-producing <i>E. coli</i> (STEC) is a leading causing of bacterial foodborne and zoonotic illnesses in the USA. Whole-genome sequencing (WGS) is a powerful tool used in public health and microbiology for the detection, surveillance, and outbreak investigation of STEC. In this study, we applied three WGS-based subtyping methods, high quality single-nucleotide polymorphism (hqSNP) analysis, whole genome multi-locus sequence typing using chromosome-associated loci [wgMLST (chrom)], and core genome multi-locus sequence typing (cgMLST), to isolate sequences from 11 STEC outbreaks. For each outbreak, we evaluated the concordance between subtyping methods using pairwise genomic differences (number of SNPs or alleles), linear regression models, and tanglegrams. Pairwise genomic differences were highly concordant between methods for all but one outbreak, which was associated with international travel. The slopes of the regressions for hqSNP vs. allele differences were 0.432 (cgMLST) and 0.966 wgMLST (chrom); the slope was 1.914 for cgMLST vs. wgMLST (chrom) differences. Tanglegrams comprised of outbreak and sporadic sequences showed moderate clustering concordance between methods, where Baker’s Gamma Indices (BGIs) ranged between 0.35 and 0.99 and Cophenetic Correlation Coefficients (CCCs) were ≥0.88 across all outbreaks. The K-means analysis using the Silhouette method showed the clear separation of outbreak groups with average silhouette widths ≥0.87 across all methods. This study validates the use of cgMLST for the national surveillance of STEC illness clusters using the PulseNet 2.0 system and demonstrates that hqSNP or wgMLST can be used for further resolution.
ISSN:2076-2607