GENOMICON-Seq enables realistic simulation of amplicon and exome sequencing for low-frequency mutation detection

Abstract Accurate detection of low-frequency mutations is crucial for understanding viral evolution and tumorigenesis in humans, but is often confounded by technical artifacts introduced during library preparation and sequencing. We present GENOMICON-Seq, an end-to-end simulation tool that models bo...

Full description

Saved in:
Bibliographic Details
Main Authors: Milan S. Stosic, Jean-Marc Costanzi, Ole Herman Ambur, Trine B. Rounge
Format: Article
Language:English
Published: Nature Portfolio 2025-07-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-05267-8
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Accurate detection of low-frequency mutations is crucial for understanding viral evolution and tumorigenesis in humans, but is often confounded by technical artifacts introduced during library preparation and sequencing. We present GENOMICON-Seq, an end-to-end simulation tool that models both amplicon and whole exome sequencing (WES) workflows with realistic biological mutations and technical noise. GENOMICON-Seq inserts ground truth mutations, ranging from APOBEC3-like edits to COSMIC single base substitution signatures, before subjecting samples to simulated PCR errors, probe-capture enrichment, and Illumina-specific sequencing biases. By tracking each mutation’s origin (true or error-derived), researchers can pinpoint detection limits and optimize variant-calling thresholds. We illustrate GENOMICON-Seq’s versatility through study cases involving human papillomavirus (HPV) amplicon sequencing, highlighting the impacts of polymerase fidelity, viral copy number, and read depth on detecting low-frequency mutations. In parallel, WES simulations demonstrate how capture biases and varying allele frequencies affect somatic mutation calls. GENOMICON-Seq is thus a flexible, reproducible framework for assessing new protocols, benchmarking variant callers, and refining data analysis pipelines, ultimately reducing costly trial-and-error in the laboratory. The Docker-based package is freely available at https://github.com/Rounge-lab/GENOMICON-Seq .
ISSN:2045-2322