subMG automates data submission for metagenomics studies

Abstract Background Publicly available metagenomics datasets are crucial for ensuring the reproducibility of scientific findings and supporting contemporary large-scale studies. However, submitting a comprehensive metagenomics dataset is both cumbersome and time-consuming. It requires including samp...

Full description

Saved in:
Bibliographic Details
Main Authors: Tom Tubbesing, Andreas Schlüter, Alexander Sczyrba
Format: Article
Language:English
Published: BMC 2025-06-01
Series:BioData Mining
Subjects:
Online Access:https://doi.org/10.1186/s13040-025-00453-w
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849470240819249152
author Tom Tubbesing
Andreas Schlüter
Alexander Sczyrba
author_facet Tom Tubbesing
Andreas Schlüter
Alexander Sczyrba
author_sort Tom Tubbesing
collection DOAJ
description Abstract Background Publicly available metagenomics datasets are crucial for ensuring the reproducibility of scientific findings and supporting contemporary large-scale studies. However, submitting a comprehensive metagenomics dataset is both cumbersome and time-consuming. It requires including sample information, sequencing reads, assemblies, binned contigs, metagenome-assembled genomes (MAGs), and appropriate metadata. As a result, metagenomics studies are often published with incomplete datasets or, in some cases, without any data at all. subMG addresses this challenge by simplifying and automating the data submission process, thereby encouraging broader and more consistent data sharing. Results subMG streamlines the process of submitting metagenomics study results to the European Nucleotide Archive (ENA) by allowing researchers to input files and metadata from their studies in a single form and automating downstream tasks that otherwise require extensive manual effort and expertise. The tool comes with comprehensive documentation as well as example data tailored for different use cases and can be operated via the command-line or a graphical user interface (GUI), making it easily deployable to a wide range of potential users. Conclusions By simplifying the submission of genome-resolved metagenomics study datasets, subMG significantly reduces the time, effort, and expertise required from researchers, thus paving the way for more numerous and comprehensive data submissions in the future. An increased availability of well-documented and FAIR data can benefit future research, particularly in meta-analyses and comparative studies.
format Article
id doaj-art-a0d2423c94564c658f947233fcf66bb4
institution Kabale University
issn 1756-0381
language English
publishDate 2025-06-01
publisher BMC
record_format Article
series BioData Mining
spelling doaj-art-a0d2423c94564c658f947233fcf66bb42025-08-20T03:25:12ZengBMCBioData Mining1756-03812025-06-011811710.1186/s13040-025-00453-wsubMG automates data submission for metagenomics studiesTom Tubbesing0Andreas Schlüter1Alexander Sczyrba2Computational Metagenomics Group, Center for Biotechnology (CeBiTec), Bielefeld UniversityComputational Metagenomics Group, Center for Biotechnology (CeBiTec), Bielefeld UniversityComputational Metagenomics Group, Center for Biotechnology (CeBiTec), Bielefeld UniversityAbstract Background Publicly available metagenomics datasets are crucial for ensuring the reproducibility of scientific findings and supporting contemporary large-scale studies. However, submitting a comprehensive metagenomics dataset is both cumbersome and time-consuming. It requires including sample information, sequencing reads, assemblies, binned contigs, metagenome-assembled genomes (MAGs), and appropriate metadata. As a result, metagenomics studies are often published with incomplete datasets or, in some cases, without any data at all. subMG addresses this challenge by simplifying and automating the data submission process, thereby encouraging broader and more consistent data sharing. Results subMG streamlines the process of submitting metagenomics study results to the European Nucleotide Archive (ENA) by allowing researchers to input files and metadata from their studies in a single form and automating downstream tasks that otherwise require extensive manual effort and expertise. The tool comes with comprehensive documentation as well as example data tailored for different use cases and can be operated via the command-line or a graphical user interface (GUI), making it easily deployable to a wide range of potential users. Conclusions By simplifying the submission of genome-resolved metagenomics study datasets, subMG significantly reduces the time, effort, and expertise required from researchers, thus paving the way for more numerous and comprehensive data submissions in the future. An increased availability of well-documented and FAIR data can benefit future research, particularly in meta-analyses and comparative studies.https://doi.org/10.1186/s13040-025-00453-wMetagenomicsEuropean Nucleotide ArchiveSubmissionFAIRMetadata
spellingShingle Tom Tubbesing
Andreas Schlüter
Alexander Sczyrba
subMG automates data submission for metagenomics studies
BioData Mining
Metagenomics
European Nucleotide Archive
Submission
FAIR
Metadata
title subMG automates data submission for metagenomics studies
title_full subMG automates data submission for metagenomics studies
title_fullStr subMG automates data submission for metagenomics studies
title_full_unstemmed subMG automates data submission for metagenomics studies
title_short subMG automates data submission for metagenomics studies
title_sort submg automates data submission for metagenomics studies
topic Metagenomics
European Nucleotide Archive
Submission
FAIR
Metadata
url https://doi.org/10.1186/s13040-025-00453-w
work_keys_str_mv AT tomtubbesing submgautomatesdatasubmissionformetagenomicsstudies
AT andreasschluter submgautomatesdatasubmissionformetagenomicsstudies
AT alexandersczyrba submgautomatesdatasubmissionformetagenomicsstudies