GBM-Reservoir: Brain tumor (Glioblastoma Multiforme) MRI dataset collection with ground truth segmentation masksfigshare

In this article, we present a brain tumor database collection comprising 23,049 samples, with each sample including four different types of MRI brain scans: FLAIR, T1, T1ce, and T2. Additionally, one or two segmentation masks (ground truth) are provided for each sample. The first mask is the raw out...

Full description

Saved in:
Bibliographic Details
Main Authors: Naida Solak, André Ferreira, Gijs Luijten, Behrus Puladi, Victor Alves, Jan Egger
Format: Article
Language:English
Published: Elsevier 2025-02-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352340925000198
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832576480947732480
author Naida Solak
André Ferreira
Gijs Luijten
Behrus Puladi
Victor Alves
Jan Egger
author_facet Naida Solak
André Ferreira
Gijs Luijten
Behrus Puladi
Victor Alves
Jan Egger
author_sort Naida Solak
collection DOAJ
description In this article, we present a brain tumor database collection comprising 23,049 samples, with each sample including four different types of MRI brain scans: FLAIR, T1, T1ce, and T2. Additionally, one or two segmentation masks (ground truth) are provided for each sample. The first mask is the raw output from the registration process and is provided for all samples, while the second mask, provided particularly for synthetic samples, is a post-processed version of the first, designed to simplify interpretation and optimize it for network training. These samples have been acquired via registration process of 438 samples available at the moment of registration from the original dataset provided by the BraTS 2022 Challenge. Registering each pair of existing brain scans results in two additional scans that retain a similar brain shape while featuring varying tumor locations. Consequently, by registering all possible pairs, a dataset originally consisting of n samples can be expanded to n2 samples. The original dataset was collected from different institutions under standard clinical conditions, but with different equipment and imaging protocols. As a result, the image quality is heterogeneous, reflecting the diversity of clinical practices across institutions. This dataset can be utilized for various tasks, such as developing fully automated segmentation algorithms for new, unseen brain tumor cases, particularly through deep learning-based approaches, since ground truth is provided for each sample.
format Article
id doaj-art-712a0cffe0ce4c729ca9d94b358cae07
institution Kabale University
issn 2352-3409
language English
publishDate 2025-02-01
publisher Elsevier
record_format Article
series Data in Brief
spelling doaj-art-712a0cffe0ce4c729ca9d94b358cae072025-01-31T05:11:49ZengElsevierData in Brief2352-34092025-02-0158111287GBM-Reservoir: Brain tumor (Glioblastoma Multiforme) MRI dataset collection with ground truth segmentation masksfigshareNaida Solak0André Ferreira1Gijs Luijten2Behrus Puladi3Victor Alves4Jan Egger5Graz University of Technology (TU Graz), Graz, Styria, Austria; Computer Algorithms for Medicine Laboratory (Café Lab), Graz, Styria, Austria; Institute for AI in Medicine (IKIM), University Hospital Essen (UKE), Ruhrgebiet, Essen, GermanyInstitute for AI in Medicine (IKIM), University Hospital Essen (UKE), Ruhrgebiet, Essen, Germany; Center Algoritmi / LASI, University of Minho, Braga, Portugal; Institute of Medical Informatics, University Hospital RWTH Aachen, Aachen, Germany; Department of Oral and Maxillofacial Surgery, University Hospital RWTH Aachen, Aachen, GermanyGraz University of Technology (TU Graz), Graz, Styria, Austria; Computer Algorithms for Medicine Laboratory (Café Lab), Graz, Styria, Austria; Institute for AI in Medicine (IKIM), University Hospital Essen (UKE), Ruhrgebiet, Essen, Germany; Center for Virtual and Extended Reality in Medicine, University Medicine Essen, Essen, GermanyInstitute of Medical Informatics, University Hospital RWTH Aachen, Aachen, Germany; Department of Oral and Maxillofacial Surgery, University Hospital RWTH Aachen, Aachen, GermanyCenter Algoritmi / LASI, University of Minho, Braga, PortugalGraz University of Technology (TU Graz), Graz, Styria, Austria; Computer Algorithms for Medicine Laboratory (Café Lab), Graz, Styria, Austria; Institute for AI in Medicine (IKIM), University Hospital Essen (UKE), Ruhrgebiet, Essen, Germany; Center for Virtual and Extended Reality in Medicine, University Medicine Essen, Essen, Germany; Corresponding author.In this article, we present a brain tumor database collection comprising 23,049 samples, with each sample including four different types of MRI brain scans: FLAIR, T1, T1ce, and T2. Additionally, one or two segmentation masks (ground truth) are provided for each sample. The first mask is the raw output from the registration process and is provided for all samples, while the second mask, provided particularly for synthetic samples, is a post-processed version of the first, designed to simplify interpretation and optimize it for network training. These samples have been acquired via registration process of 438 samples available at the moment of registration from the original dataset provided by the BraTS 2022 Challenge. Registering each pair of existing brain scans results in two additional scans that retain a similar brain shape while featuring varying tumor locations. Consequently, by registering all possible pairs, a dataset originally consisting of n samples can be expanded to n2 samples. The original dataset was collected from different institutions under standard clinical conditions, but with different equipment and imaging protocols. As a result, the image quality is heterogeneous, reflecting the diversity of clinical practices across institutions. This dataset can be utilized for various tasks, such as developing fully automated segmentation algorithms for new, unseen brain tumor cases, particularly through deep learning-based approaches, since ground truth is provided for each sample.http://www.sciencedirect.com/science/article/pii/S2352340925000198Brain tumor segmentationData augmentationRegistrationBraTSDeep learning
spellingShingle Naida Solak
André Ferreira
Gijs Luijten
Behrus Puladi
Victor Alves
Jan Egger
GBM-Reservoir: Brain tumor (Glioblastoma Multiforme) MRI dataset collection with ground truth segmentation masksfigshare
Data in Brief
Brain tumor segmentation
Data augmentation
Registration
BraTS
Deep learning
title GBM-Reservoir: Brain tumor (Glioblastoma Multiforme) MRI dataset collection with ground truth segmentation masksfigshare
title_full GBM-Reservoir: Brain tumor (Glioblastoma Multiforme) MRI dataset collection with ground truth segmentation masksfigshare
title_fullStr GBM-Reservoir: Brain tumor (Glioblastoma Multiforme) MRI dataset collection with ground truth segmentation masksfigshare
title_full_unstemmed GBM-Reservoir: Brain tumor (Glioblastoma Multiforme) MRI dataset collection with ground truth segmentation masksfigshare
title_short GBM-Reservoir: Brain tumor (Glioblastoma Multiforme) MRI dataset collection with ground truth segmentation masksfigshare
title_sort gbm reservoir brain tumor glioblastoma multiforme mri dataset collection with ground truth segmentation masksfigshare
topic Brain tumor segmentation
Data augmentation
Registration
BraTS
Deep learning
url http://www.sciencedirect.com/science/article/pii/S2352340925000198
work_keys_str_mv AT naidasolak gbmreservoirbraintumorglioblastomamultiformemridatasetcollectionwithgroundtruthsegmentationmasksfigshare
AT andreferreira gbmreservoirbraintumorglioblastomamultiformemridatasetcollectionwithgroundtruthsegmentationmasksfigshare
AT gijsluijten gbmreservoirbraintumorglioblastomamultiformemridatasetcollectionwithgroundtruthsegmentationmasksfigshare
AT behruspuladi gbmreservoirbraintumorglioblastomamultiformemridatasetcollectionwithgroundtruthsegmentationmasksfigshare
AT victoralves gbmreservoirbraintumorglioblastomamultiformemridatasetcollectionwithgroundtruthsegmentationmasksfigshare
AT janegger gbmreservoirbraintumorglioblastomamultiformemridatasetcollectionwithgroundtruthsegmentationmasksfigshare