Hyperspectral dataset of historical documents and mock-ups from 400 to 1700 nm (HYPERDOC)

Abstract HYPERDOC is a hyperspectral imaging dataset of historical documents and mock-ups, designed to facilitate research in material identification in the cultural heritage domain. It contains mock-ups of historical inks (metallo-gallate, sepia, carbon-based, and mixtures) on various supports, inc...

Full description

Saved in:
Bibliographic Details
Main Authors: Ana Belén López-Baldomero, Juan Luis Nieves, Francisco Moronta-Montero, Miguel Ángel Martínez-Domingo, Ramón Fernández-Gualda, Javier Hernández-Andrés, Anna Sofía Reichert, Ana López-Montes, Teresa Espejo, Javier Romero, Eva María Valero
Format: Article
Language:English
Published: Nature Portfolio 2025-07-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-025-05599-0
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849333913212682240
author Ana Belén López-Baldomero
Juan Luis Nieves
Francisco Moronta-Montero
Miguel Ángel Martínez-Domingo
Ramón Fernández-Gualda
Javier Hernández-Andrés
Anna Sofía Reichert
Ana López-Montes
Teresa Espejo
Javier Romero
Eva María Valero
author_facet Ana Belén López-Baldomero
Juan Luis Nieves
Francisco Moronta-Montero
Miguel Ángel Martínez-Domingo
Ramón Fernández-Gualda
Javier Hernández-Andrés
Anna Sofía Reichert
Ana López-Montes
Teresa Espejo
Javier Romero
Eva María Valero
author_sort Ana Belén López-Baldomero
collection DOAJ
description Abstract HYPERDOC is a hyperspectral imaging dataset of historical documents and mock-ups, designed to facilitate research in material identification in the cultural heritage domain. It contains mock-ups of historical inks (metallo-gallate, sepia, carbon-based, and mixtures) on various supports, including some artificially aged, and historical documents from the 15th to 17th centuries (manuscripts, illuminated manuscripts, and family trees). Hyperspectral reflectance images were acquired using line-scan cameras in the VNIR (400-1000 nm) and SWIR (900-1700 nm) ranges and were spatially registered. Small regions of interest, referred to as ‘minicubes’, were extracted from the full document images, and pixel-level ground truth material annotations were performed. False-color RGB images and metadata were included in both the full document and minicube captures. The HYPERDOC dataset has been successfully applied in various experimental studies, including ink classification using machine learning models, spectral unmixing, colorimetric analysis, and binarization. These applications highlight the dataset’s potential, which is publicly available to promote interdisciplinary collaboration and advance the use of hyperspectral imaging in the conservation field.
format Article
id doaj-art-0f4b2b44377449019565d0b7c4a393af
institution Kabale University
issn 2052-4463
language English
publishDate 2025-07-01
publisher Nature Portfolio
record_format Article
series Scientific Data
spelling doaj-art-0f4b2b44377449019565d0b7c4a393af2025-08-20T03:45:44ZengNature PortfolioScientific Data2052-44632025-07-0112111810.1038/s41597-025-05599-0Hyperspectral dataset of historical documents and mock-ups from 400 to 1700 nm (HYPERDOC)Ana Belén López-Baldomero0Juan Luis Nieves1Francisco Moronta-Montero2Miguel Ángel Martínez-Domingo3Ramón Fernández-Gualda4Javier Hernández-Andrés5Anna Sofía Reichert6Ana López-Montes7Teresa Espejo8Javier Romero9Eva María Valero10Color Imaging Laboratory, Department of Optics, Faculty of Sciences, University of GranadaColor Imaging Laboratory, Department of Optics, Faculty of Sciences, University of GranadaColor Imaging Laboratory, Department of Optics, Faculty of Sciences, University of GranadaColor Imaging Laboratory, Department of Optics, Faculty of Sciences, University of GranadaColor Imaging Laboratory, Department of Optics, Faculty of Sciences, University of GranadaColor Imaging Laboratory, Department of Optics, Faculty of Sciences, University of GranadaDepartment of Painting, Faculty of Fine Art, University of GranadaDepartment of Painting, Faculty of Fine Art, University of GranadaDepartment of Painting, Faculty of Fine Art, University of GranadaColor Imaging Laboratory, Department of Optics, Faculty of Sciences, University of GranadaColor Imaging Laboratory, Department of Optics, Faculty of Sciences, University of GranadaAbstract HYPERDOC is a hyperspectral imaging dataset of historical documents and mock-ups, designed to facilitate research in material identification in the cultural heritage domain. It contains mock-ups of historical inks (metallo-gallate, sepia, carbon-based, and mixtures) on various supports, including some artificially aged, and historical documents from the 15th to 17th centuries (manuscripts, illuminated manuscripts, and family trees). Hyperspectral reflectance images were acquired using line-scan cameras in the VNIR (400-1000 nm) and SWIR (900-1700 nm) ranges and were spatially registered. Small regions of interest, referred to as ‘minicubes’, were extracted from the full document images, and pixel-level ground truth material annotations were performed. False-color RGB images and metadata were included in both the full document and minicube captures. The HYPERDOC dataset has been successfully applied in various experimental studies, including ink classification using machine learning models, spectral unmixing, colorimetric analysis, and binarization. These applications highlight the dataset’s potential, which is publicly available to promote interdisciplinary collaboration and advance the use of hyperspectral imaging in the conservation field.https://doi.org/10.1038/s41597-025-05599-0
spellingShingle Ana Belén López-Baldomero
Juan Luis Nieves
Francisco Moronta-Montero
Miguel Ángel Martínez-Domingo
Ramón Fernández-Gualda
Javier Hernández-Andrés
Anna Sofía Reichert
Ana López-Montes
Teresa Espejo
Javier Romero
Eva María Valero
Hyperspectral dataset of historical documents and mock-ups from 400 to 1700 nm (HYPERDOC)
Scientific Data
title Hyperspectral dataset of historical documents and mock-ups from 400 to 1700 nm (HYPERDOC)
title_full Hyperspectral dataset of historical documents and mock-ups from 400 to 1700 nm (HYPERDOC)
title_fullStr Hyperspectral dataset of historical documents and mock-ups from 400 to 1700 nm (HYPERDOC)
title_full_unstemmed Hyperspectral dataset of historical documents and mock-ups from 400 to 1700 nm (HYPERDOC)
title_short Hyperspectral dataset of historical documents and mock-ups from 400 to 1700 nm (HYPERDOC)
title_sort hyperspectral dataset of historical documents and mock ups from 400 to 1700 nm hyperdoc
url https://doi.org/10.1038/s41597-025-05599-0
work_keys_str_mv AT anabelenlopezbaldomero hyperspectraldatasetofhistoricaldocumentsandmockupsfrom400to1700nmhyperdoc
AT juanluisnieves hyperspectraldatasetofhistoricaldocumentsandmockupsfrom400to1700nmhyperdoc
AT franciscomorontamontero hyperspectraldatasetofhistoricaldocumentsandmockupsfrom400to1700nmhyperdoc
AT miguelangelmartinezdomingo hyperspectraldatasetofhistoricaldocumentsandmockupsfrom400to1700nmhyperdoc
AT ramonfernandezgualda hyperspectraldatasetofhistoricaldocumentsandmockupsfrom400to1700nmhyperdoc
AT javierhernandezandres hyperspectraldatasetofhistoricaldocumentsandmockupsfrom400to1700nmhyperdoc
AT annasofiareichert hyperspectraldatasetofhistoricaldocumentsandmockupsfrom400to1700nmhyperdoc
AT analopezmontes hyperspectraldatasetofhistoricaldocumentsandmockupsfrom400to1700nmhyperdoc
AT teresaespejo hyperspectraldatasetofhistoricaldocumentsandmockupsfrom400to1700nmhyperdoc
AT javierromero hyperspectraldatasetofhistoricaldocumentsandmockupsfrom400to1700nmhyperdoc
AT evamariavalero hyperspectraldatasetofhistoricaldocumentsandmockupsfrom400to1700nmhyperdoc