Hyperspectral dataset of historical documents and mock-ups from 400 to 1700 nm (HYPERDOC)
Abstract: HYPERDOC is a hyperspectral imaging dataset of historical documents and mock-ups, designed to facilitate research in material identification in the cultural heritage domain. It contains mock-ups of historical inks (metallo-gallate, sepia, carbon-based, and mixtures) on various supports, inc...
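| Main Authors: | Ana Belén López-Baldomero, Juan Luis Nieves, Francisco Moronta-Montero, Miguel Ángel Martínez-Domingo, Ramón Fernández-Gualda, Javier Hernández-Andrés, Anna Sofía Reichert, Ana López-Montes, Teresa Espejo, Javier Romero, Eva María Valero |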
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Nature Portfolio, 2025-07-01 |
| Series: | Scientific Data |
| Online Access: | https://doi.org/10.1038/s41597-025-05599-0 |
| _version_ | 1849333913212682240 |
|---|---|
| author | Ana Belén López-Baldomero, Juan Luis Nieves, Francisco Moronta-Montero, Miguel Ángel Martínez-Domingo, Ramón Fernández-Gualda, Javier Hernández-Andrés, Anna Sofía Reichert, Ana López-Montes, Teresa Espejo, Javier Romero, Eva María Valero |
| author_sort | Ana Belén López-Baldomero |
| collection | DOAJ |
| description | HYPERDOC is a hyperspectral imaging dataset of historical documents and mock-ups, designed to facilitate research in material identification in the cultural heritage domain. It contains mock-ups of historical inks (metallo-gallate, sepia, carbon-based, and mixtures) on various supports, including some artificially aged, and historical documents from the 15th to 17th centuries (manuscripts, illuminated manuscripts, and family trees). Hyperspectral reflectance images were acquired using line-scan cameras in the VNIR (400-1000 nm) and SWIR (900-1700 nm) ranges and were spatially registered. Small regions of interest, referred to as ‘minicubes’, were extracted from the full document images and annotated with pixel-level ground-truth material labels. False-color RGB images and metadata are included for both the full document and minicube captures. The HYPERDOC dataset has been applied in several experimental studies, including ink classification using machine learning models, spectral unmixing, colorimetric analysis, and binarization. These applications highlight the potential of the dataset, which is publicly available to promote interdisciplinary collaboration and advance the use of hyperspectral imaging in the conservation field. |
| format | Article |
| id | doaj-art-0f4b2b44377449019565d0b7c4a393af |
| institution | Kabale University |
| issn | 2052-4463 |
| language | English |
| publishDate | 2025-07-01 |
| publisher | Nature Portfolio |
| record_format | Article |
| series | Scientific Data |
| spelling | doaj-art-0f4b2b44377449019565d0b7c4a393af; 2025-08-20T03:45:44Z; eng; Nature Portfolio; Scientific Data; 2052-4463; 2025-07-01; 121118; 10.1038/s41597-025-05599-0; Hyperspectral dataset of historical documents and mock-ups from 400 to 1700 nm (HYPERDOC). Color Imaging Laboratory, Department of Optics, Faculty of Sciences, University of Granada: Ana Belén López-Baldomero, Juan Luis Nieves, Francisco Moronta-Montero, Miguel Ángel Martínez-Domingo, Ramón Fernández-Gualda, Javier Hernández-Andrés, Javier Romero, Eva María Valero. Department of Painting, Faculty of Fine Art, University of Granada: Anna Sofía Reichert, Ana López-Montes, Teresa Espejo. https://doi.org/10.1038/s41597-025-05599-0 |
| title | Hyperspectral dataset of historical documents and mock-ups from 400 to 1700 nm (HYPERDOC) |
| url | https://doi.org/10.1038/s41597-025-05599-0 |