A fusocelular skin dataset with whole slide images for deep learning models

Abstract Cutaneous spindle cell (CSC) lesions encompass a spectrum from benign to malignant neoplasms, often posing significant diagnostic challenges. Computer-aided diagnosis systems offer a promising solution to make pathologists’ decisions objective and faster. These systems usually require large...

Full description

Saved in:
Bibliographic Details
Main Authors: Rocío del Amor, Miguel López-Pérez, Pablo Meseguer, Sandra Morales, Liria Terradez, Jose Aneiros-Fernandez, Javier Mateos, Rafael Molina, Valery Naranjo
Format: Article
Language:English
Published: Nature Portfolio 2025-05-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-025-05108-3
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Cutaneous spindle cell (CSC) lesions encompass a spectrum from benign to malignant neoplasms, often posing significant diagnostic challenges. Computer-aided diagnosis systems offer a promising solution to make pathologists’ decisions objective and faster. These systems usually require large-scale datasets with curated labels for effective training; however, manual annotation is time-consuming and expensive. To overcome this challenge, crowdsourcing has emerged as a popular and valuable strategy to scale up the labeling process by distributing the effort among different non-expert annotators. This work introduces AI4SkIN, the first public dataset Whole Slide Images (WSIs) for CSC neoplasms, annotated using an innovative crowdsourcing protocol. AI4SkIN dataset contains 641 Hematoxylin and Eosin stained WSIs with multiclass labels from both expert and trainee pathologists. The dataset improves CSC neoplasm diagnosis using advanced machine learning and crowdsourcing based on Gaussian Processes, showing that models trained on non-expert labels perform comparably to those using expert labels. In conclusion, we illustrate that AI4SkIN provides a good resource for developing and validating methods for multiclass CSC neoplasm classification.
ISSN:2052-4463