Metadata Enriched Multi-Instance Contrastive Learning for High-Quality Facial Skin Visual Representations

Utilizing self-supervised learning to learn meaningful representations from unlabeled data can be a cost-effective strategy, particularly in medical domains where expert labeling incurs high costs. Contrastive learning typically employs a single contrastive relationship based on individual instances...

Full description

Saved in:
Bibliographic Details
Main Authors: Jihyo Kim, Sungchul Kim, Seungwon Seo, Bumsoo Kim, Daejeong Mun, Hoonjae Lee, Sangheum Hwang
Format: Article
Language:English
Published: Taylor & Francis Group 2025-12-01
Series:Applied Artificial Intelligence
Online Access:https://www.tandfonline.com/doi/10.1080/08839514.2025.2462389
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850109052004073472
author Jihyo Kim
Sungchul Kim
Seungwon Seo
Bumsoo Kim
Daejeong Mun
Hoonjae Lee
Sangheum Hwang
author_facet Jihyo Kim
Sungchul Kim
Seungwon Seo
Bumsoo Kim
Daejeong Mun
Hoonjae Lee
Sangheum Hwang
author_sort Jihyo Kim
collection DOAJ
description Utilizing self-supervised learning to learn meaningful representations from unlabeled data can be a cost-effective strategy, particularly in medical domains where expert labeling incurs high costs. Contrastive learning typically employs a single contrastive relationship based on individual instances. However, depending on the task-related characteristics, such as facial skin images, this approach may be unsuitable for learning useful representations. In this work, we propose an advanced contrastive learning method to learn high-quality facial skin representations that are useful for various downstream applications related to skin disorders, such as wrinkles and pigmentation. Our method leverages metadata to establish effective multi-instance contrastive relationships specifically for facial skin images. To this end, we employ mini-batches, constructed through the integration of multiple contrastive relationships, to enable a model to learn the multifaceted features of facial skin. Using a facial skin image dataset, we demonstrate that the proposed method is effective in classifying facial wrinkles and pigmentation severity compared to conventional contrastive learning. The features learned by the proposed method adapt well to other skin lesion datasets from different sources, demonstrating the transferability of the learned skin representations. Our study highlights the potential of application-specific batch configurations leveraging metadata to enhance the effectiveness of self-supervised learning.
format Article
id doaj-art-1b7cb7cf0ece46e98ebdec835ad697d0
institution OA Journals
issn 0883-9514
1087-6545
language English
publishDate 2025-12-01
publisher Taylor & Francis Group
record_format Article
series Applied Artificial Intelligence
spelling doaj-art-1b7cb7cf0ece46e98ebdec835ad697d02025-08-20T02:38:11ZengTaylor & Francis GroupApplied Artificial Intelligence0883-95141087-65452025-12-0139110.1080/08839514.2025.2462389Metadata Enriched Multi-Instance Contrastive Learning for High-Quality Facial Skin Visual RepresentationsJihyo Kim0Sungchul Kim1Seungwon Seo2Bumsoo Kim3Daejeong Mun4Hoonjae Lee5Sangheum Hwang6Department of Data Science, Seoul National University of Science and Technology, Seoul, Republic of KoreaDepartment of Data Science, Seoul National University of Science and Technology, Seoul, Republic of KoreaDepartment of Data Science, Seoul National University of Science and Technology, Seoul, Republic of KoreaDepartment of Data Science, Seoul National University of Science and Technology, Seoul, Republic of KoreaDepartment of Industrial Engineering, Seoul National University of Science and Technology, Seoul, Republic of KoreaArtLab Inc, Seoul, Republic of KoreaDepartment of Data Science, Seoul National University of Science and Technology, Seoul, Republic of KoreaUtilizing self-supervised learning to learn meaningful representations from unlabeled data can be a cost-effective strategy, particularly in medical domains where expert labeling incurs high costs. Contrastive learning typically employs a single contrastive relationship based on individual instances. However, depending on the task-related characteristics, such as facial skin images, this approach may be unsuitable for learning useful representations. In this work, we propose an advanced contrastive learning method to learn high-quality facial skin representations that are useful for various downstream applications related to skin disorders, such as wrinkles and pigmentation. Our method leverages metadata to establish effective multi-instance contrastive relationships specifically for facial skin images. To this end, we employ mini-batches, constructed through the integration of multiple contrastive relationships, to enable a model to learn the multifaceted features of facial skin. Using a facial skin image dataset, we demonstrate that the proposed method is effective in classifying facial wrinkles and pigmentation severity compared to conventional contrastive learning. The features learned by the proposed method adapt well to other skin lesion datasets from different sources, demonstrating the transferability of the learned skin representations. Our study highlights the potential of application-specific batch configurations leveraging metadata to enhance the effectiveness of self-supervised learning.https://www.tandfonline.com/doi/10.1080/08839514.2025.2462389
spellingShingle Jihyo Kim
Sungchul Kim
Seungwon Seo
Bumsoo Kim
Daejeong Mun
Hoonjae Lee
Sangheum Hwang
Metadata Enriched Multi-Instance Contrastive Learning for High-Quality Facial Skin Visual Representations
Applied Artificial Intelligence
title Metadata Enriched Multi-Instance Contrastive Learning for High-Quality Facial Skin Visual Representations
title_full Metadata Enriched Multi-Instance Contrastive Learning for High-Quality Facial Skin Visual Representations
title_fullStr Metadata Enriched Multi-Instance Contrastive Learning for High-Quality Facial Skin Visual Representations
title_full_unstemmed Metadata Enriched Multi-Instance Contrastive Learning for High-Quality Facial Skin Visual Representations
title_short Metadata Enriched Multi-Instance Contrastive Learning for High-Quality Facial Skin Visual Representations
title_sort metadata enriched multi instance contrastive learning for high quality facial skin visual representations
url https://www.tandfonline.com/doi/10.1080/08839514.2025.2462389
work_keys_str_mv AT jihyokim metadataenrichedmultiinstancecontrastivelearningforhighqualityfacialskinvisualrepresentations
AT sungchulkim metadataenrichedmultiinstancecontrastivelearningforhighqualityfacialskinvisualrepresentations
AT seungwonseo metadataenrichedmultiinstancecontrastivelearningforhighqualityfacialskinvisualrepresentations
AT bumsookim metadataenrichedmultiinstancecontrastivelearningforhighqualityfacialskinvisualrepresentations
AT daejeongmun metadataenrichedmultiinstancecontrastivelearningforhighqualityfacialskinvisualrepresentations
AT hoonjaelee metadataenrichedmultiinstancecontrastivelearningforhighqualityfacialskinvisualrepresentations
AT sangheumhwang metadataenrichedmultiinstancecontrastivelearningforhighqualityfacialskinvisualrepresentations