Diffusion probabilistic model for Tibetan painted sketch extraction

Abstract During the process of Tibetan painting, the same sketch can be used to create different types of artwork. However, for each type, the artist must meticulously redraw an identical sketch. Crafting a complex sketch demands considerable time and effort from the artist, making the extraction of...

Full description

Saved in:
Bibliographic Details
Main Authors: Fubo Wang, Shengling Geng, Zeyu Jia, Mingcong Dang
Format: Article
Language:English
Published: Nature Portfolio 2025-07-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-025-07638-7
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849333318294700032
author Fubo Wang
Shengling Geng
Zeyu Jia
Mingcong Dang
author_facet Fubo Wang
Shengling Geng
Zeyu Jia
Mingcong Dang
author_sort Fubo Wang
collection DOAJ
description Abstract During the process of Tibetan painting, the same sketch can be used to create different types of artwork. However, for each type, the artist must meticulously redraw an identical sketch. Crafting a complex sketch demands considerable time and effort from the artist, making the extraction of these sketchs crucial. Given the intricate colors and line structures of Tibetan paintings, existing learning-based methods for extracting sketchs often fail to accurately and clearly capture these elements. With the successful application of Diffusion Probabilistic Models (DPM), we propose a method for extracting lsketchs of Tibetan paintings based on DPM—DiffusionSketch. By introducing DPM, DiffusionSketch simultaneously captures coarse-grained global context and fine-grained local context features in two stages. We propose a corresponding adaptive wavelet filter to adjust latent features of specific frequencies and integrate the features extracted from both stages through a Feature Fusion Module (FFM). Moreover, to minimize errors and distortions in the generated sketchs, the fused output features are processed by a Generative Adversarial Network (GAN) to predict the final effect of the Tibetan painting sketchs. Through all the above technical designs, DiffusionSketch can generate clear and concise sketch of Tibetan paintings with minimal resource consumption. In our HeHuang Tibetan Painting (HHTP) dataset, the sketchs extracted by DiffusionSketch are less noisy, have clearer lines, and closely resemble the sketchs created by real Tibetan painting artists compared to existing sketch extraction methods. The sketchs extracted by our method rank first, with an average ranking of 1.137 among the results extracted by 36 different methods. On the HHTP dataset, our method outperforms the second-ranked method by 57.12 % in subjective evaluation and by 18.30 % in objective evaluation criteria. Code: https://github.com/HeHuangAI/DiffusionSketch .
format Article
id doaj-art-067ee2d4453c4d9281df152b7d8b1e66
institution Kabale University
issn 2045-2322
language English
publishDate 2025-07-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-067ee2d4453c4d9281df152b7d8b1e662025-08-20T03:45:53ZengNature PortfolioScientific Reports2045-23222025-07-0115111910.1038/s41598-025-07638-7Diffusion probabilistic model for Tibetan painted sketch extractionFubo Wang0Shengling Geng1Zeyu Jia2Mingcong Dang3Qinghai Normal University, School of Computer ScienceQinghai Normal University, School of Computer ScienceQinghai Normal University, School of Computer ScienceQinghai Normal University, School of Computer ScienceAbstract During the process of Tibetan painting, the same sketch can be used to create different types of artwork. However, for each type, the artist must meticulously redraw an identical sketch. Crafting a complex sketch demands considerable time and effort from the artist, making the extraction of these sketchs crucial. Given the intricate colors and line structures of Tibetan paintings, existing learning-based methods for extracting sketchs often fail to accurately and clearly capture these elements. With the successful application of Diffusion Probabilistic Models (DPM), we propose a method for extracting lsketchs of Tibetan paintings based on DPM—DiffusionSketch. By introducing DPM, DiffusionSketch simultaneously captures coarse-grained global context and fine-grained local context features in two stages. We propose a corresponding adaptive wavelet filter to adjust latent features of specific frequencies and integrate the features extracted from both stages through a Feature Fusion Module (FFM). Moreover, to minimize errors and distortions in the generated sketchs, the fused output features are processed by a Generative Adversarial Network (GAN) to predict the final effect of the Tibetan painting sketchs. Through all the above technical designs, DiffusionSketch can generate clear and concise sketch of Tibetan paintings with minimal resource consumption. In our HeHuang Tibetan Painting (HHTP) dataset, the sketchs extracted by DiffusionSketch are less noisy, have clearer lines, and closely resemble the sketchs created by real Tibetan painting artists compared to existing sketch extraction methods. The sketchs extracted by our method rank first, with an average ranking of 1.137 among the results extracted by 36 different methods. On the HHTP dataset, our method outperforms the second-ranked method by 57.12 % in subjective evaluation and by 18.30 % in objective evaluation criteria. Code: https://github.com/HeHuangAI/DiffusionSketch .https://doi.org/10.1038/s41598-025-07638-7
spellingShingle Fubo Wang
Shengling Geng
Zeyu Jia
Mingcong Dang
Diffusion probabilistic model for Tibetan painted sketch extraction
Scientific Reports
title Diffusion probabilistic model for Tibetan painted sketch extraction
title_full Diffusion probabilistic model for Tibetan painted sketch extraction
title_fullStr Diffusion probabilistic model for Tibetan painted sketch extraction
title_full_unstemmed Diffusion probabilistic model for Tibetan painted sketch extraction
title_short Diffusion probabilistic model for Tibetan painted sketch extraction
title_sort diffusion probabilistic model for tibetan painted sketch extraction
url https://doi.org/10.1038/s41598-025-07638-7
work_keys_str_mv AT fubowang diffusionprobabilisticmodelfortibetanpaintedsketchextraction
AT shenglinggeng diffusionprobabilisticmodelfortibetanpaintedsketchextraction
AT zeyujia diffusionprobabilisticmodelfortibetanpaintedsketchextraction
AT mingcongdang diffusionprobabilisticmodelfortibetanpaintedsketchextraction