Diffusion probabilistic model for Tibetan painted sketch extraction
Abstract During the process of Tibetan painting, the same sketch can be used to create different types of artwork. However, for each type, the artist must meticulously redraw an identical sketch. Crafting a complex sketch demands considerable time and effort from the artist, making the extraction of...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-07-01
|
| Series: | Scientific Reports |
| Online Access: | https://doi.org/10.1038/s41598-025-07638-7 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849333318294700032 |
|---|---|
| author | Fubo Wang Shengling Geng Zeyu Jia Mingcong Dang |
| author_facet | Fubo Wang Shengling Geng Zeyu Jia Mingcong Dang |
| author_sort | Fubo Wang |
| collection | DOAJ |
| description | Abstract During the process of Tibetan painting, the same sketch can be used to create different types of artwork. However, for each type, the artist must meticulously redraw an identical sketch. Crafting a complex sketch demands considerable time and effort from the artist, making the extraction of these sketchs crucial. Given the intricate colors and line structures of Tibetan paintings, existing learning-based methods for extracting sketchs often fail to accurately and clearly capture these elements. With the successful application of Diffusion Probabilistic Models (DPM), we propose a method for extracting lsketchs of Tibetan paintings based on DPM—DiffusionSketch. By introducing DPM, DiffusionSketch simultaneously captures coarse-grained global context and fine-grained local context features in two stages. We propose a corresponding adaptive wavelet filter to adjust latent features of specific frequencies and integrate the features extracted from both stages through a Feature Fusion Module (FFM). Moreover, to minimize errors and distortions in the generated sketchs, the fused output features are processed by a Generative Adversarial Network (GAN) to predict the final effect of the Tibetan painting sketchs. Through all the above technical designs, DiffusionSketch can generate clear and concise sketch of Tibetan paintings with minimal resource consumption. In our HeHuang Tibetan Painting (HHTP) dataset, the sketchs extracted by DiffusionSketch are less noisy, have clearer lines, and closely resemble the sketchs created by real Tibetan painting artists compared to existing sketch extraction methods. The sketchs extracted by our method rank first, with an average ranking of 1.137 among the results extracted by 36 different methods. On the HHTP dataset, our method outperforms the second-ranked method by 57.12 % in subjective evaluation and by 18.30 % in objective evaluation criteria. Code: https://github.com/HeHuangAI/DiffusionSketch . |
| format | Article |
| id | doaj-art-067ee2d4453c4d9281df152b7d8b1e66 |
| institution | Kabale University |
| issn | 2045-2322 |
| language | English |
| publishDate | 2025-07-01 |
| publisher | Nature Portfolio |
| record_format | Article |
| series | Scientific Reports |
| spelling | doaj-art-067ee2d4453c4d9281df152b7d8b1e662025-08-20T03:45:53ZengNature PortfolioScientific Reports2045-23222025-07-0115111910.1038/s41598-025-07638-7Diffusion probabilistic model for Tibetan painted sketch extractionFubo Wang0Shengling Geng1Zeyu Jia2Mingcong Dang3Qinghai Normal University, School of Computer ScienceQinghai Normal University, School of Computer ScienceQinghai Normal University, School of Computer ScienceQinghai Normal University, School of Computer ScienceAbstract During the process of Tibetan painting, the same sketch can be used to create different types of artwork. However, for each type, the artist must meticulously redraw an identical sketch. Crafting a complex sketch demands considerable time and effort from the artist, making the extraction of these sketchs crucial. Given the intricate colors and line structures of Tibetan paintings, existing learning-based methods for extracting sketchs often fail to accurately and clearly capture these elements. With the successful application of Diffusion Probabilistic Models (DPM), we propose a method for extracting lsketchs of Tibetan paintings based on DPM—DiffusionSketch. By introducing DPM, DiffusionSketch simultaneously captures coarse-grained global context and fine-grained local context features in two stages. We propose a corresponding adaptive wavelet filter to adjust latent features of specific frequencies and integrate the features extracted from both stages through a Feature Fusion Module (FFM). Moreover, to minimize errors and distortions in the generated sketchs, the fused output features are processed by a Generative Adversarial Network (GAN) to predict the final effect of the Tibetan painting sketchs. Through all the above technical designs, DiffusionSketch can generate clear and concise sketch of Tibetan paintings with minimal resource consumption. In our HeHuang Tibetan Painting (HHTP) dataset, the sketchs extracted by DiffusionSketch are less noisy, have clearer lines, and closely resemble the sketchs created by real Tibetan painting artists compared to existing sketch extraction methods. The sketchs extracted by our method rank first, with an average ranking of 1.137 among the results extracted by 36 different methods. On the HHTP dataset, our method outperforms the second-ranked method by 57.12 % in subjective evaluation and by 18.30 % in objective evaluation criteria. Code: https://github.com/HeHuangAI/DiffusionSketch .https://doi.org/10.1038/s41598-025-07638-7 |
| spellingShingle | Fubo Wang Shengling Geng Zeyu Jia Mingcong Dang Diffusion probabilistic model for Tibetan painted sketch extraction Scientific Reports |
| title | Diffusion probabilistic model for Tibetan painted sketch extraction |
| title_full | Diffusion probabilistic model for Tibetan painted sketch extraction |
| title_fullStr | Diffusion probabilistic model for Tibetan painted sketch extraction |
| title_full_unstemmed | Diffusion probabilistic model for Tibetan painted sketch extraction |
| title_short | Diffusion probabilistic model for Tibetan painted sketch extraction |
| title_sort | diffusion probabilistic model for tibetan painted sketch extraction |
| url | https://doi.org/10.1038/s41598-025-07638-7 |
| work_keys_str_mv | AT fubowang diffusionprobabilisticmodelfortibetanpaintedsketchextraction AT shenglinggeng diffusionprobabilisticmodelfortibetanpaintedsketchextraction AT zeyujia diffusionprobabilisticmodelfortibetanpaintedsketchextraction AT mingcongdang diffusionprobabilisticmodelfortibetanpaintedsketchextraction |