Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile Exploration
An effective method to enable the enjoyment of works of art by the blind is to reproduce tactile copies of the work, to facilitate tactile exploration. This is even more important when it comes to paintings, which are inherently not accessible to the blind unless they are transformed into 3D models....
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2024-12-01
|
Series: | Heritage |
Subjects: | |
Online Access: | https://www.mdpi.com/2571-9408/8/1/12 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832588432242638848 |
---|---|
author | Rocco Furferi |
author_facet | Rocco Furferi |
author_sort | Rocco Furferi |
collection | DOAJ |
description | An effective method to enable the enjoyment of works of art by the blind is to reproduce tactile copies of the work, to facilitate tactile exploration. This is even more important when it comes to paintings, which are inherently not accessible to the blind unless they are transformed into 3D models. Today, artificial intelligence techniques are rapidly growing and represent a paramount method for solving a variety of previously hard-to-solve tasks. It is, therefore, presumable that the translation from 2D images to 3D models using such methods will be also in continuous development. Unfortunately, reconstructing a 3D model from a single image, especially when it comes to painting-based images, is an ill-posed problem due to the depth ambiguity and the lack of a ground truth for the 3D model. To confront this issue, this paper proposes an overview of artificial intelligence-based methods for reconstructing 3D geometry from a single image is provided. The survey explores the potentiality of Convolutional Neural Networks, Generative Adversarial Networks, Variational Autoencoders, and zero-shot methods. Through a small set of case studies, the capabilities and limitations of CNNs in creating a 3D-scene model from artworks are also encompassed. The findings suggest that, while deep learning models demonstrate that they are effective for 3D retrieval from paintings, they also call for post-processing and user interaction to improve the accuracy of the 3D models. |
format | Article |
id | doaj-art-a2ce4e31fa2940959b91d0f2963056f1 |
institution | Kabale University |
issn | 2571-9408 |
language | English |
publishDate | 2024-12-01 |
publisher | MDPI AG |
record_format | Article |
series | Heritage |
spelling | doaj-art-a2ce4e31fa2940959b91d0f2963056f12025-01-24T13:34:19ZengMDPI AGHeritage2571-94082024-12-01811210.3390/heritage8010012Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile ExplorationRocco Furferi0Department of Industrial Engineering, University of Florence, 50139 Firenze, ItalyAn effective method to enable the enjoyment of works of art by the blind is to reproduce tactile copies of the work, to facilitate tactile exploration. This is even more important when it comes to paintings, which are inherently not accessible to the blind unless they are transformed into 3D models. Today, artificial intelligence techniques are rapidly growing and represent a paramount method for solving a variety of previously hard-to-solve tasks. It is, therefore, presumable that the translation from 2D images to 3D models using such methods will be also in continuous development. Unfortunately, reconstructing a 3D model from a single image, especially when it comes to painting-based images, is an ill-posed problem due to the depth ambiguity and the lack of a ground truth for the 3D model. To confront this issue, this paper proposes an overview of artificial intelligence-based methods for reconstructing 3D geometry from a single image is provided. The survey explores the potentiality of Convolutional Neural Networks, Generative Adversarial Networks, Variational Autoencoders, and zero-shot methods. Through a small set of case studies, the capabilities and limitations of CNNs in creating a 3D-scene model from artworks are also encompassed. The findings suggest that, while deep learning models demonstrate that they are effective for 3D retrieval from paintings, they also call for post-processing and user interaction to improve the accuracy of the 3D models.https://www.mdpi.com/2571-9408/8/1/12CNNsGANsVAEspaintingsblind people3D printing |
spellingShingle | Rocco Furferi Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile Exploration Heritage CNNs GANs VAEs paintings blind people 3D printing |
title | Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile Exploration |
title_full | Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile Exploration |
title_fullStr | Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile Exploration |
title_full_unstemmed | Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile Exploration |
title_short | Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile Exploration |
title_sort | deep learning approaches for 3d model generation from 2d artworks to aid blind people with tactile exploration |
topic | CNNs GANs VAEs paintings blind people 3D printing |
url | https://www.mdpi.com/2571-9408/8/1/12 |
work_keys_str_mv | AT roccofurferi deeplearningapproachesfor3dmodelgenerationfrom2dartworkstoaidblindpeoplewithtactileexploration |