Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile Exploration

An effective method to enable the enjoyment of works of art by the blind is to reproduce tactile copies of the work, to facilitate tactile exploration. This is even more important when it comes to paintings, which are inherently not accessible to the blind unless they are transformed into 3D models....

Full description

Saved in:
Bibliographic Details
Main Author: Rocco Furferi
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Heritage
Subjects:
Online Access:https://www.mdpi.com/2571-9408/8/1/12
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832588432242638848
author Rocco Furferi
author_facet Rocco Furferi
author_sort Rocco Furferi
collection DOAJ
description An effective method to enable the enjoyment of works of art by the blind is to reproduce tactile copies of the work, to facilitate tactile exploration. This is even more important when it comes to paintings, which are inherently not accessible to the blind unless they are transformed into 3D models. Today, artificial intelligence techniques are rapidly growing and represent a paramount method for solving a variety of previously hard-to-solve tasks. It is, therefore, presumable that the translation from 2D images to 3D models using such methods will be also in continuous development. Unfortunately, reconstructing a 3D model from a single image, especially when it comes to painting-based images, is an ill-posed problem due to the depth ambiguity and the lack of a ground truth for the 3D model. To confront this issue, this paper proposes an overview of artificial intelligence-based methods for reconstructing 3D geometry from a single image is provided. The survey explores the potentiality of Convolutional Neural Networks, Generative Adversarial Networks, Variational Autoencoders, and zero-shot methods. Through a small set of case studies, the capabilities and limitations of CNNs in creating a 3D-scene model from artworks are also encompassed. The findings suggest that, while deep learning models demonstrate that they are effective for 3D retrieval from paintings, they also call for post-processing and user interaction to improve the accuracy of the 3D models.
format Article
id doaj-art-a2ce4e31fa2940959b91d0f2963056f1
institution Kabale University
issn 2571-9408
language English
publishDate 2024-12-01
publisher MDPI AG
record_format Article
series Heritage
spelling doaj-art-a2ce4e31fa2940959b91d0f2963056f12025-01-24T13:34:19ZengMDPI AGHeritage2571-94082024-12-01811210.3390/heritage8010012Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile ExplorationRocco Furferi0Department of Industrial Engineering, University of Florence, 50139 Firenze, ItalyAn effective method to enable the enjoyment of works of art by the blind is to reproduce tactile copies of the work, to facilitate tactile exploration. This is even more important when it comes to paintings, which are inherently not accessible to the blind unless they are transformed into 3D models. Today, artificial intelligence techniques are rapidly growing and represent a paramount method for solving a variety of previously hard-to-solve tasks. It is, therefore, presumable that the translation from 2D images to 3D models using such methods will be also in continuous development. Unfortunately, reconstructing a 3D model from a single image, especially when it comes to painting-based images, is an ill-posed problem due to the depth ambiguity and the lack of a ground truth for the 3D model. To confront this issue, this paper proposes an overview of artificial intelligence-based methods for reconstructing 3D geometry from a single image is provided. The survey explores the potentiality of Convolutional Neural Networks, Generative Adversarial Networks, Variational Autoencoders, and zero-shot methods. Through a small set of case studies, the capabilities and limitations of CNNs in creating a 3D-scene model from artworks are also encompassed. The findings suggest that, while deep learning models demonstrate that they are effective for 3D retrieval from paintings, they also call for post-processing and user interaction to improve the accuracy of the 3D models.https://www.mdpi.com/2571-9408/8/1/12CNNsGANsVAEspaintingsblind people3D printing
spellingShingle Rocco Furferi
Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile Exploration
Heritage
CNNs
GANs
VAEs
paintings
blind people
3D printing
title Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile Exploration
title_full Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile Exploration
title_fullStr Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile Exploration
title_full_unstemmed Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile Exploration
title_short Deep Learning Approaches for 3D Model Generation from 2D Artworks to Aid Blind People with Tactile Exploration
title_sort deep learning approaches for 3d model generation from 2d artworks to aid blind people with tactile exploration
topic CNNs
GANs
VAEs
paintings
blind people
3D printing
url https://www.mdpi.com/2571-9408/8/1/12
work_keys_str_mv AT roccofurferi deeplearningapproachesfor3dmodelgenerationfrom2dartworkstoaidblindpeoplewithtactileexploration