Computational models reveal that intuitive physics underlies visual processing of soft objects

Abstract Computational explorations of human cognition have been especially successful when applied to visual perception. Existing models have primarily focused on rigid objects, emphasizing shape-preserving invariance to changes in viewpoint, lighting, object size, and scene context. Yet many objec...

Full description

Saved in:
Bibliographic Details
Main Authors: Wenyan Bi, Aalap D. Shah, Kimberly W. Wong, Brian J. Scholl, Ilker Yildirim
Format: Article
Language:English
Published: Nature Portfolio 2025-07-01
Series:Nature Communications
Online Access:https://doi.org/10.1038/s41467-025-61458-x
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849342153828859904
author Wenyan Bi
Aalap D. Shah
Kimberly W. Wong
Brian J. Scholl
Ilker Yildirim
author_facet Wenyan Bi
Aalap D. Shah
Kimberly W. Wong
Brian J. Scholl
Ilker Yildirim
author_sort Wenyan Bi
collection DOAJ
description Abstract Computational explorations of human cognition have been especially successful when applied to visual perception. Existing models have primarily focused on rigid objects, emphasizing shape-preserving invariance to changes in viewpoint, lighting, object size, and scene context. Yet many objects in our everyday environments, such as cloths, are soft. This poses both quantitatively greater and qualitatively different challenges for models of perception, due to soft objects’ dynamic and high-dimensional internal structure, as in the changing folds and wrinkles of a cloth waving in the wind. Soft object perception is also correspondingly rich, involving distinct properties such as stiffness. Here we explore the ability of different kinds of computational models to capture visual perception of the physical properties of cloths (e.g., their degrees of stiffness) undergoing different naturalistic transformations (e.g., falling vs. waving in the wind). Across visual matching tasks, both the successes and failures of human performance are well explained by Woven: a new model that incorporates physics-based simulations to infer probabilistic representations of cloths. Woven outperforms powerful, performance-equated alternatives, including its ablations and a deep neural network, and suggests that humanlike machine vision may also require representations that transcend image statistics, and involve intuitive physics.
format Article
id doaj-art-c0e7ecc721f9497a8cf12c2ffaf83b25
institution Kabale University
issn 2041-1723
language English
publishDate 2025-07-01
publisher Nature Portfolio
record_format Article
series Nature Communications
spelling doaj-art-c0e7ecc721f9497a8cf12c2ffaf83b252025-08-20T03:43:27ZengNature PortfolioNature Communications2041-17232025-07-0116111510.1038/s41467-025-61458-xComputational models reveal that intuitive physics underlies visual processing of soft objectsWenyan Bi0Aalap D. Shah1Kimberly W. Wong2Brian J. Scholl3Ilker Yildirim4Department of Psychology, Yale UniversityDepartment of Psychology, Yale UniversityDepartment of Psychology, Yale UniversityDepartment of Psychology, Yale UniversityDepartment of Psychology, Yale UniversityAbstract Computational explorations of human cognition have been especially successful when applied to visual perception. Existing models have primarily focused on rigid objects, emphasizing shape-preserving invariance to changes in viewpoint, lighting, object size, and scene context. Yet many objects in our everyday environments, such as cloths, are soft. This poses both quantitatively greater and qualitatively different challenges for models of perception, due to soft objects’ dynamic and high-dimensional internal structure, as in the changing folds and wrinkles of a cloth waving in the wind. Soft object perception is also correspondingly rich, involving distinct properties such as stiffness. Here we explore the ability of different kinds of computational models to capture visual perception of the physical properties of cloths (e.g., their degrees of stiffness) undergoing different naturalistic transformations (e.g., falling vs. waving in the wind). Across visual matching tasks, both the successes and failures of human performance are well explained by Woven: a new model that incorporates physics-based simulations to infer probabilistic representations of cloths. Woven outperforms powerful, performance-equated alternatives, including its ablations and a deep neural network, and suggests that humanlike machine vision may also require representations that transcend image statistics, and involve intuitive physics.https://doi.org/10.1038/s41467-025-61458-x
spellingShingle Wenyan Bi
Aalap D. Shah
Kimberly W. Wong
Brian J. Scholl
Ilker Yildirim
Computational models reveal that intuitive physics underlies visual processing of soft objects
Nature Communications
title Computational models reveal that intuitive physics underlies visual processing of soft objects
title_full Computational models reveal that intuitive physics underlies visual processing of soft objects
title_fullStr Computational models reveal that intuitive physics underlies visual processing of soft objects
title_full_unstemmed Computational models reveal that intuitive physics underlies visual processing of soft objects
title_short Computational models reveal that intuitive physics underlies visual processing of soft objects
title_sort computational models reveal that intuitive physics underlies visual processing of soft objects
url https://doi.org/10.1038/s41467-025-61458-x
work_keys_str_mv AT wenyanbi computationalmodelsrevealthatintuitivephysicsunderliesvisualprocessingofsoftobjects
AT aalapdshah computationalmodelsrevealthatintuitivephysicsunderliesvisualprocessingofsoftobjects
AT kimberlywwong computationalmodelsrevealthatintuitivephysicsunderliesvisualprocessingofsoftobjects
AT brianjscholl computationalmodelsrevealthatintuitivephysicsunderliesvisualprocessingofsoftobjects
AT ilkeryildirim computationalmodelsrevealthatintuitivephysicsunderliesvisualprocessingofsoftobjects