Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems

Convolutional neural networks (CNNs) have been employed along with variational Monte Carlo methods for finding the ground state of quantum many-body spin systems with great success. However, it remains uncertain how CNNs, with a model complexity that scales at most linearly with the number of partic...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yilong Ju, Shah Saad Alam, Jonathan Minoff, Fabio Anselmi, Han Pu, Ankit Patel
Format:	Article
Language:	English
Published:	American Physical Society 2025-01-01
Series:	Physical Review Research
Online Access:	http://doi.org/10.1103/PhysRevResearch.7.013094
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832590451466567680
author	Yilong Ju Shah Saad Alam Jonathan Minoff Fabio Anselmi Han Pu Ankit Patel
author_facet	Yilong Ju Shah Saad Alam Jonathan Minoff Fabio Anselmi Han Pu Ankit Patel
author_sort	Yilong Ju
collection	DOAJ
description	Convolutional neural networks (CNNs) have been employed along with variational Monte Carlo methods for finding the ground state of quantum many-body spin systems with great success. However, it remains uncertain how CNNs, with a model complexity that scales at most linearly with the number of particles, solve the “curse of dimensionality” and efficiently represent wavefunctions in exponentially large Hilbert spaces. In this work, we use methodologies from information theory, group theory and machine learning, to elucidate how CNN captures relevant physics of quantum systems. We connect CNNs to a class of restricted maximum entropy (MaxEnt) and entangled plaquette correlator product state (EP-CPS) models that approximate symmetry constrained classical correlations between subsystems. For the final part of the puzzle, inspired by similar analyses for matrix product states and tensor networks, we show that the CNNs rely on the spectrum of each subsystem's entanglement Hamiltonians as captured by the size of the convolutional filter. All put together, these allow CNNs to simulate exponential quantum wave functions using a model that scales at most linear in system size as well as provide clues into when CNNs might fail to simulate Hamiltonians. We incorporate our insights into a new training algorithm and demonstrate its improved efficiency, accuracy, and robustness. Finally, we use regression analysis to show how the CNNs solutions can be used to identify salient physical features of the system that are the most relevant to an efficient approximation. Our integrated approach can be extended to similarly analyzing other neural network architectures and quantum spin systems.
format	Article
id	doaj-art-2aee3469c8a748029358bf2ee2aee07b
institution	Kabale University
issn	2643-1564
language	English
publishDate	2025-01-01
publisher	American Physical Society
record_format	Article
series	Physical Review Research
spelling	doaj-art-2aee3469c8a748029358bf2ee2aee07b2025-01-23T15:02:10ZengAmerican Physical SocietyPhysical Review Research2643-15642025-01-017101309410.1103/PhysRevResearch.7.013094Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systemsYilong JuShah Saad AlamJonathan MinoffFabio AnselmiHan PuAnkit PatelConvolutional neural networks (CNNs) have been employed along with variational Monte Carlo methods for finding the ground state of quantum many-body spin systems with great success. However, it remains uncertain how CNNs, with a model complexity that scales at most linearly with the number of particles, solve the “curse of dimensionality” and efficiently represent wavefunctions in exponentially large Hilbert spaces. In this work, we use methodologies from information theory, group theory and machine learning, to elucidate how CNN captures relevant physics of quantum systems. We connect CNNs to a class of restricted maximum entropy (MaxEnt) and entangled plaquette correlator product state (EP-CPS) models that approximate symmetry constrained classical correlations between subsystems. For the final part of the puzzle, inspired by similar analyses for matrix product states and tensor networks, we show that the CNNs rely on the spectrum of each subsystem's entanglement Hamiltonians as captured by the size of the convolutional filter. All put together, these allow CNNs to simulate exponential quantum wave functions using a model that scales at most linear in system size as well as provide clues into when CNNs might fail to simulate Hamiltonians. We incorporate our insights into a new training algorithm and demonstrate its improved efficiency, accuracy, and robustness. Finally, we use regression analysis to show how the CNNs solutions can be used to identify salient physical features of the system that are the most relevant to an efficient approximation. Our integrated approach can be extended to similarly analyzing other neural network architectures and quantum spin systems.http://doi.org/10.1103/PhysRevResearch.7.013094
spellingShingle	Yilong Ju Shah Saad Alam Jonathan Minoff Fabio Anselmi Han Pu Ankit Patel Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems Physical Review Research
title	Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems
title_full	Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems
title_fullStr	Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems
title_full_unstemmed	Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems
title_short	Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems
title_sort	interpreting convolutional neural networks low dimensional approximation to quantum spin systems
url	http://doi.org/10.1103/PhysRevResearch.7.013094
work_keys_str_mv	AT yilongju interpretingconvolutionalneuralnetworkslowdimensionalapproximationtoquantumspinsystems AT shahsaadalam interpretingconvolutionalneuralnetworkslowdimensionalapproximationtoquantumspinsystems AT jonathanminoff interpretingconvolutionalneuralnetworkslowdimensionalapproximationtoquantumspinsystems AT fabioanselmi interpretingconvolutionalneuralnetworkslowdimensionalapproximationtoquantumspinsystems AT hanpu interpretingconvolutionalneuralnetworkslowdimensionalapproximationtoquantumspinsystems AT ankitpatel interpretingconvolutionalneuralnetworkslowdimensionalapproximationtoquantumspinsystems

Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems

Similar Items