Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems

Convolutional neural networks (CNNs) have been employed along with variational Monte Carlo methods for finding the ground state of quantum many-body spin systems with great success. However, it remains uncertain how CNNs, with a model complexity that scales at most linearly with the number of partic...

Full description

Saved in:
Bibliographic Details
Main Authors: Yilong Ju, Shah Saad Alam, Jonathan Minoff, Fabio Anselmi, Han Pu, Ankit Patel
Format: Article
Language:English
Published: American Physical Society 2025-01-01
Series:Physical Review Research
Online Access:http://doi.org/10.1103/PhysRevResearch.7.013094
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832590451466567680
author Yilong Ju
Shah Saad Alam
Jonathan Minoff
Fabio Anselmi
Han Pu
Ankit Patel
author_facet Yilong Ju
Shah Saad Alam
Jonathan Minoff
Fabio Anselmi
Han Pu
Ankit Patel
author_sort Yilong Ju
collection DOAJ
description Convolutional neural networks (CNNs) have been employed along with variational Monte Carlo methods for finding the ground state of quantum many-body spin systems with great success. However, it remains uncertain how CNNs, with a model complexity that scales at most linearly with the number of particles, solve the “curse of dimensionality” and efficiently represent wavefunctions in exponentially large Hilbert spaces. In this work, we use methodologies from information theory, group theory and machine learning, to elucidate how CNN captures relevant physics of quantum systems. We connect CNNs to a class of restricted maximum entropy (MaxEnt) and entangled plaquette correlator product state (EP-CPS) models that approximate symmetry constrained classical correlations between subsystems. For the final part of the puzzle, inspired by similar analyses for matrix product states and tensor networks, we show that the CNNs rely on the spectrum of each subsystem's entanglement Hamiltonians as captured by the size of the convolutional filter. All put together, these allow CNNs to simulate exponential quantum wave functions using a model that scales at most linear in system size as well as provide clues into when CNNs might fail to simulate Hamiltonians. We incorporate our insights into a new training algorithm and demonstrate its improved efficiency, accuracy, and robustness. Finally, we use regression analysis to show how the CNNs solutions can be used to identify salient physical features of the system that are the most relevant to an efficient approximation. Our integrated approach can be extended to similarly analyzing other neural network architectures and quantum spin systems.
format Article
id doaj-art-2aee3469c8a748029358bf2ee2aee07b
institution Kabale University
issn 2643-1564
language English
publishDate 2025-01-01
publisher American Physical Society
record_format Article
series Physical Review Research
spelling doaj-art-2aee3469c8a748029358bf2ee2aee07b2025-01-23T15:02:10ZengAmerican Physical SocietyPhysical Review Research2643-15642025-01-017101309410.1103/PhysRevResearch.7.013094Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systemsYilong JuShah Saad AlamJonathan MinoffFabio AnselmiHan PuAnkit PatelConvolutional neural networks (CNNs) have been employed along with variational Monte Carlo methods for finding the ground state of quantum many-body spin systems with great success. However, it remains uncertain how CNNs, with a model complexity that scales at most linearly with the number of particles, solve the “curse of dimensionality” and efficiently represent wavefunctions in exponentially large Hilbert spaces. In this work, we use methodologies from information theory, group theory and machine learning, to elucidate how CNN captures relevant physics of quantum systems. We connect CNNs to a class of restricted maximum entropy (MaxEnt) and entangled plaquette correlator product state (EP-CPS) models that approximate symmetry constrained classical correlations between subsystems. For the final part of the puzzle, inspired by similar analyses for matrix product states and tensor networks, we show that the CNNs rely on the spectrum of each subsystem's entanglement Hamiltonians as captured by the size of the convolutional filter. All put together, these allow CNNs to simulate exponential quantum wave functions using a model that scales at most linear in system size as well as provide clues into when CNNs might fail to simulate Hamiltonians. We incorporate our insights into a new training algorithm and demonstrate its improved efficiency, accuracy, and robustness. Finally, we use regression analysis to show how the CNNs solutions can be used to identify salient physical features of the system that are the most relevant to an efficient approximation. Our integrated approach can be extended to similarly analyzing other neural network architectures and quantum spin systems.http://doi.org/10.1103/PhysRevResearch.7.013094
spellingShingle Yilong Ju
Shah Saad Alam
Jonathan Minoff
Fabio Anselmi
Han Pu
Ankit Patel
Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems
Physical Review Research
title Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems
title_full Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems
title_fullStr Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems
title_full_unstemmed Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems
title_short Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems
title_sort interpreting convolutional neural networks low dimensional approximation to quantum spin systems
url http://doi.org/10.1103/PhysRevResearch.7.013094
work_keys_str_mv AT yilongju interpretingconvolutionalneuralnetworkslowdimensionalapproximationtoquantumspinsystems
AT shahsaadalam interpretingconvolutionalneuralnetworkslowdimensionalapproximationtoquantumspinsystems
AT jonathanminoff interpretingconvolutionalneuralnetworkslowdimensionalapproximationtoquantumspinsystems
AT fabioanselmi interpretingconvolutionalneuralnetworkslowdimensionalapproximationtoquantumspinsystems
AT hanpu interpretingconvolutionalneuralnetworkslowdimensionalapproximationtoquantumspinsystems
AT ankitpatel interpretingconvolutionalneuralnetworkslowdimensionalapproximationtoquantumspinsystems