On Quantum Natural Policy Gradients

This article delves into the role of the quantum Fisher information matrix (FIM) in enhancing the performance of parameterized quantum circuit (PQC)-based reinforcement learning agents. While previous studies have highlighted the effectiveness of PQC-based policies preconditioned with the quantum FI...

Full description

Saved in:
Bibliographic Details
Main Authors: Andre Sequeira, Luis Paulo Santos, Luis Soares Barbosa
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Transactions on Quantum Engineering
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10569042/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832584004320100352
author Andre Sequeira
Luis Paulo Santos
Luis Soares Barbosa
author_facet Andre Sequeira
Luis Paulo Santos
Luis Soares Barbosa
author_sort Andre Sequeira
collection DOAJ
description This article delves into the role of the quantum Fisher information matrix (FIM) in enhancing the performance of parameterized quantum circuit (PQC)-based reinforcement learning agents. While previous studies have highlighted the effectiveness of PQC-based policies preconditioned with the quantum FIM in contextual bandits, its impact in broader reinforcement learning contexts, such as Markov decision processes, is less clear. Through a detailed analysis of Löwner inequalities between quantum and classical FIMs, this study uncovers the nuanced distinctions and implications of using each type of FIM. Our results indicate that a PQC-based agent using the quantum FIM without additional insights typically incurs a larger approximation error and does not guarantee improved performance compared to the classical FIM. Empirical evaluations in classic control benchmarks suggest even though quantum FIM preconditioning outperforms standard gradient ascent, in general, it is not superior to classical FIM preconditioning.
format Article
id doaj-art-e5050aba5e83488686c7b85f3d802c6c
institution Kabale University
issn 2689-1808
language English
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Transactions on Quantum Engineering
spelling doaj-art-e5050aba5e83488686c7b85f3d802c6c2025-01-28T00:02:23ZengIEEEIEEE Transactions on Quantum Engineering2689-18082024-01-01511110.1109/TQE.2024.341809410569042On Quantum Natural Policy GradientsAndre Sequeira0https://orcid.org/0000-0002-6659-9277Luis Paulo Santos1https://orcid.org/0000-0003-4466-1129Luis Soares Barbosa2Department of Informatics, University of Minho, Braga, PortugalDepartment of Informatics, University of Minho, Braga, PortugalDepartment of Informatics, University of Minho, Braga, PortugalThis article delves into the role of the quantum Fisher information matrix (FIM) in enhancing the performance of parameterized quantum circuit (PQC)-based reinforcement learning agents. While previous studies have highlighted the effectiveness of PQC-based policies preconditioned with the quantum FIM in contextual bandits, its impact in broader reinforcement learning contexts, such as Markov decision processes, is less clear. Through a detailed analysis of Löwner inequalities between quantum and classical FIMs, this study uncovers the nuanced distinctions and implications of using each type of FIM. Our results indicate that a PQC-based agent using the quantum FIM without additional insights typically incurs a larger approximation error and does not guarantee improved performance compared to the classical FIM. Empirical evaluations in classic control benchmarks suggest even though quantum FIM preconditioning outperforms standard gradient ascent, in general, it is not superior to classical FIM preconditioning.https://ieeexplore.ieee.org/document/10569042/Fisher informationnatural policy gradients (NPGs)quantum policy gradientsquantum reinforcement learning
spellingShingle Andre Sequeira
Luis Paulo Santos
Luis Soares Barbosa
On Quantum Natural Policy Gradients
IEEE Transactions on Quantum Engineering
Fisher information
natural policy gradients (NPGs)
quantum policy gradients
quantum reinforcement learning
title On Quantum Natural Policy Gradients
title_full On Quantum Natural Policy Gradients
title_fullStr On Quantum Natural Policy Gradients
title_full_unstemmed On Quantum Natural Policy Gradients
title_short On Quantum Natural Policy Gradients
title_sort on quantum natural policy gradients
topic Fisher information
natural policy gradients (NPGs)
quantum policy gradients
quantum reinforcement learning
url https://ieeexplore.ieee.org/document/10569042/
work_keys_str_mv AT andresequeira onquantumnaturalpolicygradients
AT luispaulosantos onquantumnaturalpolicygradients
AT luissoaresbarbosa onquantumnaturalpolicygradients