On Quantum Natural Policy Gradients

dc.contributor.author Luís Paulo Santos en
dc.contributor.author Luís Soares Barbosa en
dc.contributor.other 6969 en
dc.contributor.other 5603 en
dc.date.accessioned 2025-01-13T17:04:45Z
dc.date.available 2025-01-13T17:04:45Z
dc.date.issued 2024 en
dc.description.abstract This article delves into the role of the quantum Fisher information matrix (FIM) in enhancing the performance of parameterized quantum circuit (PQC)-based reinforcement learning agents. While previous studies have highlighted the effectiveness of PQC-based policies preconditioned with the quantum FIM in contextual bandits, its impact in broader reinforcement learning contexts, such as Markov decision processes, is less clear. Through a detailed analysis of L & ouml;wner inequalities between quantum and classical FIMs, this study uncovers the nuanced distinctions and implications of using each type of FIM. Our results indicate that a PQC-based agent using the quantum FIM without additional insights typically incurs a larger approximation error and does not guarantee improved performance compared to the classical FIM. Empirical evaluations in classic control benchmarks suggest even though quantum FIM preconditioning outperforms standard gradient ascent, in general, it is not superior to classical FIM preconditioning. en
dc.identifier P-00Z-X4M en
dc.identifier.uri https://repositorio.inesctec.pt/handle/123456789/15248
dc.language eng en
dc.rights info:eu-repo/semantics/openAccess en
dc.title On Quantum Natural Policy Gradients en
dc.type en
dc.type Publication en
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
P-00Z-X4M.pdf
Size:
4.3 MB
Format:
Adobe Portable Document Format
Description: