TY - JOUR
T1 - Learning linear optical circuits with coherent states
AU - Volkoff, T. J.
AU - Sornborger, Andrew T.
N1 - Publisher Copyright:
© 2024 The Author(s). Published by IOP Publishing Ltd.
PY - 2024/7/26
Y1 - 2024/7/26
N2 - We analyze the energy and training data requirements for supervised learning of an M-mode linear optical circuit by minimizing an empirical risk defined solely from the action of the circuit on coherent states. When the linear optical circuit acts non-trivially only on k < M unknown modes (i.e. a linear optical k-junta), we provide an energy-efficient, adaptive algorithm that identifies the junta set and learns the circuit. We compare two schemes for allocating a total energy, E, to the learning algorithm. In the first scheme, each of the T random training coherent states has energy E / T . In the second scheme, a single random MT-mode coherent state with energy E is partitioned into T training coherent states. The latter scheme exhibits a polynomial advantage in training data size sufficient for convergence of the empirical risk to the full risk due to concentration of measure on the ( 2 M T − 1 ) -sphere. Specifically, generalization bounds for both schemes are proven, which indicate that for ϵ-approximation of the full risk by the empirical risk with high probability, O ( E 2 / 3 M 2 / 3 / ϵ 2 / 3 ) training states are sufficient for the first scheme and O ( E 1 / 3 M 1 / 3 / ϵ 2 / 3 ) training states are sufficient for the second scheme.
AB - We analyze the energy and training data requirements for supervised learning of an M-mode linear optical circuit by minimizing an empirical risk defined solely from the action of the circuit on coherent states. When the linear optical circuit acts non-trivially only on k < M unknown modes (i.e. a linear optical k-junta), we provide an energy-efficient, adaptive algorithm that identifies the junta set and learns the circuit. We compare two schemes for allocating a total energy, E, to the learning algorithm. In the first scheme, each of the T random training coherent states has energy E / T . In the second scheme, a single random MT-mode coherent state with energy E is partitioned into T training coherent states. The latter scheme exhibits a polynomial advantage in training data size sufficient for convergence of the empirical risk to the full risk due to concentration of measure on the ( 2 M T − 1 ) -sphere. Specifically, generalization bounds for both schemes are proven, which indicate that for ϵ-approximation of the full risk by the empirical risk with high probability, O ( E 2 / 3 M 2 / 3 / ϵ 2 / 3 ) training states are sufficient for the first scheme and O ( E 1 / 3 M 1 / 3 / ϵ 2 / 3 ) training states are sufficient for the second scheme.
KW - generalization bounds
KW - quantum machine learning
KW - quantum optics
UR - http://www.scopus.com/inward/record.url?scp=85198701906&partnerID=8YFLogxK
U2 - 10.1088/1751-8121/ad5cac
DO - 10.1088/1751-8121/ad5cac
M3 - Article
AN - SCOPUS:85198701906
SN - 1751-8113
VL - 57
JO - Journal of Physics A: Mathematical and Theoretical
JF - Journal of Physics A: Mathematical and Theoretical
IS - 30
M1 - 305302
ER -