Best arm in Continuous Bandits and Variational Quantum Algorithms
Title: |
Best arm in Continuous Bandits and Variational Quantum Algorithms |
DNr: |
NAISS 2025/22-47 |
Project Type: |
NAISS Small Compute |
Principal Investigator: |
Marc Constantin Wanner <wanner@chalmers.se> |
Affiliation: |
Chalmers tekniska högskola |
Duration: |
2025-01-21 – 2025-04-01 |
Classification: |
10201 |
Keywords: |
|
Abstract
We study pure exploration for stochastic ban-
dits where the set of arms is given by the d-
dimensional unit-cube and the expected payoff is
Lipschitz. While regret minimization has been ad-
dressed in this setting, existing methods for pure
exploration only cover discrete payoff. We initi-
ate the study of pure exploration in a continuous
setting. We derive a fixed-confidence information-
theoretic lower bound. Under certain assumptions
on the expected payoff, we show that this bound
can be discretely approximated from below and
derive a simple algorithm, which is near-optimal
with respect to an instance specific lower bound.
Finally, we present numerical experiments that
validate our theoretical bounds and provide a com-
parison of our algorithm’s performance with that
of other existing algorithms.