Best arm in Continuous Bandits and Variational Quantum Algorithms
| Title: |
Best arm in Continuous Bandits and Variational Quantum Algorithms |
| DNr: |
NAISS 2025/22-47 |
| Project Type: |
NAISS Small Compute |
| Principal Investigator: |
Marc Constantin Wanner <wanner@chalmers.se> |
| Affiliation: |
Chalmers tekniska högskola |
| Duration: |
2025-01-21 – 2025-04-01 |
| Classification: |
10201 |
| Keywords: |
|
Abstract
We study pure exploration for stochastic ban-
dits where the set of arms is given by the d-
dimensional unit-cube and the expected payoff is
Lipschitz. While regret minimization has been ad-
dressed in this setting, existing methods for pure
exploration only cover discrete payoff. We initi-
ate the study of pure exploration in a continuous
setting. We derive a fixed-confidence information-
theoretic lower bound. Under certain assumptions
on the expected payoff, we show that this bound
can be discretely approximated from below and
derive a simple algorithm, which is near-optimal
with respect to an instance specific lower bound.
Finally, we present numerical experiments that
validate our theoretical bounds and provide a com-
parison of our algorithm’s performance with that
of other existing algorithms.