News

Abstract: We consider a class of multi-armed bandit problems where the set of available actions can be mapped to a convex, compact region of ℝ^d, sometimes denoted as the “continuum-armed ...
Abstract: This paper studies the problem of stochastic continuum-armed bandit with constraints (SCBwC), where we optimize an unknown reward function subject to an unknown constraint function over a ...