Bandit and Jager - Search News

News

Regret and Convergence Bounds for a Class of Continuum-Armed Bandit Problems

Abstract: We consider a class of multi-armed bandit problems where the set of available actions can be mapped to a convex, compact region of Ropf d, sometimes denoted as the ldquocontinuum-armed ...

IEEE5d

Safe Learning in Stochastic Continuum-Armed Bandit With Constraints and Its Application to Network Resource Management

Abstract: This paper studies the problem of stochastic continuum-armed bandit with constraints (SCBwC), where we optimize an unknown reward function subject to an unknown constraint function over a ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

News

Trending now