Abstract: This paper investigates the use of relaxed recentered logarithmic barrier functions in the context of nonlinear model predictive control. These functions are a variation of the regular ...
In this paper, we study the pure exploration model with general distribution functions, which means that the reward function of each arm depends on the whole distribution, not only its mean. We adapt ...