Boundary Crossing Probabilities for General Exponential Families

O.-A. Maillard

doi:10.3103/S1066530718010015

Authors: Maillard O.¹
Affiliations:
1. Inria Lille–Nord Europe
Issue: Vol. 27, No. 1 (2018)
Pages: 1-31
Section: Article
URL: https://journals.rcsi.science/1066-5307/article/view/225812
DOI: https://doi.org/10.3103/S1066530718010015
ID: 225812

Abstract

We consider parametric exponential families of dimension K on the real line and study a variant of boundary crossing probabilities coming from the multi-armed bandit literature. Formally, our result is a concentration inequality that bounds the probability that B^ψ(θ̂_n, θ*) ≥ f(t/n)/n, where θ* is the parameter of an unknown target distribution, θ̂_n is the empirical parameter estimate built from n observations, ψ is the log-partition function of the exponential family, and B^ψ is the corresponding Bregman divergence. From the perspective of stochastic multi-armed bandits, we pay special attention to the case when the boundary function f is logarithmic, as it enables the analysis of the regret of the state-of-the-art KL-ucb and KL-ucb+ strategies, whose analysis was left open in such generality. Indeed, previous results hold only for K = 1, while we provide results for arbitrary finite dimension K, thus considerably extending the existing results. Perhaps surprisingly, we highlight that the proof techniques needed to achieve these strong results already existed three decades ago in the work of T. L. Lai and were apparently forgotten in the bandit community. We provide a modern rewriting of these beautiful techniques, which we believe are useful beyond the application to stochastic multi-armed bandits.
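To make the quantities in the abstract concrete, the following is a minimal sketch for the one-dimensional Bernoulli family (K = 1), which is an illustrative choice and not taken from the article. It assumes the standard Bregman divergence B^ψ(θ, θ') = ψ(θ) − ψ(θ') − (θ − θ')ψ'(θ'); the argument order used in the article may differ. All function names are hypothetical.

```python
import math


def log_partition(theta):
    """psi(theta) = log(1 + exp(theta)) for the Bernoulli family (assumed example)."""
    # Stable softplus: avoids overflow of exp(theta) for large positive theta.
    return max(theta, 0.0) + math.log1p(math.exp(-abs(theta)))


def mean_parameter(theta):
    """Derivative of psi, i.e. the success probability (sigmoid of theta)."""
    return 1.0 / (1.0 + math.exp(-theta))


def natural_parameter(p):
    """Map a success probability p in (0, 1) to the natural parameter."""
    return math.log(p / (1.0 - p))


def bregman(theta_a, theta_b):
    """Standard Bregman divergence of psi between theta_a and theta_b (assumed convention)."""
    return (log_partition(theta_a) - log_partition(theta_b)
            - (theta_a - theta_b) * mean_parameter(theta_b))


def bernoulli_kl(p, q):
    """KL(Bernoulli(p) || Bernoulli(q)) for p, q in (0, 1)."""
    return p * math.log(p / q) + (1.0 - p) * math.log((1.0 - p) / (1.0 - q))


def crosses_boundary(theta_hat, theta_star, n, t, f=math.log):
    """Check the event B(theta_hat, theta_star) >= f(t / n) / n from the abstract."""
    return bregman(theta_hat, theta_star) >= f(t / n) / n


if __name__ == "__main__":
    theta_hat = natural_parameter(0.3)   # hypothetical empirical estimate
    theta_star = natural_parameter(0.5)  # hypothetical true parameter
    print(bregman(theta_hat, theta_star), bernoulli_kl(0.5, 0.3))
    print(crosses_boundary(theta_hat, theta_star, n=100, t=1000))
```

Under this convention, B^ψ(θ̂, θ*) coincides with the Kullback-Leibler divergence from the distribution with parameter θ* to the one with parameter θ̂, which is why the logarithmic boundary connects to KL-based index strategies.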

Keywords

exponential families, Bregman concentration, multi-armed bandits, optimality

About the author

O.-A. Maillard

Inria Lille–Nord Europe

Corresponding author.
Email: odalricambrym.maillard@inria.fr
Villeneuve d’Ascq, France
