2018年学术报告第45期

发布者:严继臧发布时间:2018-12-14浏览次数:614

统计与管理学院2018年学术报告第45期

【主 题】 A General Framework of Multi-Armed Bandit Processes by Arm Switch Restrictions

【报告人】 吴贤毅, 教授

            华东师范大学

【时 间】 2018年12月11日(星期二)10:00-11:00

【地 点】 上海财经大学统计与管理学院大楼1208会议室

摘 要】This work proposes a general framework of multi-armed bandit (MAB) processes by introducing a type of restrictions on the switches among arms evolving in continuous time.

The Gittins index process is constructed for any single arm subject to the restrictions on switches and then the optimality of the corresponding Gittins index rule is established. The Gittins indices defined in this paper are consistent with the ones for MAB processes in continuous time, integer time, semi-Markovian setting as well as general discrete time setting, so that the new theory covers the classical models as special cases and also applies to many other situations that have not yet been touched in the literature. While the proof of the optimality of Gittins index policies benefits from ideas in the existing theory of MAB processes in continuous time, new techniques are introduced which drastically simplify the proof.

地址:中国上海市杨浦区国定路777号
邮编:200433
院办:021-65901099 021-65901079
本科生教务:021-35312698、021-65901229
研究生教务:021-65901076、021-65901229
版权所有©365上市公司(英国)集团-官方网站
扫码关注我们