2024 European Summer Meeting, Rotterdam: August, 2024

Optimal Allocation Strategies in a Discrete-Time Exponential Bandit Problem

Audrey Hu, Liang Zou

This study addresses a theoretic-bandit problem involving a "safe" and a "risky" arm across countably many periods. Departing from the "either-or" binary choices in previous studies, we explore smooth allocation strategies using the first-order approach. Modelling both the action and the posterior as state variables, we obtain clear characterizations of the optimal allocation strategies and their comparative statics. The optimal plan significantly improves on the binary strategies, yielding a higher probability of breakthrough and a higher expected payoff. A Goldilocks principle emerges: the incentives for exploring the risky arm peak when the problem is neither too difficult nor too easy.


