Phase Transitions in Bandits with Switching Constraints

This dataset is used to study the stochastic multi-armed bandit problem under a switching constraint: the total cost incurred by switching between actions must not exceed a given switching budget.
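To make the constraint concrete, the following is a minimal simulation sketch of a bandit run that never exceeds its switching budget. It does not read the dataset and is not the method from the associated paper. The function name `run_budgeted_bandit`, the Bernoulli rewards, the unit cost per switch, and the explore-then-commit policy are all assumptions chosen for illustration.

```python
import random


def run_budgeted_bandit(means, horizon, budget, seed=0):
    """Explore arms in equal blocks, then commit, using at most `budget` switches.

    Hypothetical illustration: Bernoulli arms with success probabilities
    `means`, and an assumed cost of 1 for every change of arm.
    Returns (total_reward, switches_used).
    """
    rng = random.Random(seed)
    k = len(means)

    # Visiting n arms in sequence costs n - 1 switches, and moving to the
    # empirically best arm afterwards may cost one more, so exploring at
    # most `budget` arms keeps the total within the budget.
    n_explore = max(1, min(k, budget))
    block = max(1, horizon // (2 * n_explore))
    schedule = [arm for arm in range(n_explore) for _ in range(block)][:horizon]

    pulls = [0] * k
    wins = [0] * k
    total_reward = 0
    switches = 0
    prev = None
    best = None

    for t in range(horizon):
        if t < len(schedule):
            arm = schedule[t]  # exploration phase
        else:
            if best is None:
                # Commit once to the explored arm with the highest empirical mean.
                best = max(range(n_explore), key=lambda a: wins[a] / pulls[a])
            arm = best

        if prev is not None and arm != prev:
            switches += 1  # assumed unit switching cost

        reward = 1 if rng.random() < means[arm] else 0
        pulls[arm] += 1
        wins[arm] += reward
        total_reward += reward
        prev = arm

    return total_reward, switches


if __name__ == "__main__":
    reward, used = run_budgeted_bandit([0.2, 0.5, 0.8], horizon=1000, budget=2)
    print(f"reward={reward}, switches used={used} (budget 2)")
```

With a budget of 2 and three arms, this policy can only compare the first two arms before committing, which shows how a tight switching budget restricts exploration.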

BibTeX: