Phase Transitions in Bandits with Switching Constraints
The dataset is used to study the stochastic multi-armed bandit problem with a constraint that limits the total cost incurred by switching between actions to be no larger than a given switching budget.
BibTex: