- Multi-armed Bandit Problem with Known Trend
  Multi-armed bandit problem with known trend
- Phase Transitions in Bandits with Switching Constraints
  The dataset is used to study the stochastic multi-armed bandit problem with a constraint that limits the total cost incurred by switching between actions to be no larger than a...
- Simulated Purchasing Datasets
  Simulated purchasing datasets with non-stationary reward distributions