-
Yahoo! Dataset
A real-world dataset collected from the Yahoo! Front Page Today Module, containing a large-scale collection of anonymized user click logs -
Synthetic Environment
A synthetic environment where the reward means originate from a Bernoulli distribution with different parameters and several local change-points -
Learning in Restless Multi-Armed Bandits via Adaptive Arm Sequencing Rules
Learning in restless multi-armed bandits via Adaptive Arm Sequencing Rules