Multi-user Multi-armed Bandits for Uncoordinated Spectrum Access
The proposed algorithm consists of an estimation phase and an allocation phase. It is shown that if every user adopts the algorithm, the system wide regret is constant with time with high probability.
BibTex: