Dataset - LDM

Minimax Regret for Online Learning with Feedback Graphs

The dataset used in the paper is a sequence of strongly observable undirected feedback graphs, where each graph has independence number α for some common value α.
- Dataset
- JSON
Lipschitz Bandits

The dataset used in the paper is a Lipschitz bandit problem, where the learner aims at selecting satisficing arms (arms with mean reward exceeding a certain threshold value) as...
- Dataset
- JSON
Concave Bandits

The dataset used in the paper is a concave bandit problem, where the learner aims at selecting satisficing arms (arms with mean reward exceeding a certain threshold value) as...
- Dataset
- JSON
Finite-Armed Bandits

The dataset used in the paper is a finite-armed bandit problem, where the learner aims at selecting satisficing arms (arms with mean reward exceeding a certain threshold value)...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

4 datasets found