-
Synthetic Data
The dataset used in the paper is a synthetic dataset for off-policy contextual bandits, with contexts x ∈ X, a finite set of actions A, and bounded real rewards r ∈ A → [0, 1]. -
Adult dataset
A commonly observed pattern in machine learning models is an underprediction of the target feature, with the model’s predicted target rate for members of a given category...