-
Learning to Charge RF-Energy Harvesting Devices in WiFi Networks
The dataset used in this paper is a simulation dataset for RF-energy harvesting devices in WiFi networks. -
SimpleQuestion dataset for Wikidata
The dataset used in this paper is a reinforcement learning dataset, specifically the SimpleQuestion dataset, which contains questions answerable using Wikidata as the knowledge... -
Toxic-DPO Dataset
The dataset used in the paper is the Toxic-DPO dataset, which is used for reinforcement learning from human feedback. -
Anthropic-HH-RLHF Dataset
The dataset used in the paper is the Anthropic-HH-RLHF dataset, which is used for reinforcement learning from human feedback. -
3-Dots Dataset
The 3-dots dataset is a variation of the moving dot dataset with three dots on the three channels of the image. -
Moving Dot Dataset
The dataset is a simple environment with a moving dot inside a square. The dot cannot leave the square, and is always visible on the screen. The goal is to learn a...