-
RedPajama Dataset
The RedPajama dataset is used for single-turn dialogue task. -
Training a helpful and harmless assistant with reinforcement learning from hu...
The authors propose a novel approach that incorporates parameter-efficient tuning to better optimize control tokens, thus benefitting controllable generation.