Training a helpful and harmless assistant with reinforcement learning from human feedback

The authors propose a novel approach that incorporates parameter-efficient tuning to better optimize control tokens, thus benefitting controllable generation.

BibTex: