Groups: Human-computer interaction

Training a helpful and harmless assistant with reinforcement learning from hu...

The authors propose a novel approach that incorporates parameter-efficient tuning to better optimize control tokens, thus benefitting controllable generation.