-
Learning to summarize with human feedback
The paper presents a study on the impact of synthetic data on large language models (LLMs) and proposes a method to steer LLMs towards desirable non-differentiable attributes. -
Kosmos-2: Grounding multimodal large language models to the world
Kosmos-2: Grounding multimodal large language models to the world.