Learning to summarize with human feedback

doi:10.57702/bakxgny5



License

No License Provided


The paper presents a study on the impact of synthetic data on large language models (LLMs) and proposes a method to steer LLMs towards desirable non-differentiable attributes.

