DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

doi:doi:10.57702/ltxezq8o

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

Followers: 0

Organization

No Organization

There is no description for this organization

License

No License Provided

Export

DCAT(rdf/xml) DCAT(xml) DCAT(N3) DCAT(ttl) DCAT(jsonld) DataCite CSL DublinCore BibTex

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

Learning from human feedback has been shown to improve text-to-image models. These techniques first learn a reward function that captures what humans care about in the task and then improve the models based on the learned reward function.

BibTex:

Before browse our site, please accept our cookies policy