Grit: A generative region-to-text transformer for object understanding

Grit: A generative region-to-text transformer for object understanding.

BibTex: