G-Ref

G-Ref is a dataset for referring image segmentation, comprising about 104K referring expressions for around 55K objects in about 27K images.

BibTeX: