You're currently viewing an old version of this dataset. To see the current version, click here.

Flickr30k Entities

Flickr30k Entities is a dataset of 31k images, each annotated with 5 captions, and contains 275k annotated bounding boxes associated with natural language phrases.

Data and Resources

This dataset has no data

Cite this as

Bryan A Plummer, Liwei Wang, Chris M Cervantes, Juan C Caicedo, Julia Hockenmaier, Svetlana Lazebnik (2024). Dataset: Flickr30k Entities. https://doi.org/10.57702/8jotglt8

DOI retrieved: November 25, 2024

Additional Info

Field Value
Created November 25, 2024
Last update December 2, 2024
Defined In https://doi.org/10.48550/arXiv.1711.08389
Citation
  • https://doi.org/10.48550/arXiv.2306.07490
  • https://doi.org/10.48550/arXiv.2006.03776
  • https://doi.org/10.48550/arXiv.2405.15321
  • https://doi.org/10.48550/arXiv.2206.08358
Author Bryan A Plummer
More Authors
Liwei Wang
Chris M Cervantes
Juan C Caicedo
Julia Hockenmaier
Svetlana Lazebnik
Homepage https://flickr30k.org/