SHIMRIE dataset
SHIMRIE is a dataset for the Object Segmentation from Manipulation Instructions (OSMI) task. It contains 4,341 images and 11,371 sentences, with a vocabulary of 3,558 words, 196,541 words in total, and an average sentence length of 18.8 words.
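As a rough illustration of how corpus statistics of this kind can be computed, here is a minimal sketch. It assumes the instructions are available as a plain list of strings and uses lowercased whitespace tokenization; both are assumptions, since the tokenization behind the figures above is not stated here, so the results may not match them exactly. The function name `corpus_stats` and the toy sentences are hypothetical and are not taken from the dataset.

```python
def corpus_stats(sentences):
    """Return sentence count, total words, vocabulary size, and mean sentence length.

    Assumes lowercased whitespace tokenization (an illustrative choice,
    not necessarily the one used for the published statistics).
    """
    tokenized = [s.lower().split() for s in sentences]
    total_words = sum(len(tokens) for tokens in tokenized)
    vocab = {word for tokens in tokenized for word in tokens}
    n = len(tokenized)
    return {
        "sentences": n,
        "total_words": total_words,
        "vocab_size": len(vocab),
        "avg_sentence_length": total_words / n if n else 0.0,
    }


# Toy input for illustration only; these sentences are not from SHIMRIE.
toy = ["Pick up the red cup", "Put the cup on the table"]
stats = corpus_stats(toy)
print(stats)
```

Swapping in a different tokenizer (for example, one that splits off punctuation) would change the vocabulary size and the average length, which is why the tokenization choice matters when comparing such figures across datasets.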
BibTeX: