This dataset was generated using MSCOCO [22]. The goal is to select the correct prompt for each image with high accuracy and confidence where the background is...
This dataset was generated using MSCOCO [22]. The goal is to select the correct prompt for each image with high accuracy and confidence where the attribute is...
This dataset was created using Visual Genome (VG) [21]. The goal is to select the correct prompt for each image with high accuracy and confidence where the...
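
The shared goal across these datasets, selecting the correct prompt with high accuracy and confidence, can be illustrated with a minimal sketch. The text does not define how confidence is measured, so the code below assumes it is the softmax probability assigned to the correct prompt, averaged over images. The function name `prompt_selection_metrics` and its inputs (a list of per-image similarity scores and the index of the correct prompt for each image) are hypothetical and chosen only for illustration.

```python
import math


def prompt_selection_metrics(scores, correct_idx):
    """Return (accuracy, mean confidence) for prompt selection.

    scores: one list per image holding a similarity score for each
        candidate prompt (hypothetical input format).
    correct_idx: index of the correct prompt for each image.
    """
    hits = 0
    confidence_sum = 0.0
    for row, target in zip(scores, correct_idx):
        # The selected prompt is the one with the highest score
        # (ties resolve to the lowest index).
        predicted = max(range(len(row)), key=row.__getitem__)
        hits += int(predicted == target)

        # Assumed confidence definition: softmax probability of the
        # correct prompt, computed with the usual max shift for stability.
        top = max(row)
        exps = [math.exp(s - top) for s in row]
        confidence_sum += exps[target] / sum(exps)

    n = len(correct_idx)
    return hits / n, confidence_sum / n


# Three images, two candidate prompts each (made-up scores).
accuracy, confidence = prompt_selection_metrics(
    [[2.0, 0.0], [0.0, 1.0], [3.0, 3.0]],
    [0, 1, 1],
)
```

In this toy input the first two images select the correct prompt and the third does not, so the accuracy is 2/3. Reporting confidence alongside accuracy separates a model that picks the right prompt by a narrow margin from one that picks it decisively.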