MD30 (Manually-annotated Dataset)
MD30 (Manually-annotated Dataset) comprises 30 images chosen from the 807 images. For those images, we discard the text s_n generated by BLIP2 and attach a more accurate, human-written text as s_n...
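The replacement step described above can be sketched as follows. This is only an illustration, not the authors' tooling: the record type `ObjectText`, the helper `build_manual_subset`, the `(image_id, n)` keying, and the sample texts are all hypothetical names and values introduced here.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class ObjectText:
    """One text s_n for one image (hypothetical record layout)."""
    image_id: int  # assumed integer identifier of the image
    n: int         # index n of the text within the image
    text: str      # the text s_n
    source: str    # "blip2" or "human"


def build_manual_subset(records, human_texts):
    """Keep only the chosen images and swap each BLIP2 text for a human one.

    `human_texts` maps (image_id, n) to a human-written text; the chosen
    images are assumed to be exactly those that appear in its keys.
    """
    chosen = {image_id for image_id, _ in human_texts}
    subset = []
    for rec in records:
        if rec.image_id not in chosen:
            continue  # image is not part of the manually annotated subset
        key = (rec.image_id, rec.n)
        if key not in human_texts:
            raise KeyError(f"no human text for image {rec.image_id}, index {rec.n}")
        # Discard the BLIP2 text and attach the human-written one.
        subset.append(replace(rec, text=human_texts[key], source="human"))
    return subset


# Made-up sample data, for illustration only.
blip2_records = [
    ObjectText(0, 1, "a dog", "blip2"),
    ObjectText(0, 2, "a chair", "blip2"),
    ObjectText(1, 1, "a cat", "blip2"),
]
human_texts = {(0, 1): "a brown dog lying down", (0, 2): "a wooden chair"}
md_subset = build_manual_subset(blip2_records, human_texts)
```

Because the records are frozen and `replace` returns copies, the original BLIP2-guided records stay untouched, so both datasets can coexist.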
BD807 (BLIP2-guided Dataset with 807 images)
NoiseCollage generates an image with N objects from the following conditions, L, S, and s∗: L = {l_1, ..., l_N} is the set of N layout conditions that control the layout of individual...