Caption MNIST

Caption MNIST is a synthetic image-text pair dataset built by filling in the missing colors, digits, and positions in the MNIST dataset.

BibTex: