Cite this as

Xu et al., Chen et al., Li et al., Zhou et al., Dosovitskiy et al., Unterthiner et al., Minderer et al., Heigold et al., Gelly et al. (2024). Dataset: An image is worth 16x16 words: Transformers for image recognition at scale. Resource: Original Metadata. https://doi.org/10.57702/ovifi3ii

DOI retrieved: December 2, 2024

Additional Information

Field Value
Created December 2, 2024
Last updated December 2, 2024
Format JSON