ALIGN

doi:doi:10.57702/1siwlt77

You're currently viewing an old version of this dataset. To see the current version, click here.

ALIGN

Scaling up visual and vision-language representation learning with noisy text supervision.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Soravit Changpinyo, Yinfei Yang, Ye Xia, Yi-Ting Chen, Zarana Parekh, Hieu Pham, Quoc Le, Yun-Hsuan Sung, Zhen Li, Tom Duerig (2024). Dataset: ALIGN. https://doi.org/10.57702/1siwlt77

DOI retrieved: December 2, 2024

Additional Info

Field	Value
Created	December 2, 2024
Last update	December 2, 2024
Defined In	https://doi.org/10.48550/arXiv.2304.08480
Author	Soravit Changpinyo
More Authors	Yinfei Yang Ye Xia Yi-Ting Chen Zarana Parekh Hieu Pham Quoc Le Yun-Hsuan Sung Zhen Li Tom Duerig
Homepage	https://arxiv.org/abs/2106.10232