Pali: A Jointly-Scaled Multilingual Language-Image Model

This paper proposes a method called Pali, which jointly scales visual and vision-language representation learning.

Data and Resources

Cite this as

Hao Fang, Tsung-Yi Lin, Ramakrishna Vedantam (2024). Dataset: Pali: A Jointly-Scaled Multilingual Language-Image Model. https://doi.org/10.57702/r5u62yth

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Author Hao Fang
More Authors
Tsung-Yi Lin
Ramakrishna Vedantam