How2: A large-scale dataset for multimodal language understanding

How2 is a large-scale multimodal machine translation dataset. Its mean sentence length is 1.57 times longer than that of Multi30k, and it contains no repetition.

Cite this as

Vikas Raunak, Sang Keun Choe, Quanyang Lu, Yi Xu, Florian Metze (2024). Dataset: How2: A large-scale dataset for multimodal language understanding. https://doi.org/10.57702/n0cwczp9

DOI retrieved: December 16, 2024

Additional Info

Created: December 16, 2024
Last update: December 16, 2024
Author: Vikas Raunak
More Authors: Sang Keun Choe, Quanyang Lu, Yi Xu, Florian Metze
Homepage: https://github.com/srvk/how2-dataset