DialogCC: Large-Scale Multi-Modal Dialogue Dataset

A large-scale multi-modal dialogue dataset created by leveraging the automatic pipeline with filtering using CLIP similarity.

Data and Resources

Cite this as

Young-Jun Lee, Byungsoo Ko, Han-Gyu Kim, Ho-Jin Choi (2024). Dataset: DialogCC: Large-Scale Multi-Modal Dialogue Dataset. https://doi.org/10.57702/vepcqd6n

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2212.04119
Author Young-Jun Lee
More Authors
Byungsoo Ko
Han-Gyu Kim
Ho-Jin Choi
Homepage https://github.com/passing2961/DialogCC