DialogCC: Large-Scale Multi-Modal Dialogue Dataset

A large-scale multi-modal dialogue dataset created by leveraging the automatic pipeline with filtering using CLIP similarity.

BibTex: