Multimodal Categorization Task

The dataset used in the paper supports a multimodal categorization task that combines image data and speech signals.

BibTeX: