The DSTC7 dataset is also referenced for evaluating model performance in the context of audio visual scene-aware dialog, challenging the generation of appropriate responses based on multimodal inputs.
BibTex:
Before browse our site, please accept our cookies policy