DSTC8

The DSTC8 dataset is used for addressing the audio visual scene-aware dialog task, specifically involving generating responses based on multimodal inputs including video, audio, and dialogue history.

BibTex: