1 dataset found

Tags: Video-Audio-Text

  • MUGEN-GAME: A large-scale multimodal dataset for video-audio-text understanding and generation

You can also access this registry using the API (see API Docs).