2 datasets found

Tags: Multimodal Learning

  • RWTH-PHOENIX-Weather

    Continuous sign language recognition (SLR) deals with unaligned video-text pairs and uses the word error rate (WER), i.e., the word-level edit distance normalized by the reference length, as the main evaluation metric.
  • CSL

    The CSL dataset is a large-scale Chinese scientific literature dataset obtained from the "Qianyan" open-source NLP platform. It consists of 396,209 Chinese core journal papers'...
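
The WER metric mentioned in the RWTH-PHOENIX-Weather entry can be illustrated with a minimal sketch. It assumes whitespace-tokenized sentences and the usual definition, (substitutions + deletions + insertions) divided by the number of reference words. The function name `word_error_rate` and the example sentences are hypothetical and are not taken from the dataset or any official evaluation script.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: tokens are obtained by splitting on whitespace.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the first i-1 reference words
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


# Hypothetical example: one deleted word ("the") and one substitution
# ("tomorrow" -> "today") against a five-word reference.
wer = word_error_rate("rain in the north tomorrow", "rain in north today")
```

Because insertions count as errors, this value can exceed 1.0 when the hypothesis is much longer than the reference.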