1 dataset found

Tags: Multimodal Learning

Filter Results
  • XD-Violence

    The XD-Violence dataset is a large-scale multimodal video dataset for violence detection. It consists of 4,754 untrimmed videos with a total duration of 217 hours, covering six...