XD-Violence

The XD-Violence dataset is a large-scale multimodal video dataset for violence detection. It consists of 4,754 untrimmed videos with a total duration of 217 hours, covering six violence types and providing RGB, optical flow, and audio streams for each video.

BibTex: