1 dataset found

Tags: transformer networks

  • Long Video Understanding Benchmark

    Towards long-form video understanding. We propose a two-stream spatio-temporal attention network for long video classification which combines the advantages of convolutional...

You can also access this registry using the API (see API Docs).