ViT-ReT: Vision and Recurrent Transformer Neural Networks for Human Activity Recognition in Videos

Human activity recognition is an emerging and important area in computer vision which seeks to determine the activity an individual or group of individuals are performing.

BibTex: