-
Auto-Context R-CNN
Region-based convolutional neural networks (R-CNN) have largely dominated object detection. Operators defined on RoIs (Regions of Interest) play an important role in R-CNNs such...
-
MMX-Trailer-20 Dataset
Long-form video understanding (LVU) is a sub-domain of video recognition concerned with understanding contextual information across contiguous shots, which can contain multiple...