Container: A General-Purpose Building Block for Multi-Head Context Aggregation

Convolutional neural networks (CNNs) are ubiquitous in computer vision, with a myriad of effective and efficient variations. Recently, Transformers – originally introduced in natural language processing – have been increasingly adopted in computer vision.

Data and Resources

Cite this as

Peng Gao, Jiasen Lu, Hongsheng Li, Roozbeh Mottaghi, Aniruddha Kembhavi (2024). Dataset: Container: A General-Purpose Building Block for Multi-Head Context Aggregation. https://doi.org/10.57702/3u5v1wf5

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Author Peng Gao
More Authors
Jiasen Lu
Hongsheng Li
Roozbeh Mottaghi
Aniruddha Kembhavi
Homepage https://github.com/allenai/container