X-volution: On the Unification of Convolution and Self-attention

Convolution and self-attention are acting as two fundamental building blocks in deep neural networks, where the former extracts local image features in a linear way while the latter non-locally encodes high-order contextual relationships.

Data and Resources

Cite this as

Xuanhong Chen, Hang Wang, Bingbing Ni (2024). Dataset: X-volution: On the Unification of Convolution and Self-attention. https://doi.org/10.57702/wq7g3i9y

DOI retrieved: December 3, 2024

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Defined In https://doi.org/10.48550/arXiv.2106.02253
Author Xuanhong Chen
More Authors
Hang Wang
Bingbing Ni