Attention-based beamformers for multi-channel speech recognition

The proposed 2D Conv-Attention model is compared with a traditional neural beamformer and a multi-head attention-based model.
