-
Face Transformer
Face transformer for recognition.
-
Smart Reply and Ambient Clinical Intelligence
The dataset used for Smart Reply and Ambient Clinical Intelligence tasks.
-
SIGMORPHON 2020 Shared Task 1
The task of grapheme-to-phoneme (G2P) conversion is important for both speech recognition and synthesis. The data provided by the organizers of the shared task are extracted...
-
DeFTAN-II: Efficient Multichannel Speech Enhancement with Subgroup Processing
A multichannel speech enhancement model based on a transformer architecture and subgroup processing.
-
YOLO-Former: YOLO Shakes Hand With ViT
The proposed YOLO-Former method combines ideas from the transformer and YOLOv4 to build an accurate and efficient object detection system.
-
Asformer: Transformer for Action Segmentation
Dataset for supervised action segmentation.
-
Safe Self-Refinement for Transformer-based Domain Adaptation
Unsupervised Domain Adaptation (UDA) aims to leverage a label-rich source domain to solve tasks on a related unlabeled target domain.