-
Zero-Shot Fine-Grained Classification by Deep Feature Learning with Semantics
Fine-grained image classification, which aims to distinguish images with subtle distinctions, is a challenging task due to two main issues: lack of sufficient training data for... -
SUN Attribute
The dataset used in the paper is SUN Attribute, which consists of 717 classes of images with annotations. -
Zero-Shot Automatic Pronunciation Assessment
Automatic Pronunciation Assessment (APA) is vital for computer-assisted language learning. Prior methods rely on annotated speech-text data to train Automatic Speech Recognition... -
Pneumonia, Tuberculosis, Retinopathy, and Brain Tumor Datasets
The dataset used in the paper for zero-shot medical image classification, pneumonia, tuberculosis, retinopathy, and brain tumor. -
Zero-Shot Temporal Action Detection via Vision-Language Prompting
Zero-Shot Temporal Action Detection via Vision-Language Prompting (STALE) model for the under-studied yet practically useful zero-shot temporal action detection (ZS-TAD) -
Class Representative Learning Model
The CRL model is based on class-level classifiers, built class-by-class, that would be a representative of instances of a specific class by utilizing activation features of... -
Vision-by-Language for Training-Free Compositional Image Retrieval
Compositional Image Retrieval through Vision-by-Language (CIReVL) is a training-free approach for Zero-Shot Compositional Image Retrieval (CIR). Utilizing off-the-shelf... -
ODIN: On-demand Data Formulation to Mitigate Dataset Lock-in
ODIN is an innovative approach that addresses the problem of dataset constraints by integrating generative AI models. -
Imagenette Dataset
The Imagenette dataset is a zero-shot image classification dataset, containing 13,394 images from ten easily separable classes in ImageNet.