45 datasets found

Tags: Multimodal Learning

Filter Results
  • LLaVA-1.5

    The dataset used in this paper is a multimodal large language model (LLaMA) dataset, specifically LLaVA-1.5, which consists of 7 billion parameters and is used for multimodal...
  • Few-Shot Class-Incremental Learning

    Few-Shot Class-Incremental Learning (FSCIL) is a special case of Class-Incremental Learning (CIL), where only a few training examples are available at every learning session.
  • Multimodal Parameter-Efficient Few-Shot Class Incremental Learning

    Few-Shot Class Incremental Learning (FSCIL) is a challenging continual learning task, where limited training examples are available during several learning sessions.
  • MNIST-SVHN-Text dataset

    The MNIST-SVHN-Text dataset is a multi-modal dataset consisting of images, text, and labels.
  • MSCOCO

    Human Pose Estimation (HPE) aims to estimate the position of each joint point of the human body in a given image. HPE tasks support a wide range of downstream tasks such as...
You can also access this registry using the API (see API Docs).