No Organization - Organizations

LLaMA-AdapterV2

LLaMA-AdapterV2: A parameter-efficient visual instruction model for text-image generation.
- Dataset
- JSON
M2Chat: Empowering VLM for Multimodal LLM Interleaved

M2Chat is a novel unified multimodal LLM framework for generating interleaved text-image conversation across various scenarios.
- Dataset
- JSON
Hand-drawn Symbol Recognition of Surgical Flowsheet Graphs with Deep Image Se...

The dataset used in this paper for hand-drawn symbol recognition of surgical flowsheet graphs with deep image segmentation.
- Dataset
- JSON
SFEW

The SFEW dataset is a subset of EmotiW2015 in-the-wild emotion dataset.
- Dataset
- JSON
EmotioNet

Label distribution learning on auxiliary label space graphs for facial expression recognition.
- Dataset
- JSON
AffectNet

The dataset consists of a significant collection of 60,000 facial expression images, categorized into eight different classes, including neutral, happy, angry, sad, fear,...
- Dataset
- JSON
RAF-DB

Facial Expression Recognition (FER) is a classiﬁcation task that points to face variants. Hence, there are certain afﬁnity features between facial expressions, receiving lit-
- Dataset
- JSON
iLIDS-VID

The video-based person re-identiﬁcation (ReID) aims to identify the given pedestrian video sequence across multiple non-overlapping cameras.
- Dataset
- JSON
PRID-2011

The video-based person re-identiﬁcation (ReID) aims to identify the given pedestrian video sequence across multiple non-overlapping cameras.
- Dataset
- JSON
MARS

The video-based person re-identiﬁcation (ReID) aims to identify the given pedestrian video sequence across multiple non-overlapping cameras.
- Dataset
- JSON
Multi-scale 3D Convolution Network for Video Based Person Re-Identiﬁcation

Video based person ReID using a two-stream convolution network to explicitly leverage spatial and temporal cues.
- Dataset
- JSON
Contrails Detection Dataset

The dataset is used for aircraft contrail detection in global satellite images.
- Dataset
- JSON
K400

The dataset used in this paper is K400, a dataset for human action recognition.
- Dataset
- JSON
SSv2

The dataset used in this paper is SSv2, a dataset for human action recognition.
- Dataset
- JSON
PID Dataset

A comprehensive dataset for training deep learning algorithms for classifying different types of pavement distress.
- Dataset
- JSON
XCAT phantom

The dataset used for training the prior score model for diffusion posterior sampling in CT image reconstruction.
- Dataset
- JSON
CTW1500

The dataset used for testing the proposed unsupervised pre-training method for query-based end-to-end instance segmentation (QEIS) models.
- Dataset
- JSON
Cervical Cancer Segmentation on Multiparametric MRI

A dataset of multiparametric MRI images of cervical cancer patients for segmentation and analysis.
- Dataset
- JSON
ImageNet VID

Video object detection dataset.
- Dataset
- JSON
OpenI

A publicly available radiology dataset of chest x-rays and reports that is a subset of the OpenI open source literature and biomedical image collections.
- Dataset
- JSON

24,175 datasets found