-
Fashion-Mnist
A binary imbalanced classification dataset with 28 × 28 grayscale images of 10 classes corresponding to fashion products. -
CVBL video database
CVBL video database for face recognition in videos -
Opening Up Minds with Argumentative Dialogues
A dataset of 183 argumentative dialogues about 3 controversial topics: veganism, Brexit and COVID-19 vaccination. -
GlyphGAN: Style-Consistent Font Generation Based on Generative Adversarial Ne...
Font generation experiment using GlyphGAN, including legibility, diversity, and style consistency evaluation. -
Devil in the Number: Towards Robust Multi-modality Data Filter
The dataset used in the paper is a web-scale dataset for training a vision-language model. The dataset contains text-image pairs, and the authors propose a novel filter to... -
McMaster18
The dataset used in the paper for image deblurring tasks. -
LLaMA-AdapterV2
LLaMA-AdapterV2: A parameter-efficient visual instruction model for text-image generation. -
M2Chat: Empowering VLM for Multimodal LLM Interleaved
M2Chat is a novel unified multimodal LLM framework for generating interleaved text-image conversation across various scenarios. -
Hand-drawn Symbol Recognition of Surgical Flowsheet Graphs with Deep Image Se...
The dataset used in this paper for hand-drawn symbol recognition of surgical flowsheet graphs with deep image segmentation.