-
PANDA (Pedantic ANswer-correctness Determination and Adjudication)
Question answering (QA) can only make progress if we know if an answer is correct, but for many of the most challenging and interesting QA examples, current answer correctness... -
Multi-Image VQA for Unsupervised Anomaly Detection
Unsupervised anomaly detection dataset for multi-image visual question answering -
Multilingual Blending: LLM Safety Alignment Evaluation with Language Mixture
Multilingual Blending: LLM Safety Alignment Evaluation with Language Mixture -
RAMM: Retrieval-augmented Biomedical Visual Question Answering
A retrieval-augmented pretrain-and-finetune paradigm for biomedical VQA which includes a high-quality image-text pairs PMCPM, a pre-trained multi-modal model, and a novel... -
Question Answering Datasets
The dataset used in the paper is a collection of adversarial examples and natural examples for question answering tasks. -
REVERIE dataset
The REVERIE dataset is a dataset of household tasks in an indoor environment. It contains images annotated with natural language instructions including the referring expressions... -
OpenOrca dataset
The dataset used for the Vectara hallucination task, containing OpenOrca questions. -
Answer Sequence Learning with Neural Networks for Answer Selection in Communi...
Answer selection in community question answering (CQA) is regarded as an answer sequence label-ling task, and a novel approach is proposed based on the recurrent architecture... -
Experimental Results
The authors evaluate the performance of their proposed conformal prediction methods for multistep feedback covariate shift (MFCS) on synthetic black-box optimization and active... -
Youtube2Text-QA
Video question answering task, which requires machines to answer questions about videos in a natural language form.