Datasets Activity Stream About Order by Relevance Name Ascending Name Descending Last Modified Go 1 dataset found Tags: adversarial attacks Filter Results Multimodal Large Language Models Harmlessness Alignment Dataset The dataset used in the paper to evaluate the harmlessness alignment of multimodal large language models (MLLMs). The dataset consists of 750 harmful instructions paired with... Dataset JSON