Conceptual Inconsistencies in Large Language Models

The dataset consists of 119 clusters, with a total of 584 questions, which include 4 different linguistic forms per query, so we have approximately 146 semantically different queries.

BibTex: