Patent dataset

The dataset used in the paper is a text and image dataset, with 5477 instances, 3201 attributes, and 248 categories. The dataset is used for clustering and evaluation of the proposed method.

BibTex: