Human Annotated Paragraph Dataset

The dataset used for paragraph recognition in document images by spatial graph convolutional networks (GCN) applied on OCR text boxes.

BibTex: