-
Document-Level OCR Dataset
The dataset used for testing the Vary-base model, containing document-level OCR test set. -
Document and Chart Dataset
The dataset used for training the new vision vocabulary network, containing high-resolution document and chart images with corresponding text.