-
ICDAR2015 Incidental Scene Text
The dataset for multi-oriented scene text detection, containing 1000 training images and 500 test images. -
MSRA-TD500
The MSRA-TD500 dataset is a benchmark for scene text detection, containing 700 training images and 200 test images, with multi-lingual, arbitrary-oriented and long text lines. -
Verisimilar Image Synthesis for Detection and Recognition of Texts
The proposed scene text image synthesis technique starts with two types of inputs including “Background Images” and “Source Texts” as illustrated in column 1 and 2 in Fig. 1. -
ICDAR 2013
ICDAR 2013 consists of 229 training images and 233 testing images, and similar to ICDAR 2015, it also provides "Strong", "Weak" and "Generic" lexicons for text spotting task.... -
ICDAR 2017 MLT
ICDAR 2017 MLT is a large scale multi-lingual text dataset, which includes 7200 training images, 1800 validation images and 9000 testing images. The dataset is composed of... -
IC15 dataset
The dataset used for scene text spotting with small text instances. -
MSRA-TD500 dataset
The MSRA-TD500 dataset contains 500 natural scene images, of which 300 are for training and 200 are for testing. -
Total-Text dataset
The Total-Text dataset contains the text of various shapes, including horizontal, multi-orientational, and curved. -
Rotated ICDAR 2013 dataset
The dataset used for scene text spotting with arbitrary shapes. -
Total-Text
Total-Text is a dataset for word-level arbitrary-shaped English text detection, containing 1,255 images for training and 300 images for testing.