9 datasets found

Groups: Scene Text Recognition
Formats: JSON

  • MSRA-TD500

    The MSRA-TD500 dataset is a benchmark for scene text detection. It contains 300 training images and 200 test images with multilingual, arbitrarily oriented, long text lines.
  • Verisimilar Image Synthesis for Detection and Recognition of Texts

    The proposed scene text image synthesis technique starts with two types of inputs, “Background Images” and “Source Texts”, as illustrated in columns 1 and 2 of Fig. 1.
  • TotalText: A Comprehensive Dataset for Scene Text Detection and Recognition

    The TotalText dataset is a comprehensive dataset for scene text detection and recognition.
  • ICDAR 2013

    ICDAR 2013 consists of 229 training images and 233 testing images. Like ICDAR 2015, it provides "Strong", "Weak" and "Generic" lexicons for the text spotting task...
  • ICDAR 2017 MLT

    ICDAR 2017 MLT is a large-scale multi-lingual text dataset comprising 7,200 training images, 1,800 validation images and 9,000 testing images. The dataset is composed of...
  • ICDAR2015

    The ICDAR2015 dataset consists of 1,670 images (17,548 annotated text regions) acquired using Google Glass.
  • ICDAR2013

    The ICDAR2013 dataset comes from the Robust Reading Challenge 2013.
  • Total-Text

    Total-Text is a dataset for word-level arbitrary-shaped English text detection, containing 1,255 images for training and 300 images for testing.
  • SynthText

    The SynthText dataset was proposed by Gupta et al. for scene text detection. The original dataset comprises 800,000 scene text images, each containing multiple word instances.