Scene Text Recognition - Groups

Syn-1m

The dataset used for scene text recognition, containing regular and irregular scene text images.
- Dataset
- JSON
Syn-100k

The dataset used for scene text recognition, containing regular and irregular scene text images.
- Dataset
- JSON
Syn-10k

The dataset used for scene text recognition, containing regular and irregular scene text images.
- Dataset
- JSON
Real-50k

The dataset used for scene text recognition, containing regular and irregular scene text images.
- Dataset
- JSON
CUTE 80

Scene text recognition dataset
- Dataset
- JSON
SCATTER

Scene text recognition dataset
- Dataset
- JSON
MSRA-TD500

The MSRA-TD500 dataset is a benchmark for scene text detection, containing 700 training images and 200 test images, with multi-lingual, arbitrary-oriented and long text lines.
- Dataset
- JSON
Verisimilar Image Synthesis for Detection and Recognition of Texts

The proposed scene text image synthesis technique starts with two types of inputs including “Background Images” and “Source Texts” as illustrated in column 1 and 2 in Fig. 1.
- Dataset
- JSON
ICDAR2015 Robust Reading Competition

The ICDAR2015 dataset is a benchmark for scene text recognition.
- Dataset
- JSON
ICDAR2013 Robust Reading Competition

The ICDAR2013 dataset is a benchmark for scene text recognition.
- Dataset
- JSON
TotalText: A Comprehensive Dataset for Scene Text Detection and Recognition

The TotalText dataset is a comprehensive dataset for scene text detection and recognition.
- Dataset
- JSON
Multi-Lingual

The Multi-Lingual dataset is a benchmark for scene text recognition, containing 9,000 images with 9 languages.
- Dataset
- JSON
ICDAR 2013

ICDAR 2013 consists of 229 training images and 233 testing images, and similar to ICDAR 2015, it also provides "Strong", "Weak" and "Generic" lexicons for text spotting task....
- Dataset
- JSON
ICDAR 2017 MLT

ICDAR 2017 MLT is a large scale multi-lingual text dataset, which includes 7200 training images, 1800 validation images and 9000 testing images. The dataset is composed of...
- Dataset
- JSON
CUTE

CUTE is released by Risnumawan et al. There are only 288 word images in this dataset, but most of them are seriously curved.
- Dataset
- JSON
SVT

SVT is a very challenging dataset collected by Wang et al. from the Google Street View.
- Dataset
- JSON
MJSynth

The OCR and MT datasets are used to train the OCR and MT models respectively.
- Dataset
- JSON
ICDAR2015

ICDAR2015 dataset consists of 1,670 images (17,548 annotated text regions) acquired using the Google Glass.
- Dataset
- JSON
ICDAR2013

ICDAR2013 dataset is obtained from the Robust Reading Challenges 2013.
- Dataset
- JSON
Total-Text

Total-Text is a dataset for word-level arbitrary-shaped English text detection, containing 1,255 images for training and 300 images for testing.
- Dataset
- JSON

32 datasets found