- Multi-Lingual
The Multi-Lingual dataset is a benchmark for scene text recognition, containing 9,000 images in 9 languages.
- Total-Text
Total-Text is a dataset for word-level arbitrary-shaped English text detection, containing 1,255 images for training and 300 images for testing.