OCR Images

Off-the-Shelf Datasets

CityScapes Dataset
Cityscapes is a large-scale urban street-scene dataset with stereo video and high-quality pixel-level annotations, built for benchmarking semantic segmentation, instance segmentation, and panoptic scene understanding for autonomous driving and smart-city computer vision.
Visual question-answering tasks
Dataset containing open-ended questions about images. These questions require an understanding of vision, language and commonsense knowledge to answer.
Spanish (Mexico) OCR Images Data - Images with Transcription
The data can be used for tasks such as character recognition in multiple scenes.
Arabic OCR Images Data - Images with Transcription
The data can be used for tasks such as character recognition in multiple scenes.
UK-English OCR Images Data - Images with Transcription
The data can be used for tasks such as character recognition in multiple scenes.
Mandarin OCR Images Data - Images with Transcription
The data can be used for tasks such as character recognition in multiple scenes.
Japanese OCR Images Data - Images with Transcription
The data can be used for tasks such as character recognition in multiple scenes.
German OCR Images Data - Images with Transcription
The data can be used for tasks such as character recognition in multiple scenes.
Spanish (ESP) OCR Images Data - Images with Transcription
The data can be used for tasks such as character recognition in multiple scenes.
Urdu OCR Images Data - Images with Transcription
The data can be used for tasks such as character recognition in multiple scenes.
Vietnamese OCR Images Data - Images with Transcription
The data can be used for tasks such as character recognition in multiple scenes.
Hindi OCR Images Data - Images with Transcription
The data can be used for tasks such as character recognition in multiple scenes.