Object Detection

Off-the-Shelf Datasets

Computer vision data for locating instances of objects in images or videos.

We also have video, audio, image, and text datasets available.

Cityscapes Dataset

Cityscapes is a large-scale urban street-scene dataset with stereo video and high-quality pixel-level annotations, built for benchmarking semantic segmentation, instance segmentation, and panoptic scene understanding for autonomous driving and smart-city computer vision.

Activity Detection

Audio-visual emotion recognition

These expressions are produced at two levels of emotional intensity (regular and strong), except for the neutral emotion, which is produced only at regular intensity.

Visual question-answering tasks

A dataset of open-ended questions about images. Answering them requires an understanding of vision, language, and commonsense knowledge.

Spanish (Mexico) OCR Images Data - Images with Transcription

The data can be used for tasks such as character recognition in multiple scenes.

Arabic OCR Images Data - Images with Transcription

The data can be used for tasks such as character recognition in multiple scenes.

UK-English OCR Images Data - Images with Transcription

The data can be used for tasks such as character recognition in multiple scenes.

Mandarin OCR Images Data - Images with Transcription

The data can be used for tasks such as character recognition in multiple scenes.

Japanese OCR Images Data - Images with Transcription

The data can be used for tasks such as character recognition in multiple scenes.

German OCR Images Data - Images with Transcription

The data can be used for tasks such as character recognition in multiple scenes.

Spanish (ESP) OCR Images Data - Images with Transcription

The data can be used for tasks such as character recognition in multiple scenes.

Urdu OCR Images Data - Images with Transcription

The data can be used for tasks such as character recognition in multiple scenes.

Vietnamese OCR Images Data - Images with Transcription

The data can be used for tasks such as character recognition in multiple scenes.

Hindi OCR Images Data - Images with Transcription

The data can be used for tasks such as character recognition in multiple scenes.