Visual question-answering tasks

Dataset containing open-ended questions about images. These questions require an understanding of vision, language and commonsense knowledge to answer.
Files: 50
Format: jpg
Country: Worldwide
Participants: 20
Updated: January 27, 2023

Description

To capture whether a "question makes sense", we explained to the workers (and conducted qualification tests to make sure that they understood) that any premise assumed in the question must hold true for the image they select. For instance, the question "What is the woman doing?" assumes that a woman is present and can be seen in the image. It does not make sense to ask this question about an image in which no woman is visible.

Licence

Dataset Technical Specification

Number of files: 50
Format: jpg

Dataset Demographics

Country: Worldwide
Number of participants: 20
