Lip Reading in the Wild (LRW)

The package, including the videos and the metadata, is available for non-commercial academic research.
Files: 1000
Size:
Format: wav
Duration:
Country: Worldwide
Participants: 100
Languages:
Updated: January 27, 2023

Description

The dataset consists of up to 1000 utterances of each of 500 different words, spoken by hundreds of different speakers. Every video is 29 frames long (1.16 seconds, which corresponds to 25 frames per second), and the word occurs in the middle of the video. The metadata gives the word duration, from which you can determine the start and end frames of the word.
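
As a rough illustration of how the start and end frames could be derived from the word duration, here is a minimal sketch. It assumes the duration is available as a number of seconds, that the word is exactly centred in the 29-frame clip, and that the frame rate is 25 fps (29 frames over 1.16 seconds). The function name word_frame_span and the rounding choice are hypothetical and not part of the dataset's tooling.

```python
FPS = 25          # assumption: 29 frames / 1.16 s = 25 frames per second
NUM_FRAMES = 29   # every clip has the same length


def word_frame_span(duration_s, num_frames=NUM_FRAMES, fps=FPS):
    """Return (start, end) as 0-based, inclusive frame indices of the word.

    Assumes the word is centred in the clip, as the description states.
    """
    if duration_s <= 0:
        raise ValueError("duration must be positive")
    # Convert the duration to a whole number of frames, clamped to the clip.
    n_word = min(num_frames, max(1, round(duration_s * fps)))
    # Split the remaining frames evenly before and after the word.
    start = (num_frames - n_word) // 2
    end = start + n_word - 1
    return start, end


# A 0.4 s word covers 10 frames, centred in the 29-frame clip.
print(word_frame_span(0.4))  # (9, 18)
```

When the number of leftover frames is odd, this sketch places the extra frame after the word; a different convention would shift the span by one frame.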

Version Info

Version:
Last updated: January 27, 2023
Owner:

Dataset Technical Specification

Number of files: 1000
Total dataset size:
Duration:
Format: wav
Sample rate:
Resolution:

Dataset Demographics

📍 Country: Worldwide
🧍 Gender: M/F 50-50%
📅 Age: 18-55
👥 Number of participants: 100

🛡️ Consent & Compliance