A dataset of videos of talking faces with transcriptions

Video Datasets

Data were collected from 100 subjects, yielding over thousand instances of synchronized data

Download Dataset Download Sample Request Quote Request Sample

Files

1000

Size

Format

wav

Duration

Country

Worldwide

Participants

100

Languages

Updated

January 27, 2023

Description

A large-scale multimodal dataset developed to support machine learning research in contexts that utilize a combination of thermal, visual, and audio data streams; examples include human–computer interaction, biometric authentication, recognition systems, domain transfer, and speech recognition.

Dataset Technical Specification

Number of files:

1000

Total dataset size:

Duration:

Format:

wav

Sample rate:

Resolution:

Dataset Demographics

📍 Country:

Worldwide

🧍 Gender:

M/F 50-50%

📅 Age:

18-55

👥 Number of participants:

100

🛡️ Consent & Compliance

Download Dataset Download Sample Request Quote Request Sample

A dataset of videos of talking faces with transcriptions

Description

Sample Download

Licence

Version Info

Dataset Technical Specification

Dataset Demographics

🛡️ Consent & Compliance

A dataset for lipreading using sequences of video frames

A dataset of video clips with spoken and visual attributes

Lip Reading in the Wild (LRW)

A dataset of videos of talking faces with transcriptions

Fire Videos Data

European License Plate Recognition

AI Solutions

Resources

Hire Experts

A dataset of videos of talking faces with transcriptions

Description

Sample Download

Licence

Version Info

Dataset Technical Specification

Dataset Demographics

🛡️ Consent & Compliance

Related Datasets

A dataset for lipreading using sequences of video frames

A dataset of video clips with spoken and visual attributes

Lip Reading in the Wild (LRW)

A dataset of videos of talking faces with transcriptions

Fire Videos Data

European License Plate Recognition

AI Solutions

Resources

Hire Experts