Arabic (Saudi Arabia) language conversational telephony

The dataset is fully transcribed and timestamped.
Files: 10
Size:
Format:
Duration: 2 hours
Country: Saudi Arabia
Participants: 2
Languages:
Updated: January 12, 2023

Description

These conversations were recorded using the phone conversation module of the Boom app. The dataset includes 2 hours of unscripted speech (i.e., natural conversation) between two speakers, fully transcribed and time-stamped. Each conversation is at least 15 minutes long. The transcription is divided into time-stamped segments of at most 15 seconds each. Each segment indicates the speaker, the start and end times of the segment, and additional information about the segment.
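The constraints in the description can be checked mechanically once the transcripts are loaded. The following is a minimal sketch only: the listing does not state the transcript file format or its field names, so the `Segment` class, its fields (`speaker`, `start`, `end`, `text`, `note`), and the `check_conversation` helper are all hypothetical. Only the 15-second segment limit, the 15-minute minimum, and the two-speaker count come from the description.

```python
from dataclasses import dataclass

MAX_SEGMENT_SECONDS = 15.0          # from the description
MIN_CONVERSATION_SECONDS = 15 * 60  # from the description


@dataclass
class Segment:
    # All field names are assumptions; adapt them to the real transcript files.
    speaker: str    # speaker label
    start: float    # segment start, in seconds from the start of the call
    end: float      # segment end, in seconds from the start of the call
    text: str       # transcribed speech
    note: str = ""  # stand-in for the "additional information" field


def check_conversation(segments: list[Segment]) -> list[str]:
    """Return a list of problems found; an empty list means all checks passed."""
    problems = []
    for i, seg in enumerate(segments):
        duration = seg.end - seg.start
        if duration <= 0:
            problems.append(f"segment {i}: end is not after start")
        elif duration > MAX_SEGMENT_SECONDS:
            problems.append(f"segment {i}: {duration:.1f}s exceeds 15s limit")

    speakers = {seg.speaker for seg in segments}
    if len(speakers) > 2:
        problems.append(f"expected at most 2 speakers, found {len(speakers)}")

    if segments:
        # Approximates conversation length by the span covered by segments;
        # leading or trailing silence in the audio is not counted.
        span = max(s.end for s in segments) - min(s.start for s in segments)
        if span < MIN_CONVERSATION_SECONDS:
            problems.append(f"conversation spans {span:.0f}s, under 15 minutes")
    return problems
```

Calling `check_conversation` on the segments of one conversation returns an empty list when every segment is at most 15 seconds, no more than two speakers appear, and the segments cover at least 15 minutes.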

Version Info

Version:
Last updated:
Owner:

Dataset Technical Specification

Number of files: 10
Total dataset size:
Duration: 2 hours
Format:
Sample rate:
Resolution:

Dataset Demographics

📍 Country: Saudi Arabia
🧍 Gender: M/F
📅 Age: 18-52
👥 Number of participants: 2

🛡️ Consent & Compliance