The segments vary in length from 3 to 10 seconds. In each clip, the only visible face in the video and the only audible sound in the soundtrack belong to a single speaking person. In total, the dataset contains roughly 20 hours of video segments with approximately 40 distinct speakers, spanning a wide variety of people, languages and face poses.
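The description above can be turned into a simple sanity check on a local copy of the data. The following is a minimal sketch, not the dataset's own tooling: the `Segment` record, its field names, and the `summarize` helper are hypothetical, and the dataset's actual metadata format is not specified here. It keeps only clips in the stated 3 to 10 second range and reports the clip count, total hours, and number of distinct speakers.

```python
from dataclasses import dataclass

# Duration bounds taken from the dataset description (seconds).
MIN_SECONDS = 3.0
MAX_SECONDS = 10.0


@dataclass
class Segment:
    """Hypothetical record for one clip; field names are assumptions."""
    speaker_id: str
    start: float  # clip start time in seconds
    end: float    # clip end time in seconds

    @property
    def duration(self) -> float:
        return self.end - self.start


def summarize(segments):
    """Keep clips within the stated duration range and total them up."""
    valid = [s for s in segments if MIN_SECONDS <= s.duration <= MAX_SECONDS]
    total_hours = sum(s.duration for s in valid) / 3600.0
    speakers = {s.speaker_id for s in valid}
    return {"clips": len(valid), "hours": total_hours, "speakers": len(speakers)}


# Made-up example data: the second clip is only 2 seconds long and is dropped.
example = [
    Segment("spk_a", 0.0, 4.5),
    Segment("spk_b", 10.0, 12.0),
    Segment("spk_a", 20.0, 29.0),
]
stats = summarize(example)
print(stats["clips"], stats["speakers"])
```

Run against the full metadata, the reported hours and speaker count can be compared with the approximate figures quoted above.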