Multimodal corpora are data collections used to study how two or more modalities interface with one another in human communication.
IFA Dialog Video corpus[edit | edit source]
This corpus contains annotated video recordings of friendly face-to-face dialogues. It is modelled on the face-to-face dialogues in the Spoken Dutch Corpus (CGN). The procedures and design of the corpus were adapted to make this corpus useful for other researchers of Dutch speech. For this corpus 20 dialogue conversations of 15 minutes were recorded and annotated, in total 5 hours of speech. To stay close to the face-to-face dialogues in the CGN, pairs of well-acquainted participants were selected, either good friends, relatives, or long-time colleagues. The participants were allowed to talk about any topic they wanted.