Table 1 Comparison of AUSpeech with other different databases in terms of resolution, duration, tasks, etc.
From: An Audio-Ultrasound Synchronized Database of Tongue Movement for Mandarin speech
Datasets | No. of speakers | Type | Resolution | Modalities | Language | Duration | Task |
|---|---|---|---|---|---|---|---|
TAL | 82 | Normal | 64 × 842 | Ultrasound, Lip Videos, Audio | English | 13.5 hours(audio) | Sentence, Non-words |
UltraSuite | 86 | Normal/Dysarthria (Children) | 63 × 412 | Ultrasound, Audio, Text | English | 18.67 hours(audio) | Words, Sentence, Non-words |
SSR7000 | 1 | Normal | 640 × 445 | Ultrasound, Lip Videos | English | 7484 samples | Sentence |
AUspeech | 54 | Normal/Dysarthria (Adult) | 920 × 700 | Ultrasound, Audio, Text | Mandarin | 22.31 hours | Vowels, Monosyllable, Sentence, Non-words |