Table 1 Comparison of existing speech MRI datasets involving moving vocal tract.
Dataset | The number of speakers | Language studied | Type of imaging data provided | The availability of raw data |
|---|---|---|---|---|
Narayanan, S. et al.16 | 10 (5 f, 5 m) | American English | RT-MRI with synchronized audio Electromagnetic articulography | No |
Kim, J. et al.17 | 10 (5 f, 5 m) | American English | RT-MRI with synchronized audio | No |
Töger, J. et al.18 | 8 (4 f, 4 m) | American English | RT-MRI with synchronized audio Static T2w MRI | No |
Sorensen, T. et al.19 | 17 (9 f, 8 m) | American English | RT-MRI with synchronized audio 3D volumetric MRI | No |
Douros, I. et al.20 | 2 (2 m) | French | RT-MRI with synchronized audio 3D volumetric MRI | No |
This dataset52 | 75 (40 f, 35 m) | American English | RT-MRI with synchronized audio 3D volumetric MRI Static T2w MRI | Yes |