Abstract
Cross-linguistic studies on tone and phonation have revealed the role of laryngeal phonatory settings in tonal contrasts. However, systematic research on how individuals within the same speech community achieve multidimensional tonal distinction remains lacking. Hmu, an Eastern Hmongic language, is typologically notable for its five level tones, offering ideal material for examining the interaction between pitch and phonation. Based on acoustic and EGG data collected from 30 speakers, this study investigates the differentiation strategies of the five level tones at both the group and individual levels. The results reveal that T11 differs significantly from the other four level tones across acoustic and EGG parameters, characterized by a larger spectral tilt and a higher noise level, aligning with the properties of breathy voice. In contrast, the other four level tones generally exhibit a smaller spectral tilt and lower noise, consistent with the characteristics of modal voice. Among them, T55, due to its high F0, may be further identified as a high-pitched voice. Individually, native speakers show variation in how they utilize phonations to encode linguistic contrast in T11, with three primary subtypes observed: breathy, harsh, and near-modal voice. Significantly, the non-modal phonation associated with T11 does not extend across the entire vowel but is primarily concentrated in the first third. We also found that an individual’s pitch range may be one factor influencing the number of acoustic cues they use when distinguishing tones. Speakers with a narrower pitch range usually employ non-modal phonations. This study provides empirical evidence that tonal contrast is multidimensional and offers a referential analysis method for future investigations into individual variation in phonation type.
Similar content being viewed by others
Data availability
The data pertinent to this study can be found in the Supplementary Information section. For detailed raw data, interested teams may contact the corresponding author with reasonable requests.
References
Andruski JE, Ratliff M (2000) Phonation types in production of phonological tone: the case of Green Mong. J Int Phon Assoc 30:37–61. https://doi.org/10.1017/S0025100300006654
Baken RJ, Orlikoff RF (2000) Clinical measurement of speech and voice. Singular Publishing Group, San Diego
Becker-Kristal R (2010) Acoustic typology of vowel inventories and dispersion theory: Insights from a large cross-linguistic corpus. University of California Press, Los Angeles
Bishop J, Keating P (2012) Perception of pitch location within a speaker’s range: Fundamental frequency, voice quality and speaker sex. J Acoust Soc Am 132:1100–1112. https://doi.org/10.1121/1.4714351
Blankenship B (1997) The time course of breathiness and laryngealization in vowels. Dissertation, University of California
Blankenship B (2002) The timing of nonmodal phonation in vowels. J Phon 30:163–191. https://doi.org/10.1006/jpho.2001.0155
Brunelle M, Kirby J (2016) Tone and phonation in Southeast Asian languages. Lang Linguist Compass 10:191–207. https://doi.org/10.1111/lnc3.12182
Chai Y, Garellek M (2022) On H1−H2 as an acoustic measure of linguistic phonation type. J Acoust Soc Am 152:1856–1870. https://doi.org/10.1121/10.0014175
Chao Y (1948) Mandarin primer: An intensive course in spoken Chinese. Harvard University Press, Cambridge
Chen C, Havenhill J (2025) Harsh voice and its interaction with vowel quality in Fuzhou Min Chinese. J Acoust Soc Am 157:2582–2602. https://doi.org/10.1121/10.0036256
Krom G (1993) A cepstrum-based technique for determining harmonics-to-noise ratio in speech signals. J Speech Lang Hear Res 36:254–266. https://doi.org/10.1044/jshr.3602.254
Denning K (1989) The diachronic development of phonological voice quality, with special reference to Dinka and the other Nilotic languages. Dissertation, Stanford University
DiCanio CT (2009) The phonetics of register in Takhian Thong Chong. J Int Phon Assoc 39:162–188. https://doi.org/10.1017/S0025100309003879
Edmondson JA, Esling JH (2006) The valves of the throat and their functioning in tone, vocal register and stress: laryngoscopic case studies. Phonology 23:157–191. https://doi.org/10.1017/S095267570600087X
Esling JH, Harris JG (2005) States of the glottis: An articulatory phonetic model based on laryngoscopic observations. In: Hardcastle WJ, Beck JM (eds) A figure of speech: A festschrift for John Laver. Erlbaum, Mahwah, p 347–383
Esling JH, Moisik SR, Benner A, Crevier-Buchman L (2019) Voice quality: The laryngeal articulator model. Cambridge University Press, Cambridge
Esposito CM (2012) An acoustic and electroglottographic study of White Hmong tone and phonation. J Phon 40:466–476. https://doi.org/10.1016/j.wocn.2012.02.007
Esposito CM, Khan SD (2020) The cross-linguistic patterns of phonation types. Lang Linguist Compass 14: e12392. https://doi.org/10.1111/lnc3.12392
Fabre P (1957) Un procede electrique d inscription de I accolement glottique aucours de laphonation: glottographie de haute frequence. Bulletin de l’Académie nationale de médecine 141:66–69
Flemming E (1995) Audio representations in phonology. Dissertation, University of California
Fourcin AJ, Abberton E (1971) First applications of a new laryngograph. Medical and Biological Illustration 21:172–182
Gao X, Kuang J (2022) Phonation variation as a function of checked syllables and prosodic boundaries. Language 7(3):171. https://doi.org/10.3390/languages7030171
Garellek M (2019) The phonetics of voice. In: Katz WF, Assmann PF (eds) Routledge handbook of phonetics. Routledge, Oxford, p 75–106
Garellek M (2020) Acoustic discriminability of the complex phonation system in !Xóõ. Phonetica 77:131–160. https://doi.org/10.1159/000494301
Garellek M (2022) Theoretical achievements of phonetics in the 21st century: phonetics of voice quality. J Phon 94: 101155. https://doi.org/10.1016/j.wocn.2022.101155
Garellek M, Chai Y, Huang Y, Van Doren M (2021) Voicing of glottal consonants and non-modal vowels. J Int Phon Assoc 53:305–332. https://doi.org/10.1017/S0025100321000116
Garellek M, Keating P (2011) The acoustic consequences of phonation and tone interactions in Jalapa Mazatec. J Int Phon Assoc 41:185–205. https://doi.org/10.1017/S0025100311000193
Garellek M, Keating P, Esposito CM, Kreiman J (2013) Voice quality and tone identification in White Hmong. J Acoust Soc Am 133:1078–1089. https://doi.org/10.1121/1.4773259
Garellek M, Ritchart A, Kuang J (2016) Breathy voice during nasality: A cross-linguistic study. J Phon 59:110–121. https://doi.org/10.1016/j.wocn.2016.09.001
Gerfen C, Baker K (2005) The production and perception of laryngealized vowels in Coatzospan Mixtec. J Phon 33:311–334. https://doi.org/10.1016/j.wocn.2004.11.002
Gerratt BR, Kreiman J (2001) Toward a taxonomy of nonmodal phonation. J Phon 29:365–381. https://doi.org/10.1006/jpho.2001.0149
Gordon M, Ladefoged P (2001) Phonation types: a cross-linguistic overview. J Phon 29:383–406. https://doi.org/10.1006/jpho.2001.0147
Hillenbrand J, Cleveland RA, Erickson RL (1994) Acoustic correlates of breathy vocal quality. J Speech Lang Hear Res 37:769–778. https://doi.org/10.1044/jshr.3704.769
Holmberg EB, Hillman RE, Perkell JS, Guiod P, Goldman SL (1995) Comparisons among aerodynamic, electroglottographic, and acoustic spectral measures of female voice. J Speech Lang Hear Res 38:1212–1223. https://doi.org/10.1044/jshr.3806.1212
Howard DM (1995) Variation of electrolaryngographically derived closed quotient for trained and untrained adult female singers. J Voice 9:163–172. https://doi.org/10.1016/S0892-1997(05)80250-4
Huffman MK (1987) Measures of phonation types in Hmong. J Acoust Soc Am 81:495–504. https://doi.org/10.1121/1.394915
Iseli M, Shue Y, Alwan A (2007) Age, sex, and vowel dependencies of acoustical measures related to the voice source. J Acoust Soc Am 121:2283–2295. https://doi.org/10.1121/1.2697522
Jessen M, Roux JC (2002) Voice quality differences associated with stops and clicks in Xhosa. J Phon 30:1–52. https://doi.org/10.1006/jpho.2001.0150
Keating P, Esposito C, Garellek M, Khan SD, Kuang J (2011) Phonation contrasts across languages. Paper presented at the Proceedings of the 17th International Congress of Phonetic Sciences, ICPhS, Hong Kong, 1046–1049
Keating P, Esposito CM, Garellek M, Khan SD, Kuang J (2010) Phonation contrasts across languages. UCLA Working Papers in Phonetics 108:188–202
Keating P, Garellek M, Kreiman J (2015) Acoustic properties of different kinds of creaky voice. Paper presented at the Proceedings of the 18th International Congress of Phonetic Sciences, ICPhS, Glasgow, (2–7)
Keating P, Kuang J, Garellek M, Esposito CM, Khan SD (2023) A cross-language acoustic space for vocalic phonation distinctions. Language 99:351–389. https://doi.org/10.1353/lan.2023.a900090
Klatt DH, Klatt LC (1990) Analysis, synthesis, and perception of voice quality variations among female and male talkers. J Acoust Soc Am 87:820–857. https://doi.org/10.1121/1.398894
Kong J (1992) Acoustic and perceptual studies on the five-level tones of Ziyun Miao. In XMa JWang (Eds.), A new preliminary study of ethnic languages, Sichuan Ethnic Publishing House, Chengdu, 152–163
Kong J (2001) On language phonation. Central University for Nationalities Press, Beijing
Kreiman J, Gerratt BR, Antoñanzas-Barroso N (2007) Measures of the glottal source spectrum. J Speech Lang Hear Res 50:595–610. https://doi.org/10.1044/1092-4388(2007/042)
Kreiman J, Gerratt BR, Garellek M, Samlan R, Zhang Z (2014) Toward a unified theory of voice production and perception. Loquens 1:e009. https://doi.org/10.3989/loquens.2014.009
Kuang J (2013) The tonal space of contrastive five level tones. Phonetica 70:1–23. https://doi.org/10.1159/000353853
Kuang J, Cui A (2018) Relative cue weighting in production and perception of an ongoing sound change in Southern Yi. J Phon 71:194–214. https://doi.org/10.1016/j.wocn.2018.09.002
Kuang J, Keating P (2014) Vocal fold vibratory patterns in tense versus lax phonation contrasts. J Acoust Soc Am 136:2784–2797. https://doi.org/10.1121/1.4896462
Kuznetsova A, Brockhoff PB, Christensen RHB (2017) lmerTest Package: Tests in Linear Mixed Effects Models. J Stat Softw 82:1–26. https://doi.org/10.18637/jss.v082.i13
Ladefoged P (1971) Preliminaries to linguistic phonetics. University of Chicago, Chicago
Ladefoged P (1983) The linguistic use of different phonation types. In DBless JAbbs (Eds.), Vocal fold physiology: Contemporary research and clinical issues. College-Hill Press, San Diego CA
Ladefoged P, Maddieson I (1996) The Sounds of the World’s Languages. Blackwell, Oxford
Laver J (1980) The phonetic description of voice quality. Cambridge University Press, Cambridge
Lee Y, Keating P, Kreiman J (2019) Acoustic voice variation within and between speakers. J Acoust Soc Am 146:1568–1579. https://doi.org/10.1121/1.5125134
Liljencrants J, Lindblom B (1972) Numerical simulation of vowel quality systems: The role of perceptual contrast. Language 48:839–862. https://doi.org/10.2307/411991
Lindblom B (1986) Phonetic universals in vowel systems. In JJOhala JJJaeger (Eds.), Experimental phonology, Academic Press, 13–44
Lindblom B (1990) Phonetic content in phonology. Phonetic Experimental Research at the Institute of Linguistics University of Stockholm 11:100–118
Lindblom B, Maddieson I (1988) Phonetic universals in consonant systems. In LHyman CNLi (Eds.), Language, speech, and mind: Studies in honour of Victoria AFromkin, Routledge, 62–78
Liu W (2020) A perceptual study on the five level tones in Hmu (Xinzhai variety). Paper presented at the Proceedings of Interspeech 2020, Shanghai, China, 1620–1623
Liu W (2021) Physiological and physical basis of voice quality and its linguistic value. Essays on linguistics 63:204–233
Liu W, Lin Y-J, Yang Z, Kong J (2020) Hmu (Xinzhai variety). J Int Phon Assoc 50:240–257. https://doi.org/10.1017/S0025100318000336
Liu W, Peng G, Kong J (2024) The role of breathy voice in Hmu tone perception. J Chin Linguist 52:138–174. https://doi.org/10.1353/jcl.2024.a919401
Liu W, Wang F, Kong J (2019) An acoustic study on the phonation variations of tones in Bai (Beiwuliqiao variety). Contemporary Linguistics 1:119–138
Maddieson I (1978) Universals of tone. In JHGreenberg CAFerguson, EAMoravcsik (Eds.), Universals of human language: Vol. 2. Phonology, Stanford University Press, Stanford CA, 335–365
Maddieson I, Ladefoged P (1985) Tense and lax in four minority languages of China. J Phon 13:433–454. https://doi.org/10.1016/S0095-4470(19)30788-0
Miller AL (2007) Guttural vowels and guttural co-articulation in Ju|’hoansi. J Phon 35:56–84. https://doi.org/10.1016/j.wocn.2005.11.001
Moisik SR (2013) Harsh voice quality and its association with blackness in popular american media. Phonetica 69:193–215. https://doi.org/10.1159/000351059
Moisik SR, Czaykowska-Higgins E, Esling JH (2021) Phonological potentials and the lower vocal tract. J Int Phon Assoc 51:1–35. https://doi.org/10.1017/S0025100318000403
Moisik SR, Esling JH (2014) Modeling the biomechanical influence of epilaryngeal stricture on the vocal folds: A low-dimensional model of vocal–ventricular fold coupling. J Speech Lang Hear Res 57:S687–S704. https://doi.org/10.1044/2014_JSLHR-S-12-0279
Moisik SR, Esling JH, (2011) The ‘whole larynx’ approach to laryngeal features. In Proceedings of the 17th International Congress of Phonetic Sciences, ICPhS, Hong Kong, 1406–1409
Moisik SR, Lin H, Esling JH (2014) A study of laryngeal gestures in Mandarin citation tones using simultaneous laryngoscopy and laryngeal ultrasound (SLLUS). J Int Phon Assoc 44:21–58. https://doi.org/10.1017/S0025100313000327
Rothenberg M, Mahshie JJ (1988) Monitoring vocal fold abduction through vocal fold contact area. J Speech Lang Hear Res 31:338–351. https://doi.org/10.1044/jshr.3103.338
Schwartz J-L, Boë L-J, Vallée N, Abry C (1997a) Major trends in vowel system inventories. J Phon 25:233–253. https://doi.org/10.1006/jpho.1997.0044
Schwartz J-L, Boë L-J, Vallée N, Abry C (1997b) The dispersion-focalization theory of vowel systems. J Phon 25:255–286. https://doi.org/10.1006/jpho.1997.0043
Shue Y-L Keating P, Vicenik C, Yu KM (2011) VoiceSauce: A program for voice analysis. In Proceedings of the 17th International Congress of Phonetic Sciences, ICPhS, Hong Kong, 1846-1849
Silverman D (1997) Phasing and recoverability. Garland Publishing, New York
Silverman D, Blankenship B, Kirk PL, Ladefoged P (1995) Phonetic structures in Jalapa Mazatec. Anthropol Linguist 37:70–88. https://www.jstor.org/stable/30028043
Simpson AP (2012) The first and second harmonics should not be used to measure breathiness in male and female voices. J Phon 40:477–490. https://doi.org/10.1016/j.wocn.2012.02.001
Stevens KN (1977) Physics of laryngeal behavior and larynx modes. Phonetica 34:264–279. https://doi.org/10.1159/000259885
Tạ, TT, Brunelle M, Nguyễn TQ (2022) Voicing and register in Ngãi Giao Chrau: Production and perception studies. J Phon 90: 101115. https://doi.org/10.1016/j.wocn.2021.101115
Tabachnick B, Fidell L (2013) Using multivariate statistics. Pearson Education Inc, Boston MA
Tehrani H (2009) EGGWorks: a program for automated analysis of EGG signals. http://www.linguistics.ucla.edu/faciliti/facilities/physiology/Egg.WorksSetup.exeS
Tian J, Kuang J (2021) The phonetic properties of the non-modal phonation in Shanghainese. J Int Phon Assoc 51:202–228. https://doi.org/10.1017/S0025100319000148
Traill A (1985) Phonetic and phonological studies of the !Xóõ Bushmen. Hamburg, Buske
Traill A (1986) The laryngeal sphincter as a phonatory mechanism in !Xóõ Bushman. In RSinger JKLundy (Eds.), Variation, culture and evolution in African populations: Papers in honor of Dr. Hertha de Villiers, Witwatersrand University Press, Johannesburg, 123–131
Wayland R, Jongman A (2003) Acoustic correlates of breathy and clear vowels: The case of Khmer. J Phon 31:181–201. https://doi.org/10.1016/S0095-4470(02)00086-4
Weinreich U, Labov W, Herzog M (1968) Empirical foundations for a theory of language change. University of Texas Press, Austin
Zhu X (2012) Multiregisters and four levels: A new tonal model. J Chin Linguist 40:1–17. https://www.jstor.org/stable/23754196
Frokjær-Jensen B, Thorvaldsen P (1968) Construction of a Fabre Glottograph. ARIPUC 3:1–8. https://doi.org/10.7146/aripuc.v3i.130687
Li X, Wang F (2016) A preliminary study of variations of phonation features in Meiba Bai. Essays on Linguistics 54:179–196
Acknowledgements
This work is supported by National Social Science Foundation (No. 22CYY022) and Jiangsu Oral Culture Corpus Transcription Project (No. HXSK 2023003). We would like to give our thanks to all participants for their excellent help in Xinzhai village with the recording, especially for Zhenghui Yang for his assistance.
Author information
Authors and Affiliations
Contributions
Wen Liu: Conceptualization, Methodology, Investigation, Data curation, Writing – original draft, Writing – review and editing, Wording, Funding acquisition. Nianhan Hou: Writing – original draft, Writing – review and editing, Formal analysis, Visualization. Hao Tang: Conceptualization, Writing – review and editing, Funding acquisition.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval
This study was approved by the Ethics Committee of Shandong University (Approval No. SDU-2021-307) on December 10, 2021. All procedures involving human participants were conducted in accordance with the ethical standards of the Declaration of Helsinki.
Informed consent
This study obtained written informed consent from all participants on August 17, 2022, prior to their enrollment. All participants were clearly informed of the study’s purpose, procedures, data usage methods, and participants’ rights (including the principle of voluntary participation and the right to withdraw unconditionally at any time). Consent forms were documented in writing and collected directly from adult participants with independent legal capacity to consent. No personally identifiable information was recorded or disclosed in any data. This study did not involve any vulnerable populations, and all participants provided their consent voluntarily.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Liu, W., Hou, N. & Tang, H. Individual differences in phonation types and their interaction with pitch range: Evidence from the five level tones in Hmu. Humanit Soc Sci Commun (2026). https://doi.org/10.1057/s41599-026-07071-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1057/s41599-026-07071-9


