Abstract
Recent years have witnessed increasing interest in the social meanings of non-modal voice qualities, but most existing studies focus on English, especially in the North American context. This paper reports a perceptual study of the social meanings of creaky voice in Mandarin Chinese for mainland Chinese listeners. The study used a large set of resynthesized stimuli including multiple talkers and pairs of utterances differing only in voice quality (creaky vs. modal). Sixty Mandarin listeners completed a social perception experiment in which they collectively evaluated 38 talkers (presented in creaky or modal voice quality) on four socio-demographic dimensions (age, gender, sexuality, education) and 19 traits related to personality (e.g., confident, genuine, pretentious) and communicative style (e.g., engaging). Results of a factor analysis and mixed-effects models indicated multiple effects of creaky voice on the perception of talker age, gender, and warmth; further, these effects interacted with both talker gender and listener gender, in ways that often differed from previously documented patterns for North American English. These findings shed light on the multifaceted indexicality of creaky voice in Mandarin and contribute to mounting evidence of crosslinguistic and crosscultural variation in the social meanings of non-modal voice qualities.
Similar content being viewed by others
Data availability
Datasets and supplementary files related to the current study are available in the OSF repositories https://osf.io/srp7u and https://osf.io/cejgf. The repositories include: (1) datasets and statistical analysis reports (including R scripts and full model outputs) that allow readers to replicate the analyses reported in the paper; (2) the list of questions (with annotations) in the social perception experiment; (3) the list of sentences used to elicit natural speech recordings; and (4) a description of the resynthesis process using a Klatt synthesizer.
References
Abdelli-Beruh NB, Drugman T, Red Owl RH (2016) Occurrence frequencies of acoustic patterns of vocal fry in American English speakers. J Voice 30(6):759.e11–759.e20
Agathe B, Claire P. (2013) The influence of language and speech task upon creaky voice use among six young American women learning French. In: Bimbot F, Cerisara C, Fougeron C et al. (eds). Proceedings of the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013). International Speech Communication Association, Lyon, France, p 2395–2399
Anderson RC, Klofstad CA, Mayew WJ (2014) Vocal fry may undermine the success of young women in the labor market. PLoS ONE 9(5):e97506
Batliner A, Burger S, Johne B. et al. (1994) MÜSLI: a classification scheme for laryngealizations. In: Proceedings of the ESCA Workshop on Prosody, Lund, Sweden
Belotel-Grenié A, Grenié M (2004) The creaky voice phonation and the organisation of Chinese discourse. In: Proceedings of the 1st International Symposium on Tonal Aspects of Languages (TAL 2004). International Speech Communication Association, Beijing, China, p 5–8
Borrie SA, Delfino CR (2017) Conversational entrainment of vocal fry in young adult female American English speakers. J Voice 31(4):513.e25–513.e32
Brown J, Sonderegger M (2024) Creaky voice variation across language, gender and age in Canadian English–French bilingual speech. In: Cho, T., Kim, S., Holliday, J. et al (eds) Proceedings of the 19th Conference on Laboratory Phonology. Hanyang Institute for Phonetics and Cognitive Sciences of Language, Seoul, South Korea, p 309–310
Brown J, Sonderegger M (2025) A sociophonetic study of creaky voice across language, gender and age in Canadian English-French bilinguals. J Phonetics 112:101431
Brown KM, Dahl KL, Cler GJ (2021) Listener age and gender diversity: Effects on voice-based perception of gender. J Voice 35(5):739–745
Brown P, Levinson S.C. (1978) Universals in language usage: Politeness phenomena. In: Goody, E.N. (ed) Questions and Politeness: Strategies in Social Interaction. Cambridge University Press, Cambridge, UK, p 56–311
Callier P (2010) Voice quality, rhythm and valorized femininities, poster presented at Sociolinguistics Symposium 18, Southampton, UK
Cantor-Cutiva LC, Bottalico P, Webster J (2023) The effect of bilingualism on production and perception of vocal fry. J Voice 37(6):970.e1–e10
Chao M (2017) An echoic account of vocal fry. Master’s thesis, San Francisco State University, San Francisco, CA
Chao M, Bursten JRS (2021) Girl talk: understanding negative reactions to female vocal fry. Hypatia 36(1):42–59
Dallaston K, Docherty G (2020) The quantitative prevalence of creaky voice (vocal fry) in varieties of English: a systematic review of the literature. PLoS ONE 15(3):e0229960
Davidson L (2018) The effects of pitch, gender, and prosodic context on the identification of creaky voice. Phonetica 76(4):235–262
Davidson L (2020) Contributions of modal and creaky voice to the perception of habitual pitch. Language 96(1):e22–e37
Davison DS (1991) An acoustic study of so-called creaky voice in Tianjin Mandarin. UCLA Working Pap Phonetics 78:50–57
de Leeuw E, Chang CB (2024) Phonetic and phonological L1 attrition and drift in bilingual speech. In: Amengual M (ed) The Cambridge Handbook of Bilingual Phonetics and Phonology. Cambridge University Press, Cambridge, UK, p 721–745
Dilley L, Shattuck-Hufnagel S, Ostendorf M (1996) Glottalization of word-initial vowels as a function of prosodic structure. J Phonetics 24(4):423–444
Esling J (1978) The identification of features of voice quality in social groups. J Int Phonetic Assoc 8(1–2):18–23
Esposito CM, Khan SD (2020) The cross-linguistic patterns of phonation types. Lang Linguist Compass 14(12):e12392
Foulks, N. (2020) Listeners’ attitudes towards young women with glottal fry. Master’s thesis, East Tennessee State University, Johnson City, TN
Gerratt BR, Kreiman J (2001) Toward a taxonomy of nonmodal phonation. J Phonetics 29(4):365–381
Gibson TA, Summers C (2021) A perceptual study of cross-linguistic influence on vocal fry use in women exposed to two languages. Int J Biling Educ Bilingual 24(3):373–385
Gittelson B, Leemann A, Tomaschek F (2021) Using crowd-sourced speech data to study socially constrained variation in nonmodal phonation. Front Artif Intell 3:565682
Gobl C, Ní CA (2003) The role of voice quality in communicating emotion, mood and attitude. Speech Commun 40(1–2):189–212
Gobl C, Bennett E, Ní Chasaide A (2002) Expressive synthesis: How crucial is voice quality. In: Proceedings of 2002 IEEE Workshop on Speech Synthesis. Institute of Electrical and Electronics Engineers, Piscataway, NJ, p 91–94
Gordon M, Ladefoged P (2001) Phonation types: a cross-linguistic overview. J Phonetics 29(4):383–406
Greer SDF (2015) The perception of coolness: Voice quality and its social uses and interpretations. Master’s thesis, University of Calgary, Calgary, Canada
Greer SDF, Winter, SJ (2015) The perception of coolness: differences in evaluating voice quality in male and female speakers. In: The Scottish Consortium for ICPhS 2015 (ed). In Proceedings of the 18th International Congress of Phonetic Sciences. University of Glasgow, Glasgow, UK, p 0883
Grivičić T, Nilep C (2004) When phonation matters: the use and function of yeah and creaky voice. Colo Res Linguist 17(1):1–11
Hancock AB, Pool SF (2017) Influence of listener characteristics on perceptions of sex and gender. J Lang Soc Psychol 36(5):599–610
Hedelin P, Huber D (1990) Pitch period determination of aperiodic speech signals. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP ’90), 1. Institute of Electrical and Electronics Engineers, Piscataway, NJ, p 361–364
Henton C, Bladon A (1988) Creak as a sociophonetic marker. In: Hyman, L.M., Li, C.N. (eds) Language, Speech and Mind: Studies in Honour of Victoria A. Fromkin. Routledge, London, UK, p 3–29
Holliday J, Walker A, Jung M (2023) Bringing indexical orders to non-arbitrary meaning: the case of pitch and politeness in English and Korean. Lab Phonol 14(1):1–24
Huang Y (2023) Phonetics of period doubling. PhD thesis, University of California, San Diego, San Diego, CA
Huang Y (2024a) The effects of vocal fry and period doubling on the perceived naturalness of Mandarin tones. In: Cho, T., Kim, S., Holliday, J. et al (eds).In Proceedings of the 19th Conference on Laboratory Phonology. Hanyang Institute for Phonetics and Cognitive Sciences of Language, Seoul, South Korea, p 431–432
Huang Y (2024b) Perception and imitation of period-doubled phonation: pitch and voice quality. J Acoust Soc Am 156(2):1391–1412
Keating P, Garellek M, Kreiman J (2015) Acoustic properties of different kinds of creaky voice. In: The Scottish Consortium for ICPhS 2015 (ed) Proceedings of the 18th International Congress of Phonetic Sciences. University of Glasgow, Glasgow, UK, p 0821
Kim JY (2017) Voice quality transfer in the production of Spanish heritage speakers and English L2 learners of Spanish. In: Perpiñán, S., Heap, D., Moreno-Villamar, I. et al (eds) Romance Languages and Linguistic Theory 11: Selected Papers from the 44th Linguistic Symposium on Romance Languages (LSRL), London, Ontario. John Benjamins Publishing, Amsterdam, The Netherlands, p 191–207
Klatt DH, Klatt LC (1990) Analysis, synthesis, and perception of voice quality variations among female and male talkers. J Acoust Soc Am 87(2):820–857
Kreiman J, Gerratt BR, Khan SD (2010) Effects of native language on perception of voice quality. J Phonetics 38(4):588–593
Kuang J (2017) Covariation between voice quality and pitch: revisiting the case of Mandarin creaky voice. J Acoust Soc Am 142(3):1693–1706
Kuang J (2018) The influence of tonal categories and prosodic boundaries on the creakiness in Mandarin. J Acoust Soc Am 143(6):EL509–EL515
Kuang J, Liberman M (2015) Influence of spectral cues on the perception of pitch height. In: The Scottish Consortium for ICPhS 2015 (ed). In Proceedings of the 18th International Congress of Phonetic Sciences. University of Glasgow, Glasgow, UK, p 0435
Kuang J, Liberman M (2016) The effect of vocal fry on pitch perception. In: Proceedings of the 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers, Piscataway, NJ, p 5260–5264
Kuang J, Liberman M (2018) Integrating voice quality cues in the pitch perception of speech and non-speech utterances. Front Psychol 9:2147
Kunnemann DB (2017) Perceptions of the effects of vocal fry on aspirational careers in prospective job markets, undergraduate honors thesis, University of Arkansas, Fayetteville, AR
Kuznetsova A, Brockhoff PB, Christensen RHB (2017) lmerTest package: Tests in linear mixed effects models. J Stat Softw 82(13):1–26
Ladefoged P (1982) The linguistic use of different phonation types. UCLA Working Pap Phonetics 54:28–39
Laver J (1980) The Phonetic Description of Voice Quality. Cambridge University Press, Cambridge, UK
Laver JDM (1968) Voice quality and indexical information. Br J Disord Commun 3(1):43–54
Lee KE (2016) The perception of creaky voice: does speaker gender affect our judgments? Master’s thesis, University of Kentucky, Lexington, KY
Lee S (2015) Creaky voice as a phonational device marking parenthetical segments in talk. J Socioling 19(3):275–302
Lenth RV, Banfai B, Bolker B et al (2025) emmeans: estimated marginal means, aka least-squares means [R package], version 1.11.0-003. Available online: https://cran.r-project.org/package=emmeans
Li A, Lai W (2023) How do listeners evaluate creak: a matched-guise study in Mandarin Chinese, paper presented at the 97th Annual Meeting of the Linguistic Society of America, Denver, CO
Li A, Lai W, Kuang J (2023) Creaky voice identification in Mandarin: the effects of prosodic position, tone, pitch range and creak locality. J Acoust Soc Am 154(1):126–140
Li P, Zhang F, Tsai E (2014) Language history questionnaire (LHQ 2.0): a new dynamic web-based research tool. Bilingualism: Lang Cognition 17(3):673–680
Li Q, Mok P (2023) A perception study on voice quality and stance in Mandarin Chinese. In: Skarnitzl, R., Volín, J. (eds). In Proceedings of the 20th International Congress of Phonetic Sciences. Guarant International, Prague, Czechia, p 1786–1790
Loakes D, Gregory A (2022) Voice quality in Australian English. JASA Express Lett 2(8):085201
Melvin S, Clopper CG (2015) Gender variation in creaky voice and fundamental frequency. In: The Scottish Consortium for ICPhS 2015 (ed) Proceedings of the 18th International Congress of Phonetic Sciences. University of Glasgow, Glasgow, UK, paper number 0320
Mendoza-Denton N (2011) The semiotic hitchhiker’s guide to creaky voice: Circulation and gendered hardcore in a Chicana/o gang persona. J Linguistic Anthropol 21(2):261–280
Monsen RB, Engebretson AM (1977) Study of variations in the male and female glottal wave. J Acoustical Soc Am 62(4):981–993
Nissen S, Randle QB, Johnson JL (2020) Prosodic elements for content delivery in broadcast journalism: a quantitative study of vocal pitch. Electron N 14(2):63–77
Nodari R, Celata C, Nagy N (2019) Socio-indexical phonetic features in the heritage language context: Voiceless stop aspiration in the Calabrian community in Toronto. J Phonetics 73:91–112
Ohala JJ (1983) Cross-language use of pitch:An ethological view. Phonetica 40(1):1–18
Oliveira G, Davidson A, Holczer R (2016) A comparison of the use of glottal fry in the spontaneous speech of young and middle-aged American women. J Voice 30(6):684–687
Parker MA, Borrie SA (2017) Judgments of intelligence and likability of young adult female speakers of American English: the influence of vocal fry and the surrounding acoustic-prosodic context. J Voice 32(5):538–545
Pennock-Speck B (2005) Voice quality and the construction of female voice styles. In: Asociación Española de Estudios Anglo-Norteamericanos, de Leonardo, J.J.C.G. (eds). In Actas XXVIII Congreso Internacional AEDEAN. University of Valencia, Valencia, Spain, p 407–414
Pittam J (1987) Listeners’ evaluations of voice quality in Australian English speakers. Lang Speech 30(2):99–113
Podesva RJ (2011) Gender and the social meaning of non-modal phonation types. In: Cathcart, C., Chen, I.H., Finley, G. et al (eds). In Proceedings of the 37th Annual Meeting of the Berkeley Linguistics Society. Berkeley Linguistics Society, Berkeley, CA, p 427–448
Pratt TC (2018) Affective sociolinguistic style: an ethnography of embodied linguistic variation in an arts high school. PhD thesis, Stanford University, Stanford, CA
R Development Core Team (2023) R: A language and environment for statistical computing, version 4.4.0. http://www.r-project.org
Redi L, Shattuck-Hufnagel S (2001) Variation in the realization of glottalization in normal speakers. J Phonetics 29(4):407–429
Sakaluk JK, Short SD (2017) A methodological review of exploratory factor analysis in sexuality research: used practices, best practices, and data analysis resources. J Sex Res 54(1):1–9
Sankoff G (2004) Adolescents, young adults and the critical period: Two case studies from “Seven Up”. In: Fought C (ed). Sociolinguistic Variation: Critical Reflections. Oxford University Press, Oxford, UK, p 121–139
Sebregts K, Vriesendorp H, Quené H et al (2023) Creaky voice in L2 English and L1 Dutch. In: Skarnitzl, R., Volín, J. (eds). In Proceedings of the 20th International Congress of Phonetic Sciences. Guarant International, Prague, Czechia, p 1841–1845
Sebregts K, Vriesendorp H, Quené H et al (2024) Long-term phonetic convergence vs. speaker-specificity: Creaky voice in L2 English. In: Cho, T., Kim, S., Holliday, J. et al (eds). In Proceedings of the 19th Conference on Laboratory Phonology. Hanyang Institute for Phonetics and Cognitive Sciences of Language, Seoul, South Korea, p 387–388
Shue YL, Keating P, Vicenik C, Yu K (2015) VoiceSauce: a program for voice analysis. In: Lee, W.S., Zee, E. (eds). In Proceedings of the 17th International Congress of Phonetic Sciences. City University of Hong Kong, Hong Kong, p 1846–1849
Starr RL (2015) Sweet voice: the role of voice quality in a Japanese feminine style. Lang Soc 44(1):1–34
Stuart-Smith J (1999) Glasgow: accent and voice quality. In: Foulkes, P., Docherty, G.J. (eds) Urban Voices: accent Studies in the British Isles. Arnold, London, UK, p 201–222
Szakay A (2012) Voice quality as a marker of ethnicity in New Zealand: From acoustics to perception. J Sociolinguistics 16(3):382–397
Tian J, Zhou Y, Kuang J (2019) Cross-linguistic variation in the phonetic realization of “breathier voice”. In: Calhoun, S., Escudero, P., Tabain, M. et al (eds) Proceedings of the 19th International Congress of Phonetic Sciences. Australasian Speech Science and Technology Association Inc., Canberra, Australia, p 1450–1454
Torres PJ, Henry SG, Ramanathan V (2020) Let’s talk about pain and opioids: Low pitch and creak in medical consultations. Discourse Stud 22(2):174–204
Trudgill P (1974) The Social Differentiation of English in Norwich. Cambridge University Press, Cambridge, UK
van Bezooijen R (1995) Sociocultural aspects of pitch difference between Japanese and Dutch women. Lang Speech 38(3):253–265
White H, Penney J, Gibson A et al (2023) Convergence of creaky voice use in Australian English. In: Skarnitzl, R., Volín, J. (eds) Proceedings of the 20th International Congress of Phonetic Sciences. Guarant International, Prague, Czechia, p 1791–1795
Wilce JM (1997) Discourse, power, and the diagnosis of weakness: encountering practitioners in Bangladesh. Med Anthropol Q 11(3):352–374
Wolk L, Abdelli-Beruh NB, Slavin D (2012) Habitual use of vocal fry in young adult female speakers. J Voice 26(3):e111–e116
Xu A, Lee A (2018) Perception of vocal attractiveness by Mandarin native listeners. In: Klessa, K., Bachan, J., Wagner, A. et al. (eds) Proceedings of the 9th International Conference on Speech Prosody. International Speech Communication Association, Poznań, Poland, p 344–348
Xu Y, Lee A, Wu WL (2013) Human vocal attractiveness as signaled by body size projection. PLoS ONE 8(4):e62397
Yang RX (2011) The phonation factor in the categorical perception of Mandarin tones. In: Lee, W.S., Zee, E. (eds) Proceedings of the 17th International Congress of Phonetic Sciences. City University of Hong Kong, Hong Kong, SAR, p 2204–2207
Yau E (2020) Podcaster prosody: Creaky voice and Sarah Koenig’s journalistic persona. Lifesp Styles 6(2):2–10
Yu KM, Lam HW (2014) The role of creaky voice in Cantonese tonal perception. J Acoustical Soc Am 136(3):1320–1333
Yuasa IP (2010) Creaky voice: a new feminine voice quality for young urban-oriented upwardly mobile American women?. Am Speech 85(3):315–337
Zhu S, Chong S, Chen Y (2022) Effect of language on voice quality: an acoustic study of bilingual speakers of Mandarin Chinese and English. Folia Phoniatrica et Logopaedica 74(6):421–430
Zimman L (2017) Affective stance and voice quality in a pervasively creaky speaker: Stance objects as a tool for investigating indexical meaning, paper presented at New Ways of Analyzing Variation (NWAV) 45, Vancouver, Canada
Acknowledgements
This research is supported by grant no. 15611322 awarded to PI Yao Yao by the Research Grants Council (General Research Fund) of Hong Kong. The authors thank Shiyue Li and Ka Keung Lee for their assistance with recording and annotating speech data. We thank the editor and two anonymous reviewers for their feedback in the review process.
Author information
Authors and Affiliations
Contributions
Y.Y. and C.B.C. conceived the experiment; Y.Y. and C.B.C. acquired funding; Y.Y. and C.B.C. obtained ethics approval; Y.Y. and M.L. created the stimulus materials; Y.Y. and M.L. conducted the experiment; Y.Y. and C.B.C. curated the data; Y.Y. and M.L. analyzed the results; and Y.Y. and C.B.C. wrote the original draft of the manuscript. All authors contributed to designing the experiment and to manuscript review and editing.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval
The study was approved by the Institutional Review Board of the Hong Kong Polytechnic University on Apr 6, 2022 (ethics approval number HSEAR20220405004). The approval covered all work reported in this article. All study protocols were performed in accordance with the ethical standards of the 1964 Declaration of Helsinki and its later amendments.
Informed consent
Written informed consent was obtained from all participants, who received a nominal amount of monetary compensation for their time and effort. Consent was obtained between July and December 2023, by the researcher from the participant directly, and covered participation, data use, and data publication. All participants were fully informed of possible risks of the study as well as of the anonymity and intended use of their data; they were also debriefed on the aims of the research.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Yao, Y., Li, M. & Chang, C.B. Social perception of creaky voice in Mandarin Chinese: everyone’s gender matters. Humanit Soc Sci Commun (2026). https://doi.org/10.1057/s41599-026-07108-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1057/s41599-026-07108-z


