Abstract
With few exceptions, modern psychometric concepts and methods are not part of the “standard” toolkit used to analyze linguistic data. Data from Kam (or Dong), an under-studied language of China with a very complex tone system, is used to illustrate how a psychometric approach can help investigate inter-individual differences in the perception of tone contrasts in an AX task, and the factors that may influence them, including working memory, age, gender, and years of formal education. We show that these methods uncover an unexpected structure of the responses, as some items designed as “different” based on the language’s phonology were instead perceived as “same”, prompting a recoding of these items in line with their perception. The subsequent use of % correct responses, Signal Detection Theory, multiple regression, mediation, and path analysis, found a complex network of influences on the AX task: there are direct effects of age, education and working memory, but only an indirect effect of gender. Therefore, even such a “simple” AX task can benefit from this approach, and we argue, in this primarily methodological paper which also includes directly usable extensive computer code in R, that modern psychometrics should be more widely used in linguistics.
Similar content being viewed by others
Data availability
Due to ethical and legal restrictions, the primary data is available upon request by contacting the corresponding author, M.W. All the R code is available in the GitHub repository https://github.com/ddediu/tone_ax_dong, but cannot be run without the primary data; nevertheless, the compiled report (in HTML format) containing all the analyses and plots is provided in this repository.
References
Yip, M. Tone. Cambridge Textbooks in Linguistics (Cambridge University Press, 2002).
Collins, J. Real and spurious correlations involving tonal languages. In Enfield, N. (ed.) Dependencies in language: On the causal ontology of linguistics systems, no. 99 in Studies in Diversity Linguistics, 127–138, https://doi.org/10.5281/zenodo.573776 (Language Science Press, 2016).
Wu, B., Zhang, H. & Zhang, M. Phylogenetic insight into the origin of tones. Proceed. Royal Soc. B: Biol. Sci. 290, 20230606. https://doi.org/10.1098/rspb.2023.0606 (2023).
Everett, C., Blasi, D. E. & Roberts, S. G. Climate, vocal folds, and tonal languages: Connecting the physiological and geographic dots. Proceed. Nat. Acad. Sci. https://doi.org/10.1073/pnas.1417413112 (2015).
Liang, Y. et al. Languages in China link climate, voice quality, and tone in a causal chain. Humanit. Social Sci. Commun. 10, 1–10. https://doi.org/10.1057/s41599-023-01969-4 (2023).
Dediu, D. & Ladd, D. R. Linguistic tone is related to the population frequency of the adaptive haplogroups of two brain size genes, ASPM and Microcephalin. Proc. Natl. Acad. Sci U S A 104, 10944–9. https://doi.org/10.1073/pnas.0610848104 (2007).
Wong, P. C. M. et al. ASPM-lexical tone association in speakers of a tone language: Direct evidence for the genetic-biasing hypothesis of language evolution. Sci. Adv.6, eaba5090, (2020) https://doi.org/10.1126/sciadv.aba5090.
Peng, G., Zhang, C., Zheng, H.-Y., Minett, J. W. & Wang, W.S.-Y. The effect of intertalker variations on acoustic-perceptual mapping in Cantonese and Mandarin tone systems. J. Speech Lang. Hear. Res. 55, 579–595. https://doi.org/10.1044/1092-4388(2011/11-0025) (2012).
Shuai, L. & Gong, T. Temporal relation between top-down and bottom-up processing in lexical tone perception. Front. Behav. Neurosci. 8, 97 (2014).
Massaro, D. W., Cohen, M. M. & Tseng, C.-Y. The evaluation and integration of pitch height and pitch contour in lexical tone perception in mandarin Chinese. J. Chinese Linguist. 13, 267–289 (1985).
Moore, C. B. & Jongman, A. Speaker normalization in the perception of Mandarin Chinese tones. J. Acoust. Soc. Am. 102, 1864–1877. https://doi.org/10.1121/1.420092 (1997).
Lee, K. Y. S., Chan, K. T. Y., Lam, J. H. S., van Hasselt, C. A. & Tong, M. C. F. Lexical tone perception in native speakers of Cantonese. Int. J. Speech Lang. Pathol. 17, 53–62. https://doi.org/10.3109/17549507.2014.898096 (2015).
Fok, C. Y. Y. A Perceptual Study of Tones in Cantonese (University of Hong Kong, Centre of Asian Studies, 1974).
Gandour, J. Perceptual dimensions of tone: Evidence from cantonese. J. Chinese Linguist. 9, 20–36 (1981).
Gandour, J. Tone perception in Far Eastern languages. J. Phon. 11, 149–175. https://doi.org/10.1016/S0095-4470(19)30813-7 (1983).
Gandour, J. T. & Harshman, R. A. Crosslanguage differences in tone perception: A multidimensional scaling investigation. Lang. Speech 21, 1–33. https://doi.org/10.1177/002383097802100101 (1978).
Garellek, M., Keating, P., Esposito, C. M. & Kreiman, J. Voice quality and tone identification in White Hmong. J. Acoust. Soc. Am. 133, 1078–1089. https://doi.org/10.1121/1.4773259 (2013).
Ciocca, V. & Lui, J. The development of the perception of Cantonese lexical tones. J. Multiling. Commun. Disord. 1, 141–147. https://doi.org/10.1080/1476967031000090971 (2003).
Burnham, D., Brooker, R. & Reid, A. The effects of absolute pitch ability and musical training on lexical tone perception. Psychol. Music 43, 881–897. https://doi.org/10.1177/0305735614546359 (2015).
Chang, D., Hedberg, N. & Wang, Y. Effects of musical and linguistic experience on categorization of lexical and melodic tones. J. Acoust. Soc. Am. 139, 2432. https://doi.org/10.1121/1.4947497 (2016).
Choi, W. The selectivity of musical advantage musicians exhibit perceptual advantage for some but not all Cantonese tones. Music. Percept. 37, 423–434. https://doi.org/10.1525/mp.2020.37.5.423 (2020).
Ong, J. H., Wong, P. C. M. & Liu, F. Musicians show enhanced perception, but not production, of native lexical tones. J. Acoust Soc. Am. 148, 3443. https://doi.org/10.1121/10.0002776 (2020).
Wu, M. A Grammar of Sanjiang Kam (LINCOM Gmbh, 2018).
Donohue, C. & Wu, M. F0 and aspiration in Kam: Caught in the beginning. In Proceedings of the International Conference on Phonetics of the Languages in China (ICPLC-13) (City University of Hong Kong, Hong Kong, 2013).
Bromham, L. et al. Global predictors of language endangerment and the future of linguistic diversity. Nat. Ecol. Evol. 6, 1–11. https://doi.org/10.1038/s41559-021-01604-y (2022).
Hu, A., Wang, M., Li, Y., Tang, Q. & Gu, F. Dong speakers outperform Mandarin speakers in behavioral pitch discrimination. J. Acoust. Soc. Am.147, EL62–EL65, https://doi.org/10.1121/10.0000604 (2020).
Gerrits, E. The categorisation of speech sounds by adults and children. PhD Thesis, Utrecht University, Utrecht, The Netherlands (2001).
Chiu, F., Rakusen, L. L. & Mattys, S. L. Phonetic categorization and discrimination of voice onset time under divided attention. J. Acoust. Soc. Am. 147, EL484–EL490, https://doi.org/10.1121/10.0001374 (2020).
Gerrits, E. & Schouten, M. E. H. Categorical perception depends on the discrimination task. Percept. Psychophys. 66, 363–376. https://doi.org/10.3758/BF03194885 (2004).
Macmillan, N. A. & Creelman, C. D. Detection theory: A user’s guide. Detection theory: A user’s guide (Cambridge University Press, New York, 1991).
Borsboom, D. Measuring the Mind: Conceptual Issues in Contemporary Psychometrics, 1 edn (Cambridge University Press, 2009).
Furr, R. M. & Bacharach, V. R. Psychometrics: An Introduction, 1 edn(Sage Publications, Inc, Thousand Oaks, CA, USA, 2007).
Nunnally, J. & Bernstein, I. Psychometric Theory, 3 edn (McGraw-Hill Humanities/Social Sciences/Languages, 1994).
Dima, A. L. Scale validation in applied health research: Tutorial for a 6-step R-based psychometrics protocol. Health Psychol. Behav. Med. 6, 136–161. https://doi.org/10.1080/21642850.2018.1472602 (2018).
Embretson, S. E. & Reise, S. P. Item Response Theory (Psychology Press, 2013).
Jolliffe, I. & Cadima, J. Principal component analysis: A review and recent developments. Philosoph. Trans. Royal Soc. A: Math., Phys. Eng. Sci. 374, 20150202. https://doi.org/10.1098/rsta.2015.0202 (2016).
Kline, R. B. Principles and Practice of Structural Equation Modeling (Guilford Publications, 2023).
Hays, R. D., Morales, L. S. & Reise, S. P. Item Response Theory and Health Outcomes Measurement in the 21st Century. Medical care38, II28–II42 (2000).
Schuur, W. H. v. Mokken Scale Analysis: Between the Guttman Scale and Parametric Item Response Theory. Polit. Anal.11, 139–163, https://doi.org/10.1093/pan/mpg002 (2003).
Jolliffe, I. Principal Component Analysis, 2 edn. Springer Series in Statistics (Springer Verlag, 2002).
R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2023).
Goldinger, S. D. Signal detection comparisons of phonemic and phonetic priming: The flexible-bias problem. Percept. Psychophys. 60, 952–965. https://doi.org/10.3758/BF03211931 (1998).
Hanulíková, A., Dediu, D., Fang, Z., Bašnaková, J. & Huettig, F. Individual differences in the acquisition of a complex L2 phonology: A training study. Lang. Learn. 62, 79–109. https://doi.org/10.1111/j.1467-9922.2012.00707.x (2012).
Huang, Y. & Ferreira, F. The application of signal detection theory to acceptability judgments. Front. Psychol. 11 (2020).
Donaldson, W. Measuring recognition memory. J. Exp. Psychol. Gen. 121, 275–277. https://doi.org/10.1037/0096-3445.121.3.275 (1992).
Pallier, C. Computing discriminability and bias with the R software (2002).
Child, D. The Essentials of Factor Analysis (A&C Black, 2006).
Schumacker, R. E. & Lomax, R. G. A Beginner’s Guide to Structural Equation Modeling: Third Edition (Routledge, 2012).
Pearl, J. Causality: Models, Reasoning, and Inference (Cambridge University Press, 2000).
Pearl, J. & Mackenzie, D. The Book of Why: The new science of cause and effect (Penguin UK, 2018).
Rohrer, J. M. Thinking clearly about correlations and Causation: Graphical causal models for observational data. Adv. Methods Pract. Psychol. Sci. 1, 27–42. https://doi.org/10.1177/2515245917745629 (2018).
Hu, L. & Bentler, P. M. Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Struct. Equ. Modeling 6, 1–55. https://doi.org/10.1080/10705519909540118 (1999).
Kaiser, H. F., Rice, J., Little, J., & Mark, I. V. Educ. Psychol. Measur. 34, 111–117. https://doi.org/10.1177/001316447403400115 (1974).
Bartlett, M. S. The Effect of Standardization on a \(\chi ^2\) Approximation in Factor Analysis. Biometrika 38, 337–344. https://doi.org/10.2307/2332580 (1951).
Revelle, W. psych: Procedures for Psychological, Psychometric, and Personality Research. Northwestern University, Evanston, Illinois (2023). R package version 2.3.6.
Meijer, R. R. & Baneke, J. J. Analyzing psychopathology items: A case for nonparametric item response theory modeling. Psychol. Methods 9, 354–368. https://doi.org/10.1037/1082-989X.9.3.354 (2004).
Guttman, L. A basis for scaling qualitative data. Am. Sociol. Rev. 9, 139–150. https://doi.org/10.2307/2086306 (1944).
Meijer, R. R., Niessen, A. S. M. & Tendeiro, J. N. A practical guide to check the consistency of item response patterns in clinical research through person-fit statistics: Examples and a computer program. Assessment 23, 52–62. https://doi.org/10.1177/1073191115577800 (2016).
Ark, L. A. v. d. Mokken Scale Analysis in R. J. Stat. Softw. 20, 1–19, https://doi.org/10.18637/jss.v020.i11 (2007).
MacKinnon, D. P., Fairchild, A. J. & Fritz, M. S. Mediation Analysis. Annu. Rev. Psychol. 58, 593. https://doi.org/10.1146/annurev.psych.58.110405.085542 (2007).
Lefcheck, J. S. piecewiseSEM: Piecewise structural equation modelling in r for ecology, evolution, and systematics. Methods Ecol. Evol. 7, 573–579. https://doi.org/10.1111/2041-210X.12512 (2016).
Boersma, P. P. A system for doing phonetics by computer. Glot Int. 5, 341–345 (2001).
Acknowledgements
We wish to thank our participants, as well as Ms. Liu Xianxian, Ms. O, Xianglan, Ms. Wu Yanglin, Ms. Wu Yunxia, Mr. Tao Kexing, Mr. Zhang Bolun. Ms. Liao Zengliang, Ms. Wu Pinglu, and Ms. Wu Jinhua for assistance with data collection. We thank Alexandra L. Dima for making the 6 steps R script freely available and for help with psychometric questions. We also thank Hoyee Wong Hirai for her assistance with data analysis, and five anonymous reviewers for feedback and suggestions. DD acknowledges Grant No. PID2022-138501NB-I00 funded by MICIU/AEI/10.13039/501100011033 (Spain) and by ERDF/EU.
Author information
Authors and Affiliations
Contributions
P.C.M.W and M.W. conceived the study. P.C.M.W, M.W. and L.W. designed the tasks. M.W. and L.W. conducted the experiment and collected the data. L.W. performed a preliminary analysis. D.D. conducted the analysis and wrote the manuscript. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval
This study was approved by the Joint Chinese University of Hong Kong and New Territories East Cluster Clinical Research Ethics Committee Ref. No. 2013.663 (initial approval date: 23/01/2014, renewal date: 23/01/2022). The research was performed in accordance with the Declaration of Helsinki and in accordance with relevant guidelines and regulations. Informed consent was obtained from all participants and/or their legal guardian(s).
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Dediu, D., Wang, L., Wong, P.C.M. et al. Psychometric methods and signal detection theory uncover subtle differences in the perception of tone contrasts in speakers of Kam (Dong). Sci Rep (2026). https://doi.org/10.1038/s41598-026-46380-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-46380-6


