Semantic similarity across languages reflects neurocognitive dimensions shaped by climate

Fu, Ze; Chu, Yuxi; Zhang, Tangxiaoxue; Li, Yawen; Wang, Xiaosha; Bi, Yanchao

doi:10.1038/s41467-026-70608-8

Download PDF

Article
Open access
Published: 16 March 2026

Semantic similarity across languages reflects neurocognitive dimensions shaped by climate

Nature Communications , Article number: (2026) Cite this article

2902 Accesses
2 Altmetric
Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

Abstract

Human languages differ widely, yet they share systematic regularities in the underlying semantic representations being expressed. How such similarities and differences arise remains unclear, in part because semantic theories often lack a principled link to neurocognitive constraints. Drawing on neurocognitive accounts in which semantic knowledge is grounded in biologically salient information dimensions, we examine how environmental factors shape conceptual representations in language. Here we show that word meanings across languages are organized along shared neurocognitive dimensions, while systematic variation along these dimensions is associated with climate. Using word embeddings from 53 languages and behavioral ratings from speakers of 8 languages, we find converging evidence that climatic variables explain semantic variation beyond commonly considered sociocultural factors. Complementary exploratory brain data further suggest climate-related modulation of activity patterns in the right anterior temporal lobe. Together, these findings indicate that semantic representations in language reflect biologically grounded dimensions that are flexibly shaped by long-term environmental conditions.

Prediction, syntax and semantic grounding in the brain and large language models

Article Open access 10 March 2026

Breakdown of category-specific word representations in a brain-constrained neurocomputational model of semantic dementia

Article Open access 10 November 2023

A large dataset of semantic ratings and its computational extension

Article Open access 23 February 2023

Data availability

The multilingual fastText embeddings used in Study 1 are publicly available at https://fasttext.cc. Additional distributional semantic vectors used in validation analyses were obtained from the subs2vec model (https://github.com/jvparidon/subs2vec). The NorthEuraLex word list and concept translations are available at http://northeuralex.org. Colexification data were obtained from the CLICS database (version 3.0; https://clics.clld.org). Climate variables were obtained from the WorldClim 2.0 database (https://www.worldclim.org). Cultural variables were obtained from the D-PLACE database (https://d-place.org). The behavioral rating data from Study 2 (13-dimensional ratings for 207 concepts in 8 languages), together with derived semantic matrices and environmental distance matrices, are available at OSF. The fMRI data re-analyzed in Study 3 were originally collected and shared by Malik-Moraleda et al.⁵¹ and are available at OSF. Our derived language-level neural representational dissimilarity matrices and analysis scripts are available at OSF. Source Data underlying the main figures and tables are provided with this paper. Source data are provided with this paper.

Code availability

All data analyses were conducted using Python (version 3.9.1) and R (version 4.5.2). Complete details of software packages and analysis code are available at OSF.

References

Wierzbicka, A. Semantics, Culture, and Cognition: Universal Human Concepts in Culture-Specific Configurations (Oxford University Press, USA, 1992).
Kay, P. & Kempton, W. What is the Sapir-Whorf hypothesis?. Am. Anthropol. 86, 65–79 (1984).
Google Scholar
Majid, A., Bowerman, M., Kita, S., Haun, D. B. & Levinson, S. C. Can language restructure cognition? The case for space. Trends Cogn. Sci. 8, 108–114 (2004).
Google Scholar
Berlin, B. & Kay, P. Basic Color Terms: Their Universality and Evolution (Univ of California Press, 1991).
Wierzbicka, A. Defining emotion concepts. Cogn. Sci. 16, 539–581 (1992).
Google Scholar
Ekman, P. An argument for basic emotions. Cogn. Emot. 6, 169–200 (1992).
Google Scholar
Majid, A. et al. Differential coding of perception in the world’s languages. Proc. Natl. Acad. Sci. USA 115, 11369–11376 (2018).
Google Scholar
Passmore, S. & Jordan, F. M. No universals in the cultural evolution of kinship terminology. Evolut. Hum. Sci. 2, e42 (2020).
Google Scholar
Majid, A., Jordan, F. & Dunn, M. Semantic systems in closely related languages. Language Sciences 49, 1–18 (2015).
Jackson, J. C. et al. From text to thought: how analyzing language can advance psychological science. Perspect. Psychol. Sci. 17, 805–826 (2022).
Google Scholar
Youn, H. et al. On the universal structure of human lexical semantics. Proc. Natl. Acad. Sci. USA 113, 1766–1771 (2016).
Google Scholar
Thompson, B., Roberts, S. G. & Lupyan, G. Cultural influences on word meanings revealed through large-scale semantic alignment. Nat. Hum. Behav. 4, 1029–1038 (2020).
Google Scholar
Lewis, M., Cahill, A., Madnani, N. & Evans, J. Local similarity and global variability characterize the semantic space of human languages. Proc. Natl. Acad. Sci. USA 120, e2300986120 (2023).
Google Scholar
Jackson, J. C. et al. Emotion semantics show both cultural variation and universal structure. Science 366, 1517–1522 (2019).
Google Scholar
Martin, A., Haxby, J. V., Lalonde, F. M., Wiggs, C. L. & Ungerleider, L. G. Discrete cortical regions associated with knowledge of color and knowledge of action. Science 270, 102–105 (1995).
Google Scholar
He, C. et al. Selectivity for large nonmanipulable objects in scene-selective visual cortex does not require visual experience. NeuroImage 79, 1–9 (2013).
Google Scholar
Fernandino, L. et al. Concept representation reflects multimodal abstraction: a framework for embodied semantics. Cereb. Cortex 26, 2018–2034 (2016).
Google Scholar
Fernandino, L., Tong, J.-Q., Conant, L. L., Humphries, C. J. & Binder, J. R. Decoding the information structure underlying the neural representation of concepts. Proc. Natl. Acad. Sci. USA 119, e2108091119 (2022).
Google Scholar
Capitani, E., Laiacona, M., Mahon, B. & Caramazza, A. What are the facts of semantic category-specific deficits? A critical review of the clinical evidence. Cogn. Neuropsychol. 20, 213–261 (2003).
Google Scholar
Miceli, G. et al. The dissociation of color from form and function knowledge. Nat. Neurosci. 4, 662–667 (2001).
Google Scholar
Buxbaum, L. J., Veramontil, T. & Schwartz, M. F. Function and manipulation tool knowledge in apraxia: knowing ‘what for’ but not ‘how’. Neurocase 6, 83–97 (2000).
Google Scholar
Deen, B. et al. Organization of high-level visual cortex in human infants. Nat. Commun. 8, 13995 (2017).
Google Scholar
Wen, H., Xu, T., Wang, X., Yu, X. & Bi, Y. Brain intrinsic connection patterns underlying tool processing in human adults are present in neonates and not in macaques. NeuroImage 258, 119339 (2022).
Google Scholar
Bottini, R. et al. Brain regions involved in conceptual retrieval in sighted and blind people. J. Cogn. Neurosci. 32, 1009–1025 (2020).
Google Scholar
Wang, X., Men, W., Gao, J., Caramazza, A. & Bi, Y. Two forms of knowledge representations in the human brain. Neuron 107, 383–393.e385 (2020).
Google Scholar
Bi, Y. Dual coding of knowledge in the human brain. Trends Cogn. Sci. 25, 883–895 (2021).
Google Scholar
Mahon, B. Z. & Caramazza, A. What drives the organization of object knowledge in the brain? Trends Cogn. Sci. 15, 97–103 (2011).
Google Scholar
Martin, A. GRAPES—Grounding representations in action, perception, and emotion systems: how object properties and categories are represented in the human brain. Psychon. Bull. Rev. 23, 979–990 (2016).
Google Scholar
Binder, J. R. & Desai, R. H. The neurobiology of semantic memory. Trends Cogn. Sci. 15, 527–536 (2011).
Google Scholar
Lambon-Ralph, M. A., Jefferies, E., Patterson, K. & Rogers, T. T. The neural and computational bases of semantic cognition. Nat. Rev. Neurosci. 18, 42–55 (2017).
Google Scholar
Patterson, K., Nestor, P. J. & Rogers, T. T. Where do you know what you know? The representation of semantic knowledge in the human brain. Nat. Rev. Neurosci. 8, 976–987 (2007).
Google Scholar
Binder, J. R. et al. Toward a brain-based componential semantic representation. Cogn. Neuropsychol. 33, 130–174 (2016).
Google Scholar
Buchanan, E. M., Valentine, K. D. & Maxwell, N. P. English semantic feature production norms: an extended database of 4436 concepts. Behav. Res. Methods 51, 1849–1863 (2019).
Google Scholar
McRae, K., Cree, G. S., Seidenberg, M. S. & McNorgan, C. Semantic feature production norms for a large set of living and nonliving things. Behav. Res. Methods 37, 547–559 (2005).
Google Scholar
Kemmerer, D. Grounded cognition entails linguistic relativity: a neglected implication of a major semantic theory. Top. Cogn. Sci. 15, 615–647 (2023).
Google Scholar
Zaslavsky, N., Kemp, C., Tishby, N. & Regier, T. Communicative need in colour naming. Cogn. Neuropsychol. 37, 312–324 (2020).
Google Scholar
Jonauskaite, D. et al. Universal patterns in color-emotion associations are further shaped by linguistic and geographic proximity. Psychol. Sci. 31, 1245–1260 (2020).
Google Scholar
Lindquist, K. A., Jackson, J. C., Leshin, J., Satpute, A. B. & Gendron, M. The cultural evolution of emotion. Nat. Rev. Psychol. 1, 669–681 (2022).
Google Scholar
Bentz, C., Dediu, D., Verkerk, A. & Jäger, G. The evolution of language families is shaped by the environment beyond neutral drift. Nat. Hum. Behav. 2, 816–821 (2018).
Google Scholar
Van de Vliert, E. The global ecology of differentiation between us and them. Nat. Hum. Behav. 4, 270–278 (2020).
Google Scholar
Wormley, A. S., Kwon, J. Y., Barlev, M. & Varnum, M. E. How much cultural variation around the globe is explained by ecology? Proc. R. Soc. B 290, 20230485 (2023).
Google Scholar
Grave, E., Bojanowski, P., Gupta, P., Joulin, A. & Mikolov, T. Learning word vectors for 157 languages. Preprint at https://arxiv.org/abs/1802.06893 (2018).
Grand, G., Blank, I. A., Pereira, F. & Fedorenko, E. Semantic projection recovers rich human knowledge of multiple object features from word embeddings. Nat. Hum. Behav. 6, 975–987 (2022).
Google Scholar
Lewis, M., Zettersten, M. & Lupyan, G. Distributional semantics as a source of visual knowledge. Proc. Natl. Acad. Sci. USA 116, 19237–19238 (2019).
Google Scholar
Chersoni, E., Santus, E., Huang, C.-R. & Lenci, A. Decoding word embeddings with brain-based semantic features. Comput. Linguist. 47, 663–698 (2021).
Google Scholar
Cole, M. W., Bassett, D. S., Power, J. D., Braver, T. S. & Petersen, S. E. Intrinsic and task-evoked network architectures of the human brain. Neuron 83, 238–251 (2014).
Google Scholar
Romney, A. K., Moore, C. C., Batchelder, W. H. & Hsia, T.-L. Statistical methods for characterizing similarities and differences between semantic structures. Proc. Natl. Acad. Sci. USA 97, 518–523 (2000).
Google Scholar
Rzymski, C. et al. The database of cross-linguistic colexifications, reproducible analysis of cross-linguistic polysemies. Sci. Data 7, 13 (2020).
Google Scholar
Kriegeskorte, N., Mur, M. & Bandettini, P. A. Representational similarity analysis-connecting the branches of systems neuroscience. Front. Syst. Neurosci. 2, 249 (2008).
Google Scholar
Hammarström, H., Forkel, R., Haspelmath, M. & Bank, S. Glottolog 4.6 Leipzig: Max Planck Institute for the Science of Human History. https://glottolog.org/ (2022).
Malik-Moraleda, S. et al. An investigation across 45 languages and 12 language families reveals a universal language network. Nat. Neurosci. 25, 1014–1019 (2022).
Google Scholar
Fedorenko, E., Hsieh, P.-J., Nieto-Castañón, A., Whitfield-Gabrieli, S. & Kanwisher, N. New method for fMRI investigations of language: defining ROIs functionally in individual subjects. J. Neurophysiol. 104, 1177–1194 (2010).
Google Scholar
Kobrick, J. L. Effects of hypoxia and acetazolamide on color sensitivity zones in the visual field. J. Appl. Physiol. 28, 741–747 (1970).
Google Scholar
Wang, Z.-X., Zhang, D.-L. & Ma, H.-L. The effect of high altitude on human color perception. Sheng Li Xue Bao 71, 833–838 (2019).
Google Scholar
Sigismondi, F., Xu, Y., Silvestri, M. & Bottini, R. Altered grid-like coding in early blind people. Nat. Commun. 15, 3476 (2024).
Google Scholar
Li, Y. et al. High-altitude exposure and time interval perception of Chinese migrants in Tibet. Brain Sci. 12, 585 (2022).
Google Scholar
Talhelm, T. et al. Large-scale psychological differences within China explained by rice versus wheat agriculture. Science 344, 603–608 (2014).
Google Scholar
Van de Vliert, E. Climato-economic habitats support patterns of human needs, stresses, and freedoms. Behav. Brain Sci. 36, 465–480 (2013).
Google Scholar
Miyashita, Y. Perirhinal circuits for memory processing. Nat. Rev. Neurosci. 20, 577–592 (2019).
Google Scholar
Brown, C. H. Hand and arm in The World Atlas of Language Structures (ed M. Haspelmath, Dryer, M.S., Gil, D., & Comrie, B.) 522–525 (Oxford University Press, 2005).
Dellert, J. et al. NorthEuraLex: a wide-coverage lexical database of Northern Eurasia. Lang. Resour. Eval. 54, 273–301 (2020).
Google Scholar
Günther, F., Rinaldi, L. & Marelli, M. Vector-space models of semantic representation from a cognitive perspective: a discussion of common misconceptions. Perspect. Psychol. Sci. 14, 1006–1033 (2019).
Google Scholar
Bojanowski, P., Grave, E., Joulin, A. & Mikolov, T. Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5, 135–146 (2017).
Google Scholar
Van Paridon, J. & Thompson, B. subs2vec: word embeddings from subtitles in 55 languages. Behav. Res. Methods 53, 629–655 (2021).
Google Scholar
Tjuka, A., Forkel, R. & List, J.-M. Universal and cultural factors shape body part vocabularies. Sci. Rep. 14, 10486 (2024).
Google Scholar
Fu, Z. et al. Different computational relations in language are captured by distinct brain systems. Cereb. Cortex 33, 997–1013 (2023).
Google Scholar
Fick, S. E. & Hijmans, R. J. WorldClim 2: new 1-km spatial resolution climate surfaces for global land areas. Int. J. Climatol. 37, 4302–4315 (2017).
Google Scholar
LingPy. A Python Library for Quantitative Tasks in Historical Linguistics (MCL Chair at the University of Passau, Passau, 2023).
List, J.-M. Sequence Comparison in Historical Linguistics. Vol. 1 (Walter de Gruyter GmbH & Co KG, 2014).
Kirby, K. R. et al. D-PLACE: a global database of cultural, linguistic and environmental diversity. PLoS ONE 11, e0158391 (2016).
Google Scholar
Chen, G., Taylor, P. A., Shin, Y.-W., Reynolds, R. C. & Cox, R. W. Untangling the relatedness among correlations, Part II: inter-subject correlation group analysis through linear mixed-effects modeling. NeuroImage 147, 825–840 (2017).
Google Scholar
Swadesh, M. Lexico-statistic dating of prehistoric ethnic contacts: with special reference to North American Indians and Eskimos. Proc. Am. Philos. Soc. 96, 452–463 (1952).
Google Scholar
Kamholz, D., Pool, J. & Colowick, S. M. PanLex: Building a Resource for Panlingual Lexical Translation. In LREC (Vol. 14, pp. 3145–3150) (2014).
Adler, N. E., Epel, E. S., Castellazzo, G. & Ickovics, J. R. Relationship of subjective and objective social status with psychological and physiological functioning: preliminary data in healthy, White women. Health Psychol. 19, 586 (2000).
Google Scholar
Dockès, J. et al. NeuroQuery, comprehensive meta-analysis of human brain mapping. elife 9, e53385 (2020).
Google Scholar

Download references

Acknowledgements

We thank Haojie Wen, Xi Yu, Shuang Tian, Dingchen Zhang, and Shuyue Wang for help and comments on earlier drafts of the manuscript. We are grateful to Yongqin Liu for insightful discussions on climate-related analyses. This work was supported by the Brain Science and Brain-like Intelligence Technology - National Science and Technology Major Project 2021ZD0204100 (2021ZD0204104 awarded to Y.B.), and the National Natural Science Foundation of China (32595491 awarded to Y.B., 32171052 to X.W.).

Author information

Authors and Affiliations

State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China
Ze Fu, Yuxi Chu, Xiaosha Wang & Yanchao Bi
Faculty of Psychology, Beijing Normal University, Beijing, China
Tangxiaoxue Zhang & Yawen Li
School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing, China
Xiaosha Wang & Yanchao Bi
IDG/McGovern Institute for Brain Research, Peking University, Beijing, China
Xiaosha Wang & Yanchao Bi
Key Laboratory of Machine Perception (Ministry of Education), Beijing, China
Xiaosha Wang & Yanchao Bi
Institute for Artificial Intelligence, Peking University, Beijing, China
Yanchao Bi

Authors

Ze Fu
View author publications
Search author on:PubMed Google Scholar
Yuxi Chu
View author publications
Search author on:PubMed Google Scholar
Tangxiaoxue Zhang
View author publications
Search author on:PubMed Google Scholar
Yawen Li
View author publications
Search author on:PubMed Google Scholar
Xiaosha Wang
View author publications
Search author on:PubMed Google Scholar
Yanchao Bi
View author publications
Search author on:PubMed Google Scholar

Contributions

Y.B. conceived and supervised the research; Z.F. performed the research; Y.C., T.Z., and Y.L. collected rating data; Z.F. and Y.C. analyzed the data; X.W. contributed valuable discussions; and Y.B., Z.F., and X.W. wrote the paper.

Corresponding authors

Correspondence to Xiaosha Wang or Yanchao Bi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Johann-Mattis List and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. [A peer review file is available].

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Reporting Summary (download PDF )

Transparent Peer Review file (download PDF )

Source data

Source Data (download XLSX )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fu, Z., Chu, Y., Zhang, T. et al. Semantic similarity across languages reflects neurocognitive dimensions shaped by climate. Nat Commun (2026). https://doi.org/10.1038/s41467-026-70608-8

Download citation

Received: 30 October 2024
Accepted: 25 February 2026
Published: 16 March 2026
DOI: https://doi.org/10.1038/s41467-026-70608-8