Abstract
Human languages differ widely, yet they share systematic regularities in the underlying semantic representations being expressed. How such similarities and differences arise remains unclear, in part because semantic theories often lack a principled link to neurocognitive constraints. Drawing on neurocognitive accounts in which semantic knowledge is grounded in biologically salient information dimensions, we examine how environmental factors shape conceptual representations in language. Here we show that word meanings across languages are organized along shared neurocognitive dimensions, while systematic variation along these dimensions is associated with climate. Using word embeddings from 53 languages and behavioral ratings from speakers of 8 languages, we find converging evidence that climatic variables explain semantic variation beyond commonly considered sociocultural factors. Complementary exploratory brain data further suggest climate-related modulation of activity patterns in the right anterior temporal lobe. Together, these findings indicate that semantic representations in language reflect biologically grounded dimensions that are flexibly shaped by long-term environmental conditions.
Similar content being viewed by others
Data availability
The multilingual fastText embeddings used in Study 1 are publicly available at https://fasttext.cc. Additional distributional semantic vectors used in validation analyses were obtained from the subs2vec model (https://github.com/jvparidon/subs2vec). The NorthEuraLex word list and concept translations are available at http://northeuralex.org. Colexification data were obtained from the CLICS database (version 3.0; https://clics.clld.org). Climate variables were obtained from the WorldClim 2.0 database (https://www.worldclim.org). Cultural variables were obtained from the D-PLACE database (https://d-place.org). The behavioral rating data from Study 2 (13-dimensional ratings for 207 concepts in 8 languages), together with derived semantic matrices and environmental distance matrices, are available at OSF. The fMRI data re-analyzed in Study 3 were originally collected and shared by Malik-Moraleda et al.51 and are available at OSF. Our derived language-level neural representational dissimilarity matrices and analysis scripts are available at OSF. Source Data underlying the main figures and tables are provided with this paper. Source data are provided with this paper.
Code availability
All data analyses were conducted using Python (version 3.9.1) and R (version 4.5.2). Complete details of software packages and analysis code are available at OSF.
References
Wierzbicka, A. Semantics, Culture, and Cognition: Universal Human Concepts in Culture-Specific Configurations (Oxford University Press, USA, 1992).
Kay, P. & Kempton, W. What is the Sapir-Whorf hypothesis?. Am. Anthropol. 86, 65–79 (1984).
Majid, A., Bowerman, M., Kita, S., Haun, D. B. & Levinson, S. C. Can language restructure cognition? The case for space. Trends Cogn. Sci. 8, 108–114 (2004).
Berlin, B. & Kay, P. Basic Color Terms: Their Universality and Evolution (Univ of California Press, 1991).
Wierzbicka, A. Defining emotion concepts. Cogn. Sci. 16, 539–581 (1992).
Ekman, P. An argument for basic emotions. Cogn. Emot. 6, 169–200 (1992).
Majid, A. et al. Differential coding of perception in the world’s languages. Proc. Natl. Acad. Sci. USA 115, 11369–11376 (2018).
Passmore, S. & Jordan, F. M. No universals in the cultural evolution of kinship terminology. Evolut. Hum. Sci. 2, e42 (2020).
Majid, A., Jordan, F. & Dunn, M. Semantic systems in closely related languages. Language Sciences 49, 1–18 (2015).
Jackson, J. C. et al. From text to thought: how analyzing language can advance psychological science. Perspect. Psychol. Sci. 17, 805–826 (2022).
Youn, H. et al. On the universal structure of human lexical semantics. Proc. Natl. Acad. Sci. USA 113, 1766–1771 (2016).
Thompson, B., Roberts, S. G. & Lupyan, G. Cultural influences on word meanings revealed through large-scale semantic alignment. Nat. Hum. Behav. 4, 1029–1038 (2020).
Lewis, M., Cahill, A., Madnani, N. & Evans, J. Local similarity and global variability characterize the semantic space of human languages. Proc. Natl. Acad. Sci. USA 120, e2300986120 (2023).
Jackson, J. C. et al. Emotion semantics show both cultural variation and universal structure. Science 366, 1517–1522 (2019).
Martin, A., Haxby, J. V., Lalonde, F. M., Wiggs, C. L. & Ungerleider, L. G. Discrete cortical regions associated with knowledge of color and knowledge of action. Science 270, 102–105 (1995).
He, C. et al. Selectivity for large nonmanipulable objects in scene-selective visual cortex does not require visual experience. NeuroImage 79, 1–9 (2013).
Fernandino, L. et al. Concept representation reflects multimodal abstraction: a framework for embodied semantics. Cereb. Cortex 26, 2018–2034 (2016).
Fernandino, L., Tong, J.-Q., Conant, L. L., Humphries, C. J. & Binder, J. R. Decoding the information structure underlying the neural representation of concepts. Proc. Natl. Acad. Sci. USA 119, e2108091119 (2022).
Capitani, E., Laiacona, M., Mahon, B. & Caramazza, A. What are the facts of semantic category-specific deficits? A critical review of the clinical evidence. Cogn. Neuropsychol. 20, 213–261 (2003).
Miceli, G. et al. The dissociation of color from form and function knowledge. Nat. Neurosci. 4, 662–667 (2001).
Buxbaum, L. J., Veramontil, T. & Schwartz, M. F. Function and manipulation tool knowledge in apraxia: knowing ‘what for’ but not ‘how’. Neurocase 6, 83–97 (2000).
Deen, B. et al. Organization of high-level visual cortex in human infants. Nat. Commun. 8, 13995 (2017).
Wen, H., Xu, T., Wang, X., Yu, X. & Bi, Y. Brain intrinsic connection patterns underlying tool processing in human adults are present in neonates and not in macaques. NeuroImage 258, 119339 (2022).
Bottini, R. et al. Brain regions involved in conceptual retrieval in sighted and blind people. J. Cogn. Neurosci. 32, 1009–1025 (2020).
Wang, X., Men, W., Gao, J., Caramazza, A. & Bi, Y. Two forms of knowledge representations in the human brain. Neuron 107, 383–393.e385 (2020).
Bi, Y. Dual coding of knowledge in the human brain. Trends Cogn. Sci. 25, 883–895 (2021).
Mahon, B. Z. & Caramazza, A. What drives the organization of object knowledge in the brain? Trends Cogn. Sci. 15, 97–103 (2011).
Martin, A. GRAPES—Grounding representations in action, perception, and emotion systems: how object properties and categories are represented in the human brain. Psychon. Bull. Rev. 23, 979–990 (2016).
Binder, J. R. & Desai, R. H. The neurobiology of semantic memory. Trends Cogn. Sci. 15, 527–536 (2011).
Lambon-Ralph, M. A., Jefferies, E., Patterson, K. & Rogers, T. T. The neural and computational bases of semantic cognition. Nat. Rev. Neurosci. 18, 42–55 (2017).
Patterson, K., Nestor, P. J. & Rogers, T. T. Where do you know what you know? The representation of semantic knowledge in the human brain. Nat. Rev. Neurosci. 8, 976–987 (2007).
Binder, J. R. et al. Toward a brain-based componential semantic representation. Cogn. Neuropsychol. 33, 130–174 (2016).
Buchanan, E. M., Valentine, K. D. & Maxwell, N. P. English semantic feature production norms: an extended database of 4436 concepts. Behav. Res. Methods 51, 1849–1863 (2019).
McRae, K., Cree, G. S., Seidenberg, M. S. & McNorgan, C. Semantic feature production norms for a large set of living and nonliving things. Behav. Res. Methods 37, 547–559 (2005).
Kemmerer, D. Grounded cognition entails linguistic relativity: a neglected implication of a major semantic theory. Top. Cogn. Sci. 15, 615–647 (2023).
Zaslavsky, N., Kemp, C., Tishby, N. & Regier, T. Communicative need in colour naming. Cogn. Neuropsychol. 37, 312–324 (2020).
Jonauskaite, D. et al. Universal patterns in color-emotion associations are further shaped by linguistic and geographic proximity. Psychol. Sci. 31, 1245–1260 (2020).
Lindquist, K. A., Jackson, J. C., Leshin, J., Satpute, A. B. & Gendron, M. The cultural evolution of emotion. Nat. Rev. Psychol. 1, 669–681 (2022).
Bentz, C., Dediu, D., Verkerk, A. & Jäger, G. The evolution of language families is shaped by the environment beyond neutral drift. Nat. Hum. Behav. 2, 816–821 (2018).
Van de Vliert, E. The global ecology of differentiation between us and them. Nat. Hum. Behav. 4, 270–278 (2020).
Wormley, A. S., Kwon, J. Y., Barlev, M. & Varnum, M. E. How much cultural variation around the globe is explained by ecology? Proc. R. Soc. B 290, 20230485 (2023).
Grave, E., Bojanowski, P., Gupta, P., Joulin, A. & Mikolov, T. Learning word vectors for 157 languages. Preprint at https://arxiv.org/abs/1802.06893 (2018).
Grand, G., Blank, I. A., Pereira, F. & Fedorenko, E. Semantic projection recovers rich human knowledge of multiple object features from word embeddings. Nat. Hum. Behav. 6, 975–987 (2022).
Lewis, M., Zettersten, M. & Lupyan, G. Distributional semantics as a source of visual knowledge. Proc. Natl. Acad. Sci. USA 116, 19237–19238 (2019).
Chersoni, E., Santus, E., Huang, C.-R. & Lenci, A. Decoding word embeddings with brain-based semantic features. Comput. Linguist. 47, 663–698 (2021).
Cole, M. W., Bassett, D. S., Power, J. D., Braver, T. S. & Petersen, S. E. Intrinsic and task-evoked network architectures of the human brain. Neuron 83, 238–251 (2014).
Romney, A. K., Moore, C. C., Batchelder, W. H. & Hsia, T.-L. Statistical methods for characterizing similarities and differences between semantic structures. Proc. Natl. Acad. Sci. USA 97, 518–523 (2000).
Rzymski, C. et al. The database of cross-linguistic colexifications, reproducible analysis of cross-linguistic polysemies. Sci. Data 7, 13 (2020).
Kriegeskorte, N., Mur, M. & Bandettini, P. A. Representational similarity analysis-connecting the branches of systems neuroscience. Front. Syst. Neurosci. 2, 249 (2008).
Hammarström, H., Forkel, R., Haspelmath, M. & Bank, S. Glottolog 4.6 Leipzig: Max Planck Institute for the Science of Human History. https://glottolog.org/ (2022).
Malik-Moraleda, S. et al. An investigation across 45 languages and 12 language families reveals a universal language network. Nat. Neurosci. 25, 1014–1019 (2022).
Fedorenko, E., Hsieh, P.-J., Nieto-Castañón, A., Whitfield-Gabrieli, S. & Kanwisher, N. New method for fMRI investigations of language: defining ROIs functionally in individual subjects. J. Neurophysiol. 104, 1177–1194 (2010).
Kobrick, J. L. Effects of hypoxia and acetazolamide on color sensitivity zones in the visual field. J. Appl. Physiol. 28, 741–747 (1970).
Wang, Z.-X., Zhang, D.-L. & Ma, H.-L. The effect of high altitude on human color perception. Sheng Li Xue Bao 71, 833–838 (2019).
Sigismondi, F., Xu, Y., Silvestri, M. & Bottini, R. Altered grid-like coding in early blind people. Nat. Commun. 15, 3476 (2024).
Li, Y. et al. High-altitude exposure and time interval perception of Chinese migrants in Tibet. Brain Sci. 12, 585 (2022).
Talhelm, T. et al. Large-scale psychological differences within China explained by rice versus wheat agriculture. Science 344, 603–608 (2014).
Van de Vliert, E. Climato-economic habitats support patterns of human needs, stresses, and freedoms. Behav. Brain Sci. 36, 465–480 (2013).
Miyashita, Y. Perirhinal circuits for memory processing. Nat. Rev. Neurosci. 20, 577–592 (2019).
Brown, C. H. Hand and arm in The World Atlas of Language Structures (ed M. Haspelmath, Dryer, M.S., Gil, D., & Comrie, B.) 522–525 (Oxford University Press, 2005).
Dellert, J. et al. NorthEuraLex: a wide-coverage lexical database of Northern Eurasia. Lang. Resour. Eval. 54, 273–301 (2020).
Günther, F., Rinaldi, L. & Marelli, M. Vector-space models of semantic representation from a cognitive perspective: a discussion of common misconceptions. Perspect. Psychol. Sci. 14, 1006–1033 (2019).
Bojanowski, P., Grave, E., Joulin, A. & Mikolov, T. Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5, 135–146 (2017).
Van Paridon, J. & Thompson, B. subs2vec: word embeddings from subtitles in 55 languages. Behav. Res. Methods 53, 629–655 (2021).
Tjuka, A., Forkel, R. & List, J.-M. Universal and cultural factors shape body part vocabularies. Sci. Rep. 14, 10486 (2024).
Fu, Z. et al. Different computational relations in language are captured by distinct brain systems. Cereb. Cortex 33, 997–1013 (2023).
Fick, S. E. & Hijmans, R. J. WorldClim 2: new 1-km spatial resolution climate surfaces for global land areas. Int. J. Climatol. 37, 4302–4315 (2017).
LingPy. A Python Library for Quantitative Tasks in Historical Linguistics (MCL Chair at the University of Passau, Passau, 2023).
List, J.-M. Sequence Comparison in Historical Linguistics. Vol. 1 (Walter de Gruyter GmbH & Co KG, 2014).
Kirby, K. R. et al. D-PLACE: a global database of cultural, linguistic and environmental diversity. PLoS ONE 11, e0158391 (2016).
Chen, G., Taylor, P. A., Shin, Y.-W., Reynolds, R. C. & Cox, R. W. Untangling the relatedness among correlations, Part II: inter-subject correlation group analysis through linear mixed-effects modeling. NeuroImage 147, 825–840 (2017).
Swadesh, M. Lexico-statistic dating of prehistoric ethnic contacts: with special reference to North American Indians and Eskimos. Proc. Am. Philos. Soc. 96, 452–463 (1952).
Kamholz, D., Pool, J. & Colowick, S. M. PanLex: Building a Resource for Panlingual Lexical Translation. In LREC (Vol. 14, pp. 3145–3150) (2014).
Adler, N. E., Epel, E. S., Castellazzo, G. & Ickovics, J. R. Relationship of subjective and objective social status with psychological and physiological functioning: preliminary data in healthy, White women. Health Psychol. 19, 586 (2000).
Dockès, J. et al. NeuroQuery, comprehensive meta-analysis of human brain mapping. elife 9, e53385 (2020).
Acknowledgements
We thank Haojie Wen, Xi Yu, Shuang Tian, Dingchen Zhang, and Shuyue Wang for help and comments on earlier drafts of the manuscript. We are grateful to Yongqin Liu for insightful discussions on climate-related analyses. This work was supported by the Brain Science and Brain-like Intelligence Technology - National Science and Technology Major Project 2021ZD0204100 (2021ZD0204104 awarded to Y.B.), and the National Natural Science Foundation of China (32595491 awarded to Y.B., 32171052 to X.W.).
Author information
Authors and Affiliations
Contributions
Y.B. conceived and supervised the research; Z.F. performed the research; Y.C., T.Z., and Y.L. collected rating data; Z.F. and Y.C. analyzed the data; X.W. contributed valuable discussions; and Y.B., Z.F., and X.W. wrote the paper.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Johann-Mattis List and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. [A peer review file is available].
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Fu, Z., Chu, Y., Zhang, T. et al. Semantic similarity across languages reflects neurocognitive dimensions shaped by climate. Nat Commun (2026). https://doi.org/10.1038/s41467-026-70608-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-026-70608-8


