Abstract
We used a colour-term elicitation task to study the Russian colour inventory and compare the regiolects of young native speakers from Kazan (NK = 112) and Smolensk (NS = 143), two cities in European Russia about 2000 km apart. In Kazan, 362 unique terms were elicited, and in Smolensk, 423. Zipf-functions of term frequency and the Cognitive Salience Index based on the frequencies of terms and their positions in participants’ colour-term lists indicated the predominance of 12 Russian basic colour terms (BCTs), including two ‘blues’: sinij and goluboj. Other frequent, salient terms included an emerging BCT: birûzovyj ‘turquoise’. The core colour inventory was highly consistent across the two samples; by contrast, less frequent terms showed slight regional variation, likely reflecting differences in the local chromatic “visual diet”. The results highlight an ongoing process of lexical refinement within specific areas of colour space. Socio-cultural factors since the 1990s have spurred a surge of colour neologisms and non-canonical noun forms of Russian colour names. Russian developments align with broader cross-language tendencies in the enrichment of colour inventories, a process driven by cultural transmission in an increasingly globalised world.
Similar content being viewed by others
Introduction
This study investigates the colour-term (CT) vocabulary of young Russian speakers using an elicitation (list) task. We focus on dynamic processes in the Russian colour lexicon, specifically, neologisms and emerging linguistic models of colour naming that have developed over the past 30 years in response to the dramatic socio-economic changes of the post-Soviet era. A further aim is to examine diatopic variation in the CT inventory across two cities of European Russia: Kazan and Smolensk. Despite Russia’s geographical vastness, Russian is relatively homogeneous, particularly among younger speakers. Dialectal vocabulary differences are minor and typically limited to vernacular or pronunciation features, which do not hinder mutual intelligibility (Vendina 2016).
Universal colour-term inventory
Following Berlin and Kay’s (1969/1991) seminal work, it is widely accepted that there are universal, pan-cultural basic colour categories (BCCs) that partition the perceived colour gamut into conceptual regions. These are named by basic colour terms (BCTs), a “core” vocabulary posited to reach a maximum of 11: ‘white’, ‘black’, ‘red’, ‘green’, ‘yellow’, and ‘blue’ (primary BCTs), ‘brown’, ‘grey’, ‘orange’, ‘pink’, ‘purple’ (secondary BCTs).
Since 1969, the universalist hypothesis has been revised in light of theoretical critique of the proponents of the relativist view, who argue that colour categories are language-specific and shaped by non-trivial cultural constraints (Saunders & van Brakel 1997). Empirical findings have partly supported the linguistic-convention account of colour lexicon (for reviews, see Davidoff 2015; Paramei 2020; Paramei and Bimler 2022).
Currently, broadly accepted is the weak relativist account (for a review, see Lindsey and Brown 2021). According to it, along with universal cognitive factors, the variety of the terms that allow lexical distinction of perceived colour nuances is influenced by salient natural features, i.e., the “colour diet” of the language speakers (Josserand et al. 2021), and by the man-made visual environment (Thompson et al. 2020), e.g., prominent colour shades of artefactual objects, colourant technology etc.
Socio-cultural and historical factors (cultural transmission bias) shape the semantic structure of colour categories also non-universally, driving the emergence of novel (basic) CCs/CTs due to the need for communicative efficiency (Gibson et al. 2017; Jameson and Komarova 2009; Zaslavsky et al. 2019, 2022). The cultural transmission bias explains why the tenet of the upper limit of 11 BCTs has recently been reconsidered. Based on their data for Russian, with salient sinij ‘dark blue’ and goluboj ‘light blue’, Berlin and Kay (1969/1991) admitted the possibility of a 12th BCT in a language. Categorical partition of the BLUE area, i.e., two BCTs that differentiate light and dark(er) shades of blue, has since been confirmed for Russian, as well as for a number of Slavic, circum-Mediterranean and Far East languages (for a review, see Paramei and Bimler 2022).
Forestalling the following part, on Russian (basic) CT inventory, we conclude this subsection by recalling Berlin and Kay’s (1969/1991, p. 6) criteria for basicness of a colour term: it must (i) be monolexemic, (ii) not be subsumed under another BCT, (iii) apply not only to a limited class of objects, and (iv) be psychologically salient for all informants, i.e., appearing early in elicited lists, used frequently across contexts, and occur in all speakers’ idiolects.
Russian colour terms
In modern Russian, (psycho)linguistic studies consistently identify counterparts of ten Berlin–Kay BCTs: belyj ‘white’, čёrnyj ‘black’, krasnyj ‘red’, zelёnyj ‘green’, žёltyj ‘yellow’ (primary BCTs), and koričnevyj ‘brown’, oranževyj ‘orange’, fioletovyj ‘purple’, rosovyj ‘pink’, and seryj ‘grey’ (secondary BCTs) (Corbett and Davies 1995, 1997; Corbett and Morgan 1988; Davies and Corbett 1994; Frumkina 1984; Kul’pina 2001, 2007, 2019; Paramei et al. 2018; Vasilevich et al. 2016). Two distinct BCTs for ‘blue’ – sinij ‘dark blue’ and goluboj ‘light blue’ – make Russian an exceptional case of a 12-term inventory (Corbett and Davies 1995, 1997; Griber et al. 2018; Martinovic et al. 2020; Paramei 2005, 2007; Paramei et al. 2018). In its frequency, sinij is comparable to primary, while goluboj to secondary BCTs (Laws et al. 1995; Vamling 1986).
Several recurring findings characterise Russian CTs:
-
PURPLE: Russian fioletovyj (lit. ‘violet’) most closely meets the BCT criteria for ‘purple’ but covers a limited denotative range (Paramei et al. 2018). The PURPLE area is lexically elaborated with highly frequent non-BCTs: in the bluish sub-area, fioletovyj is “competed” by sirenevyj ‘lilac’ and lilovyj ‘mauve’; in the reddish part, it is complemented by malinovyj ‘raspberry’, bordovyj ‘claret’, purpurnyj ‘cardinal red’, and višnëvyj ‘cherry-coloured’, as well as older terms like bagrovyj ‘crimson’ and bagrânyj ‘purplish-red’ (Frumkina 1984; Safuanova and Korzh 2007; Vasilevich 1983, 1988). Recent decades have seen enrichment through English loanwords such as fuksiâ ‘fuchsia’ and madženta ‘magenta’ (Griber et al. 2018, 2021; Paramei et al. 2018; Vasilevich et al. 2016).
-
BROWN: The BCT koričnevyj, introduced in the 17th century, has a high-frequency ranking although lower than that of other BCTs (Corbett and Davies 1995; Davies and Corbett 1994; Morgan and Corbett 1989). Its usage is constrained predominantly to combinations with nouns for artefacts, while its older counterpart buryj (‘dust brown’) denotes natural objects (Bochkarev et al. 2023; Krylosova and Tomachpolski 2013; Rakhilina and Paramei 2011). Analysis of the Russian sub-corpus of Google Books Ngram shows that koričnevyj steadily increases the range of denoted objects, gradually expanding into the realm of natural objects and supplanting buryj (Bochkarev et al. 2023).
-
Richness and morphology: Russian exhibits great lexical richness in colour naming, including numerous hyponyms (e.g., vasil’kovyj ‘cornflower’, salatovyj ‘lettuce-coloured’). Furthermore, as “a highly inflectional language with a rich morphological system” (Masini and Benigni 2012, p. 447), Russian allows nuanced colour naming through a variety of suffixes expressing attenuative, approximate and/or evaluative meanings such as -ovat- (translated by English -ish) or -en’k- (affective diminutive). Abundant are polylexemic names – compounds (e.g., žёlto-zelёnyj ‘yellow-green’), terms with achromatic modifiers such as ‘light’, ‘bright’, ‘dark’, ‘dull’, ‘pale’ or ‘tender’, as well as CTs reflecting expressive complexity (e.g., glubokij mšisto-zelënyj ‘deep lichen green’). Terms of the pattern cveta Х ‘colour of X’ are also common (e.g., cvet svežej travy ‘colour of fresh grass) (Kharchenko 2023; Kravcova 2017; Kul’pina 2001, 2019; Rakhilina 2008; Vasilevich et al. 2016).
-
Neologism boom: In the post-Soviet era (since the 1990s), Russian has seen a “neologism boom”, the expression coined by Krylosova (2013) to capture an explosive growth of lexical innovations in Russian colour vocabulary. Its manifestation is an ongoing emergence of novel CTs and an increased use of previously infrequent terms, e.g., lavandovyj ‘lavender’ and metalličeskij ‘metallic’ (Griber et al. 2018, 2021; Krylosova 2013; Paramei et al. 2018). Another observed tendency, especially in youth lexicon, is the truncation of canonical adjectival CTs derived from objects into noun stems (e.g., lajmovyj → lajm ‘lime’, baklažanovyj → baklažan ‘aubergine’, etc.), the metonymic pattern typical of English. These innovations are linked to a surge of international trade contacts accompanied by a dramatic transformation of consumer markets with a substantial influx of Western products, whose advertising often includes Russian transliterations of English, French or Italian colour names (Griber and Mylonas 2015; Kravcova 2017; Krylosova 2013; Krylosova and Tomachpolski 2014, 2017; Marinova 2019, 2023a, b; Vasilevich 2003; Vasilevich et al. 2016).
Elicitation task
We applied the elicitation technique, or method of free-listing developed by Weller and Romney (1988), in which participants list as many domain-related terms as possible within a short time (typically 5 min) by either writing the terms down or giving oral responses. This technique yields rich, domain-relevant data and is powerful for ascertaining the salience of frequently mentioned items (Borgatti 1999; Quinlan 2019). The method was shown to be convenient in the colour domain in several languages (Alldrick 2025; Borgatti 1998; Corbett and Morgan 1988; Del Viva et al. 2023; Jakovljev and Zdravković 2018; Uusküla 2007; Xu et al. 2023, to name just a few).
Across the obtained lists, one estimates the number of times each CT is elicited and the decreasing order of the terms’ frequency (F): a few CTs are mentioned by many respondents, but many other terms are listed by only a few. The frequency (and its rank) reliably distinguishes BCTs from non-BCTs and can differentiate between primary and secondary BCTs (Corbett and Davies 1997; Davies and Corbett 1995). The elicited CTs, sorted according to their frequency (from largest to smallest), are a function that reflects the term’s salience gradient and follows a roughly exponential decline. It often shows an “elbow”, a natural break that allows identification of the most salient CTs as having basic status. These are separated from non-BCTs – hyponyms, complex and idiosyncratic terms, with stretches of slower decline and less punctuated drops (Bimler and Uusküla, 2018; Del Viva et al. 2023; Jakovljev and Zdravkoviċ 2018; Kuriki et al. 2017; Lindsey and Brown 2014).
The second measure is the term’s occurrence (position) on the list: the closer the CT is to the top of the list, the more salient it is. Across the lists, the term is indexed by its mean position (mP) and the corresponding rank. Sutrop (2001) introduced the Cognitive Salience Index (S), which combines both the term’s F and mP, allowing for a clear-cut break between BCTs and non-BCTs. The S-index is broadly used to operationalise Berlin–Kay’s criterion (iv).
Kazan and Smolensk: dialectal background
Kazan, the capital of the Republic of Tatarstan, in the southwest of Russia (55°47′ N 49°6′ E), lies 700 km east of Moscow on the Volga River (Fig. 1). It is an important cultural centre with a population of 1.31 million (Čislennost’ 2023). Historically part of Volga Bulgaria, the Golden Horde, and the Kazan Khanate, Kazan inherited Islamic-Turkic traditions alongside Finno-Ugric, Bulgarian, and Russian influences. In modern Tatarstan, with two official languages, Russian is influenced by Tatar (Alldrick 2025). Middle Russian dialects of the Volga-Kama region are shaped by contact with Finno-Ugric and Turkic languages of neighbouring republics Mari El, Udmurtia, Chuvashia and Bashkortostan (Heard 2011; Vendina 2014).
Map of European Russia; highlighted are the locations of Kazan and Smolensk.
Smolensk (54°45′ N 32°4′ E), 360 km west-southwest of Moscow on the Dnieper River (Fig. 1), has 312,896 inhabitants (Čislennost’ 2023); it is an administrative, industrial and cultural centre, one of the oldest Russian cities. As a Belarusian-Russian border city, Smolensk culture reflects strong Belarusian influence. Its dialect belongs to the western group of Southern Russian dialects, closely linked to Belarusian.
The natural environment in Kazan and Smolensk is not very dissimilar: Both cities share a continental climate with four distinct seasons (average +17–22 °C in summer, –9 to –12 °C in winter) and both lie at cultural crossroads. Yet, their historical trajectories have fostered distinct linguistic environments.
Aims of the study
The present study pursues three aims:
-
1.
To assess the Russian CT lexicon in young native speakers through elicitation data, estimating the BCT inventory on the basis of frequency and cognitive salience.
-
2.
To explore the salience of high-frequency non-BCTs that may be emerging as culturally basic terms.
-
3.
To evaluate diatopic variation in CT inventories between Kazan and Smolensk, where regional lexical differences have been noted in semantic domains other than colour (Kazan: Safonova et al. 2014; Smolensk: Lunkova 2020).
This study contributes novelty by (i) quantifying cognitive salience of Russian BCTs (N = 12) and about 20 high-frequency non-BCTs; (ii) documenting the post-Soviet “neologism boom” in the colour domain; (iii) analysing polylexemic expressions that enhance communicative efficiency; (iv) identifying colour space areas of ongoing lexical refinement in Russian and aligning them with cross-linguistic trends.
Methods
Participants
Respondents from two cities, Kazan and Smolensk, were comparable in age, education, and socio-economic background. The Kazan sample comprised students of Kazan Federal University (NK = 112; 79 females; aged 18–20 years). The Smolensk sample comprised students of the Smolensk State University (NS = 143; 103 females; aged 16–32 years). All participants were native Russian speakers; some were simultaneous bilinguals who acquired and used two languages from birth (e.g., Tatar in Kazan or Belarusian in Smolensk).
All participants reported normal or corrected-to-normal visual acuity. They were recruited as volunteers and were unaware of the study’s purpose. The study followed the guidelines of the Declaration of Helsinki and was approved by the respective local ethics committees. Written informed consent was obtained from all participants prior to testing.
Procedure
The elicitation task followed Davies and Corbett (1994). In Kazan, it was administered in person at the university; in Smolensk, either in person or online (via Zoom or Meet). Before data collection, participants received an Excel file (via Google Drive) containing the following instructions in Russian: “Please write as many colours as you know by typing them with your keyboard. You have 5 min for this task”. The experimenter started and stopped the clock after the 5-min interval. Participants entered their responses using the Cyrillic alphabet, and individual lists of CTs were recorded in the order they were produced.
Results
Individual CT lists (‘Raw data_Kazan’, ‘Raw data_Smolensk’) can be found in the Figshare repository: https://doi.org/10.6084/m9.figshare.27663297. The individual raw data were cleansed (see Supplementary Materials: ‘Details of data analysis → Data cleansing’) and then consolidated as group data (see ‘Preprocessed_data_Kazan’, ‘Preprocessed_data_Smolensk’ in https://doi.org/10.6084/m9.figshare.27663297). Analyses were conducted separately for the Kazan and Smolensk datasets (frequency estimates are available in ‘DATA_Kazan’, ‘DATA_Smolensk’ in https://doi.org/10.6084/m9.figshare.27663297). Cleansed lists of the Russian colour names were transliterated into Latin script using the free Online Transliterator (n/d). English glosses followed Frumkina and Mikhejev (1996).
Comparisons of the Kazan and Smolensk data samples focused on the following aspects:
-
the number, variety, and word composition of elicited CTs;
-
CT frequency and list position;
-
core colour inventory vs. less common CTs (Zipf-function);
-
Cognitive Salience Index (S; see above) of frequent CTs;
-
conceptual associations of frequent CTs, visualised by dendrograms and semantic maps.
The number and variety of elicited colour terms
We assessed the total number of unique colour names and the proportions of monolexemic vs. polylexemic CTs. For polylexemic CTs, we further analysed the distribution of part-of-speech (POS) compositions using the pymorphy2 PoS-tagging algorithm.
Kazan participants produced 2775 items (362 unique CTs), while Smolensk participants produced 3918 items (423 unique CTs). Individual lists ranged from 9 to 46 items in Kazan (mean 24.1) and 16 to 42 in Smolensk (mean 27.0). (Full frequency-ordered lists are in Tables S1 and S2 of Supplementary Materials.)
The large variety of CTs reflects both sensitivity to colour shades and the need for their nuanced lexicalisation. In addition to the 12 Russian BCTs, both samples include numerous derived forms with achromatic modifiers (e.g., tëmno-seryj ‘dark grey’), substantive qualifiers (e.g., nebesno-goluboj ‘sky light blue’, lajmovo-zelënyj ‘lime green’), and compounds (e.g., rozovo-koričnevyj ‘pink-brown’). We also observe emotionally-laden and/or aestheticising (evaluative) qualifiers (e.g., nežno-rozovyj ‘tender pink’, boleznenno-žëltyj ‘painfully yellow’), and (polylexemic) descriptions with affective diminutive suffixes (-ovat-, -en’k-) (e.g., želtovato-kremovyj ‘yellowish-creamy’, sinen’kij ‘endearing sinij’).
Derivational productivity differs across BCTs. In Kazan, the richest sets of polylexemic descriptors are for krasnyj ‘red’ (25), rozovyj ‘pink’ (20), sinij ‘dark blue’ (19), and zelënyj ‘green’ (19). In Smolensk, the most productive are sinij ‘dark blue’ (25), žëltyj ‘yellow’ (24), and zelënyj ‘green’ (22).
Both datasets contain frequent non-BCTs (with %F in Kazan and Smolensk, respectively): beževyj ‘beige’ (71, 69), birûzovyj ‘turquoise’ (61, 59), salatovyj ‘lettuce-coloured’ (52, 52), bordovyj ‘claret’ (51, 54), malinovyj ‘raspberry’ (43, 52), sirenevyj ‘lilac’ (32, 39). These also appear in modified, compounded or suffixed forms (e.g., svetlo-birûzovyj ‘light turquoise’, malinovo-seryj ‘raspberry-grey’, sirenevatyj ‘lilac-ish’). Many non-BCTs are motivated by food and beverages, (semi-)precious stones, objects of fauna and flora, metals, artefacts, and natural phenomena.
Composition of colour descriptors
As shown in Fig. 2 and Table S3, types of colour descriptors are very similar in both lists: most CTs are adjectival derivatives of object glosses (Kazan 70.9%, Smolensk 67.7%). Smolensk respondents more often produced non-canonical noun CTs (e.g., oxra ‘ochre’, mâta ‘mint’) than Kazan respondents (13.3% vs. 5.4%). Proportion of not-yet-conventionalised forms of cvet X “colour of X” or ottenok X “tint/shade of X” was low: 6.5% in Kazan (16 items) and 2% in Smolensk (six items).
Proportion of (polylexemic) colour names with different part-of-speech compositions in Kazan (left) and Smolensk (right).
Frequency of colour terms
For each CT, we calculated its frequency (F) and derived corresponding rank values (RF). Since the two samples differed in size, we report predominantly relative frequency (%F = 100 × F/N), the percentage of participants’ lists containing the term. Tables S4 and S5 list frequent CTs produced by ≥5 participants (Kazan: NK-F = 86; Smolensk: NS-F = 110), with frequency (F, %F), alongside respective rank (RF).
Figure 3 plots relative frequencies (%F) of the CTs with the highest frequency (for brevity, truncated to NF = 40 of each dataset). In both samples, 77% of responses comprise the most frequent CTs. The 12 BCTs dominate the top ranks, though in different order, with ‘black’ championing in Kazan and ‘red’ in Smolensk. The five most frequent non-BCTs (RF = 13–17) are identical across samples: ‘beige’, ‘turquoise’, ‘lettuce-coloured’, ‘claret’, ‘raspberry’. For 75 CTs present in both Kazan and Smolensk lists (Tables S4 and S5), frequency ranks (RF) correlate strongly between the cities (ρRF = 0.906, p < 0.001), indicating a largely stabilised colour lexicon.
Bars are colour-coded by gloss.
Both %F functions decline exponentially. Guided by Borgatti (1998), we searched for natural breaks in the distribution. In both %F functions, we discern a distinct break between koričnevyj ‘brown’ (RF = 11.5) and beževyj ‘beige’ (RF = 13), which manifests the hurdle between BCTs and non-BCTs. In Kazan, one more break occurs between malinovyj ‘raspberry’ (RK-F = 17) and purpurnyj ‘cardinal red’ (RK-F = 18), while in Smolensk between birûzovyj ‘turquoise’ (RS-F = 15) and salatovyj ‘lettuce-coloured’ (RS-F = 16).
The Zipf-function
We estimated a lognormal function reflecting the term’s “popularity”, the function intrinsically connected to a power-law relation (Mitzenmacher 2004) and called here the Zipf-function to align with the terminology in previous pertinent works. Following Lindsey and Brown (2014), we regressed the log-transformed proportion of participants listing a term (log%F) against the log-transformed frequency rank (RF). On a logarithmic scale, Zipf-functions typically yield a near-linear decline. We fitted the Zipf-function limbs with piecewise linear approximation using Muggeo’s (2003) algorithm (Python implementation), and estimated 95% confidence intervals (CIs) for slope coefficients using the Wald test.
In large language corpora, Zipf-functions typically reveal two segments, differing in the exponents and separating a kernel lexicon from a less popular lexicon used for specific communication (Cancho and Solé 2003). In the present context, two Zipf-function segments would be expected to reflect a division between BCTs and non-BCTs. However, recent studies of colour lexicons in other languages (Brown et al. 2016; Del Viva et al. 2023; Jakovljev and Zdravković 2018; Lindsey and Brown 2014) show that a three-segment Zipf-function fits better, differentiating frequent non-BCTs – potentially emergent BCTs (second segment) – from rarer tertiary CTs (third segment) (Lindsey and Brown 2021).
Applying Muggeo’s (2003) algorithm to high-frequency CTs (NK-F = 86, NS-F = 110), we identified three segments in both datasets (R2 > 0.98), and estimated break-points and segment slopes (Fig. 4). Limb 1 comprises the 12 BCTs (in Tables S4–S5 shaded dark grey) listed by a great majority of the participants (Kazan: %F ≥ 79.5; Smolensk: %F ≥ 82.5). Limb 1 slopes are close to zero: Kazan (−0.07; 95% CI [(-0.09), (-0.05)]); Smolensk (–0.06; 95% CI [(-0.08), (-0.04)]).
Limb slopes fitted with trilinear models; data points of limbs 1 and 2 are colour-coded by gloss. Kazan (left): Limb 1, 12 BCTs (RK-F = 1–12). Limb 2, seven highly popular non-BCTs (RK-F = 13–19). Limb 3, low-popularity non-BCTs (RK-F = 20–81.5; unfilled circles). Smolensk (right): Limb 1, 12 BCTs (RS-F = 1–12). Limb 2, 13 highly popular non-BCTs (RS-F = 13–24.5). Limb 3, low-popularity non-BCTs (RS-F = 26–102.5; unfilled circles).
For the CTs with RF ≥ 12, Muggeo’s (2003) algorithm marked the second negatively sloping segment corresponding to highly frequent non-BCTs (in Tables S4 and S5 shaded light grey). For Kazan, limb 2 has a slope of (-1.95; 95% CI [(-2.35), (-1.55)]) and includes seven popular non-BCTs (70.5% ≥ %F ≥ 32.1%): beževyj ‘beige’, birûzovyj ‘turquoise’, salatovyj ‘lettuce-coloured’, bordovyj ‘claret’, malinovyj ‘raspberry’, purpurnyj ‘cardinal red’, and sirenevyj ‘lilac’. For Smolensk, limb 2 has a slope of (−1.21; 95% CI [(-1.32), (-1.10)]) and comprises 13 popular non-BCTs (69.2% ≥ %F ≥ 34.3%), including, in addition, xaki ‘khaki’, fuksiâ ‘fuchsia’, alyj ‘scarlet’, persikovyj ‘peach’, and two terms for metallic brilliance, zolotoj ‘gold’ and serebrânyj ‘silver’.
Limb 3 includes all remaining non-BCTs, with a slope for Kazan (−1.39; 95% CI [(-1.44), (-1.34)]) and for Smolensk (−1.52; 95% CI [(-1.56), (-1.48)]). This segment includes non-common, tertiary CTs: Kazan, 30.4% ≥ %F ≥ 4.5% (ranked 20.5–81.5); Smolensk, 28.7% ≥ %F ≥ 3.5% (ranked 26–102.5). The Mann–Whitney U test showed that limb 1 slopes do not differ (p = 0.08), confirming the equivalence of the BCT lexicon in the two locations. In comparison, slopes of limb 2 and limb 3 differ between the two datasets (p = 0.01 and p < 0.001, respectively).
Mean position of colour terms
For each CT, we calculated its mean position (mP). A derived rank value (RmP) is the diagnostic measure that usually discerns primary and secondary BCTs. In Kazan, the six primary BCTs have the highest RmP (Table S4) in the order: krasnyj ‘red’, zelёnyj ‘green’, žёltyj ‘yellow’, sinij ‘dark blue’, čёrnyj ‘black’, belyj ‘white’. In Smolensk (Table S5), the first six RmP are taken by krasnyj ‘red’, sinij ‘dark blue’, žёltyj ‘yellow’, zelёnyj ‘green’, čёrnyj ‘black’, goluboj ‘light blue’, while RmP = 8 of belyj ‘white’ falls within the rank range of secondary BCTs.
Overall, RmP correlation between cities was moderate (ρ RmP = 0.561, p < .001). However, as prompted by Fig. 5, RmP of the 13 lowest-ranking CTs – the 12 BCTs plus birûzovyj ‘turquoise’ – correlate highly (ρ RmP = 0.945, p < 0.001). RmP correlation of further eight frequent non-BCTs is also high (ρ RmP = 0.873, p < 0.001): bordovyj ‘claret’, malinovyj ‘raspberry’, beževyj ‘beige’, sirenevyj ‘lilac’, salatovyj ‘lettuce-coloured’, purpurnyj ‘cardinal red’, lilovyj ‘mauve’, and izumrudnyj ‘emerald’. For rarer non-BCTs (RmP > 21), correlation was low. This is an expected outcome of listing behaviour, whereby highly salient CTs appear near the list start, limiting the differences among their ranks; conversely, less salient CTs can appear anywhere in a list, which allows larger ranking differences among them.
Colour-coded are the points for the terms with the highest RmP correlation (12 BCTs and nine frequent non-BCTs).
Cognitive Salience Index
To integrate CT’s frequency and mean position, we computed the Cognitive Salience Index (S) following Sutrop (2001):
where F is the number of participants listing the term, N the total number of participants, and mP the mean list position. The S-index is independent of list length; it ranges between 1 (term present in all lists) and 0 (term absent in any list), and yields a Cognitive Salience ranking (RS) in descending order. An “elbow” in the S-index function provides an additional criterion for discriminating between basic and non-basic CTs. The S-index allows cross-language comparison with previously published values.
Tables 1 and 2 present CTs with the highest S-index (S > 0.01) in Kazan (NK-HS = 36) and Smolensk (NS-HS = 33). As expected, the 12 BCTs dominated (shaded dark grey in Tables 1 and 2). As illustrated by Fig. 6, in both samples, krasnyj ‘red’ was the most salient (Kazan: S = 0.215, Smolensk: S = 0.267), followed by the other five primary BCTs. Note a drop-off before the second-salient BCT – žёltyj ‘yellow’ in Kazan (S = 0.124) and sinij ‘dark blue’ in Smolensk (S = 0.142). Across the CTs with the highest S-index, shared by Kazan and Smolensk respondents (NHS = 30), RS correlation was high (ρRS = 0.98, p < 0.001), indicating a consistent Russian core colour inventory.
Bars are colour-coded by gloss.
Lists of most salient non-BCTs (0.04 ≤ S ≤ 0.02) overlap extensively (shaded light grey in Tables 1 and 2). Smolensk respondents revealed slightly higher salience of persikovyj ‘peach’, a ‘fancy’ term, and fuksiâ ‘fuchsia’, a recent loanword. More salient in this sample is also xaki ‘khaki’, which we attribute to a perceptible uniform population in Smolensk that hosts a military academy.
Following Bimler and Uusküla (2021), we computed log-transformed estimates of normalised frequency Fi/N (where NK = 112, NS = 143) and mPi for each salient term i (named by at least five participants, as specified in Tables S4 and S5). Figure 7 presents scatterplots that depict CT’s mean position [−log10(mPi)] in the Kazan and Smolensk lists as a function of its normalised frequency [−log10(Fi/N)].
In Fig. 7, Fi/N decreases from left to right, whereas mPi decreases from the top downward. At the top left, one finds the 12 BCTs, i.e., the terms that are most readily retrieved when participants are asked to “think of colour names”. From there, other terms align along a steep linear segment, where Fi/N decreases only slightly but mPi declines more rapidly. This trajectory subsequently reaches an “elbow,” which marks the transition to a second, less steep linear segment characterised by a sharper progressive decline in frequency. In both datasets, this “elbow” is located around birûzovyj ‘turquoise’: it borders the least basic koričnevyj, while abutting the (next) most frequent non-BCTs (salatovyj, bordovyj, beževyj and malinovyj). Further toward the right, the distribution of other non-BCTs diverges into a wider scatter.
Conceptual closeness of frequent colour terms
We assumed that in a semantic network, closely associated terms tend to prime each other and, thus, occur in the lists in close proximity, with one term often immediately preceding or following the other (Friendly 1977). Following Uusküla and Bimler (2016), we estimated an association measure of ‘adjacency’, ADJ(i,j), between the i-th and the j-th CTs. (For adjacency formulae, see Supplementary Materials: ‘Details of data analysis → Measure of colour-term adjacency’.) To reconstruct patterns of conceptual closeness among the elicited CTs, we analysed ADJ(i,j) matrices by applying hierarchical cluster analysis (HCA), with inter-term adjacencies as entries. For the analysis, we included CTs with S > 0.01, whose ADJ(i,j) could be estimated with confidence: Kazan: NK-HS = 36; Smolensk: NS-HS = 33 (adjacency matrices are available in https://doi.org/10.6084/m9.figshare.27663297).
Outcomes of HCA for Kazan and Smolensk are presented as dendrograms (Fig. 8). The dendrograms bring out CT clusters, the ‘chunking’ of the terms, that tended to emerge as self-contained sublists within the listing sequence, such that listing any one CT made it easier to include the rest of a ‘chunk’ before continuing with the listing sequence. Figure 8 reveals strong clustering of BCTs: at the highest agglomerative level, in both samples, these are conceptually distinct from frequent non-BCTs. Noteworthy is a robust link between sinij and goluboj within the BCT clusters. Notably, the placement of koričnevyj ‘brown’ differs between the samples: it is grouped with other BCTs in Smolensk but with beževyj in Kazan. The two samples also slightly differ in a distinction pattern within the BCT cluster: while Kazan speakers contrast chromatic and achromatic BCTs, Smolensk speakers discern BCTs, which denote vivid chromatic colours, and achromatic BCTs adjoined by BCTs denoting low-saturation chromatic colours, rozovyj and koričnevyj.
Dendrograms of colour terms with S > 0.01 in Kazan (top) and Smolensk (bottom), colour-coded by gloss.
At the intermediate agglomerative level, associational connections vary between clusters of non-BCTs: some comprise CTs of comparable salience, while other clusters manifest a gradient in the shared chromatic content or an achromatic distinction. The associational grounds slightly differ between the two samples. In the Kazan dendrogram, for instance, one can discern a cluster that comprises frequent hyponyms denoting bluish (‘lilac’, ‘lavender’) and reddish shades (‘cardinal red’, ‘fuchsia’) of the PURPLE area; the rightmost cluster groups two entrenched phrasal units (‘colour of sea wave’, sero-buro-malinovyj). In comparison, in Smolensk, the rightmost cluster groups non-BCTs denoting colours of low chromaticity (‘beige’, ‘lilac’); in another cluster, ‘dark’-modified terms (tëmno-sinij, tëmno-zelënyj) coalesce.
Along with HCA, we processed the ADJ(i,j) matrices using multidimensional scaling (MDS) analysis. This enabled reconstructing semantic maps of CTs, an auxiliary way of representing their conceptual closeness. (For algorithms, see Supplementary Materials: ‘Details of data analysis → HCA and MDS analyses of conceptual closeness of colour terms’.) Semantic maps visualise CTs geometrically as points in a low-dimensional space, where inter-point distances reflect the corresponding inter-term adjacencies, while dimensions represent semantic attributes that govern the succession of recalled CTs.
Semantic maps for Kazan and Smolensk (Fig. S1) and their discussion, as well as details of MDS solutions and coordinates of CT-points, are presented in Supplementary Materials: ‘Semantic maps of the elicited colour terms’. Here, we shortly remark that, superimposed onto the semantic maps, the HCA clusters provide additional insight into the semantic features of conceptually close terms (van der Klis and Tellings 2022).
Discussion
General factors shaping colour-term inventories
The development of the colour-term vocabulary in a language is influenced by both universal and culture-specific factors. As surmised by Berlin and Kay (1969/1991), the fine-grainedness of a colour-naming system is associated with societal complexity and its level of industrialisation. As was demonstrated more recently by Majid et al. (2018), expressibility of discriminated colours (“codability”) is higher for larger populations, since a higher likelihood of interaction with strangers increases the need for more specific vocabulary. Thus, the emergence of fine-grained meanings corresponding to colours and shades of lightness is driven by the need for efficient communication, ensuring that colour descriptions can be made consistently and accurately (Conway et al. 2020; Gibson et al. 2017; Zaslavsky et al. 2019, 2022). Lexical refinement in the colour domain is stipulated both by “nature” – variation in reflectance encountered in the visual environment (Baddeley and Attewell 2009; Komarova and Jameson 2008) – and by “culture”, the degree of interest in colour (Kemp et al. 2018; Regier et al. 2016) and exposure to variation of colour in artefacts (Josserand et al. 2021).
Richness of Russian colour vocabulary
Our data reveal an extensive lexical variety in the modern Russian colour vocabulary (Tables 1, 2, and S1–S5): alongside BCTs and hyponyms, young speakers use numerous polylexemic terms and phrasal units (Fig. 2). The total number of unique colour names, 362 in Kazan and 423 in Smolensk, far exceeds earlier findings for Moscow (126 terms; Davies and Corbett 1994) or St. Petersburg (146 terms; Uusküla and Bimler 2016). This suggests that the Russian colour lexicon has expanded, partly through neologisms and loanwords introduced in recent decades.
Individual list lengths reinforce this trend, with, on average, 24.1 items in Kazan (range 9–46) and 27.0 in Smolensk (range 16–42), compared to about 20 terms in earlier studies. The discrepancy may partly be explained by differences in the procedure of data collection: in earlier studies, CTs were written down by respondents or elicited orally and written down by the experimenter, whereas our participants typed responses directly. Demographic factors – gender and age – may also matter. Specifically, in earlier Russian studies, there was a greater proportion of male participants, whereby men tend to list fewer CTs than women (Krimer-Gaborović and Jakovljev 2022). Russian participant samples in earlier studies were somewhat older (between 18 and 65 years), so may have exhibited a reduced colour vocabulary compared to those in their 20s, as in our participant cohort (cf. Griber et al. 2021).
Core Russian colour inventory
Approximately 25 terms, very similar in both datasets, show the highest cognitive salience (Tables 1 and 2), forming a shared core Russian colour inventory. Unsurprisingly, the 12 BCTs dominate. Krasnyj ‘red’ consistently ranks highest, which attests to the persistence of its cognitive salience for Russian speakers across a considerable period of time (Bimler and Uusküla, 2014; Davies and Corbett 1994; Morgan and Corbett 1989). It is followed by zelёnyj ‘green’, žёltyj ‘yellow’, and sinij ‘dark blue’, denoting the canonical ‘cardinal hues’, along with the achromatic čёrnyj ‘black’ and belyj ‘white’. Secondary BCTs rank below these. Notably, in Kazan, černyj has the highest frequency (RK-F = 1) and higher salience (RK-S = 3) than in Smolensk (RS-F = 2.5 and RS-S = 5 respectively), plausibly reflecting the symbolic prominence of black in Kazan’s cultural and religious environment (Doherty 2017). The two “Russian blues” are both highly salient, with sinij (RK-S = 5, RS-S = 2) being within primary and goluboj (RK-S = 9, RS-S = 7) secondary BCTs. Their basic status confirms earlier (psycho)linguistic findings (e.g., Corbett and Morgan 1988; Paramei 2005, 2007).
Salience ranking of the Russian BCTs overall resembles that in many other languages (e.g., Corbett and Davies 1995; Krimer-Gaborović and Jakovljev 2022; Moreira et al. 2024; Uusküla and Sutrop 2007). The noteworthy exception is goluboj, although, as a 12th BCT, it has counterparts in other languages with two “basic blues”, such as Greek (Androulaki et al. 2006), Italian (Del Viva et al. 2023), Japanese (Kuriki et al. 2017), Turkish (Bimler and Uusküla 2014), and Spanish dialects (Lillo et al. 2018).
Non-BCTs with the highest salience included beževyj ‘beige’, birûzovyj ‘turquoise’, bordovyj ‘claret’, salatovyj ‘lettuce-coloured’, and malinovyj ‘raspberry’. These five terms were among the most frequent non-BCTs 30 years ago (Davies and Corbett 1994), as well as more recently (Bimler and Uusküla 2014), and likely are emerging BCTs.
Notably, this mirrors processes of developing most “popular” non-BCTs into BCTs documented in several other languages. Specifically, for ‘turquoise’, denoting an insertion category at the border of the BLUE and GREEN areas, there is accumulating evidence in German (Jones 2013; Zollinger 1984), American English (AE; Lindsey and Brown 2014), and Italian (Del Viva et al. 2023); Mylonas and MacDonald (2016) argue that turquoise has augmented basic colour inventory of British English (BE). ‘Beige’, the term originating from French (Morgan 1993), that emerged to lexicalise a hard-to-name area between YELLOW and BROWN, is, too, on steep salience rise in many languages (Boynton 1997; Boynton and Olson 1987; Eessalu and Uusküla 2013; Jakovljev and Zdravković 2018; Lindsey and Brown 2014). The other frequent non-BCT, salatovyj ‘lettuce-coloured’, ostensibly is cognate of popular AE lime partitioning light green area (Lindsey and Brown 2014).
In salience ranking, several other Russian non-BCTs are close successors of the five “popular” non-BCTs, such as sirenevyj ‘lilac’, purpurnyj ‘cardinal red’, lilovyj ‘mauve’, alyj ‘scarlet’, xaki ‘khaki’, zolotoj ‘golden’, persikovyj ‘peach’, izumrudnyj ‘emerald’ and the ‘fancy’ term fuksiâ ‘fuchsia’. These terms, widely used in Russian, were also among the most frequently listed non-BCTs in earlier studies (Davies and Corbett 1994; Uusküla and Bimler 2016).
We scrutinised productivity measures of sirenevyj (RK-F = 19, RS-F = 21), whose counterparts are highly salient in many languages (Bimler and Uusküla, 2014; Epicoco et al. 2024; Jraissati et al. 2012). In AE, lilac greatly overlaps with lavender, both emerging BCTs that partition the PURPLE area (Lindsey and Brown 2014); in BE, lilac is argued to have emerged as a BCT (Mylonas and MacDonald 2016). In Russian, by contrast, sirenevyj is declining in salience: 30 years ago, it was the most frequent term beyond the 12 BCTs (RF = 13; Davies and Corbett 1994; Morgan and Corbett 1989); 10 years ago in salience it ranked 17 (Bimler and Uusküla 2014); and in our study its rank is still lower (RK-S = 19, RS-S = 20). The decline of sirenevyj frequency may be caused by an increased use of denotatively close lilovyj (cf. Safuanova and Korzh 2007): 30 years ago, RS-lag of lilovyj behind sirenevyj was 9 (Davies and Corbett 1994), but 10 years ago it was just 2 (Bimler and Uusküla 2014). It seems that Russian speakers tend to denote light purple colours by lilovyj, in sirenevyj stead (Marjieh et al. 2024). In addition, sirenevyj might be supplanted by novel lavandovyj ‘lavender’, which was absent in Davies and Corbett’s list in the 1990s but is of relatively high salience here: RK- S = 35, RS-S = 41. (As illustrated by Fig. 8, in the Kazan dendrogram, sirenevyj and lavandovyj are closely associated.) This trend echoes semantic competition among closely related terms and suggests lexical restructuring within the PURPLE area.
In total, seven (Kazan) and 13 (Smolensk) non-BCTs constitute the second segment of the Zipf-functions (Fig. 4). Partial divergence in the composition and number of the popular non-BCTs, along with the slope difference, points to diatopic variation between Kazan and Smolensk, conceivably reflecting chromatic environment statistics in these locations.
The productivity measures of highly popular non-BCTs (up to 13) suggest that they form an essential part of Russian speakers’ core colour inventory, alongside the 12 BCTs. We attribute the great similarity of the core inventories of the two populations, residing over 2000 km apart, to educational centralisation in Russia with wider access to canonical culture, standardised education resources, and explicit instruction (cf. Majid et al. 2018).
Notably, the Russian core inventory is strikingly similar to Lindsey and Brown’s (2014) AE core lexicon of 20 named colour categories, comprising the 11 English BCTs plus several non-BCTs: teal (overlapping with turquoise), beige, lime, lavender (greatly overlapping with lilac), and maroon (close cognate of burgundy, counterpart of Russian bordovyj ‘claret’).
Tertiary non-BCTs: recurring object references
Both participant samples offered numerous hyponyms (Tables S1 and S2). Tertiary non-BCTs mostly refer to specific objects (predominantly natural) integral to the life of Russians (cf. Griber et al. 2018). Frequent are CTs derived from (semi-)precious stones (e.g., rubinovyj ‘ruby’); berries, fruits and vegetables (e.g., višnëvyj ‘cherry’, morkovnyj ‘carrot’); flowers and trees (e.g., vasil’kovyj ‘cornflower’, kaštanovyj ‘chestnut’); food and beverages (gorčičnyj ‘mustard’, kofejnyj ‘coffee’); metals and alloys (e.g., bronzovyj ‘bronze’). These tertiary CTs exhibit regional variation shaped by “vernacular visuality” (Mitchell 2002) of salient referential objects (cf. Rubio-Fernandez 2021). (See Supplementary Materials: ‘Tertiary non-BCTs: regiolect variation’.)
The source-based derivation of Russian CTs parallels such models in other languages: items from similar semantic classes of objects (though culturally and ethnically specific) are widely used as referents in English (Casson 1994; Matschi 2004), German (Jones 2013), Polish (Kul’pina, 2001), Spanish (Lillo et al. 2018), Swedish (Bergh 2007), to name just a few languages.
We conclude this subsection with a remark on numerical outcomes in representation of tertiary non-BCT frequency by the third segment in the Zipf-function (Fig. 4). Our limb 3 slopes (Kazan: −1.39, Smolensk: −1.52) are closer to limb 3 slope for Italian (−1.21; Del Viva et al. 2023) but are not in the same ballpark as considerably steeper third-segment slopes for AE (−3.32; Lindsey and Brown, 2014) or Serbian (−2.96; Jakovljev and Zdravković 2018). Leaning upon Linders and Louwerse’s (2023) study on psychological mechanisms behind Zipf-function exponents, we conjecture that the cross-study discrepancy in the slope estimates is the consequence of the way participants were tasked. Specifically, in the AE and Serbian studies, participants were instructed to use solely monolexemic CTs, with no intrinsic modifiers; in comparison, the Russian and Italian studies allowed any CTs. The “monolexemic” instruction probably implied a greater degree of economisation of linguistic efforts, manifested by steeper slopes; in contrast, the “unconstrained” instruction was likely to encourage a more diversified, less economical language, which was demonstrated to yield shallower slopes in Zipf-functions (Linders and Louwerse 2023, pp. 81–82).
Colour names as phrasal units
Nuanced colour naming observed in both lists can be explained, to a certain degree, by the use of linguistic units of different levels (Marinova 2023a). As prompted by Table S3, polylexemic colour names partly are phrasal units based on metaphorical motivation, either entrenched or idiosyncratic, or new calques. The entrenched sero-buro-malinovyj (lit.) ‘grey-dust brown-raspberry’, a jocular vernacular expression used by Russian speakers to denote a drab, dull nondescript colour shade (Bochkarev et al. 2023), stands out as a highly salient phrasal unit.
We also observe instances of syntactic units of the form cvet Х ‘colour of X’, indicating that they are entering the language gradually (Rakhilina 2007). Some of these are entrenched in Russian, like “transparent” cvet morskoj volny ‘colour of sea wave’, a close synonym of birûzovyj ‘turquoise’ (Frumkina 1984), or playful vernacular cvet detskoj neožidannosti ‘colour of a child’s surprise’, paralleling the French caca du Dauphin (Kharchenko 2023; Kul’pina 2001, 2019; Rakhilina 2008; Vasilevich et al. 2016).
New developments in the Russian colour lexicon
Our lists contain plentiful novel CTs, as well as colour nominations in a non-canonical grammar form. This incites analysis of colouristic neologisms in relation to active transformations in Russian colour vocabulary in the 1990s and the following decades (Krylosova and Tomachpolski 2014, 2017; Marinova 2019, 2023a, b; Vasilevich et al. 2016).
As highlighted in the Introduction, the “neologism boom” in the colour domain is the consequence of the dramatic changes in post-Soviet Russia driven by market liberalisation with the influx of Western products and, with them, the import of novel CTs and new ways of colour nomination (Krylosova 2013). Via the advertising discourse, loanwords from English, French and Italian entered Russian and expanded the referent scope of CTs (Kravcova 2017; Krylosova and Tomachpolski 2014, 2017; Marinova 2019, 2023a, b; Nasibullina, 2010; Vasilevich et al. 2000, 2016). In many instances, CT transliterations were grammatically adapted as adjectival forms, with a suffix and ending canonical in Russian. Elicited from young Russians, active internet users and consumers, our lists testify to such novel terms, although their frequency is low, e.g., pudrovyj ‘powder-coloured’, metalličeskij ‘metallic’, amarantovyj ‘amaranth’, nudovyj ‘nude’, etc. In other cases, the terms were adopted as Russian translations, exemplified by the rising in frequency lososevyj ‘salmon’. Many of these terms are absent in the current explanatory dictionaries and are recorded only in special Russian language dictionaries or catalogues.
Our lists confirm another observation – of direct borrowings from English, French and Italian (Kravcova 2017; Krylosova 2013; Krylosova and Tomachpolski 2014, 2017; Marinova 2019, 2023a, b). Although in low frequencies, we record transliterations of foreign CTs functioning as object nouns, e.g., madženta ‘magenta’, tiffani ‘tiffany blue’, militari ‘military’, nûd ‘nude’, kapučino ‘cappuccino’, samo ‘salmon’, etc.
Across participant lists, we notice signs of grammar instability of some novel terms: e.g., along with fuksiâ, there are instances of cvet fuksii ‘colour of fuchsia’ (normative genitive construction in Russian) but also cvet fuksiâ ‘fuchsia colour’, marked by non-inflexion of the noun. The latter exemplifies a novel syntactic model that is spreading under the influence of advertising texts (Krylosova and Tomachpolski 2017; Marinova 2023a, b).
Notably, non-canonical nominal CTs, formed by the truncation of canonical adjectives, eventually became models for the grammar transformation of traditional Russian CTs (Krylosova and Tomachpolski 2014). In both lists we record nominal CTs, e.g., višnevyj → višnâ ‘cherry’, grafitnyj → grafit ‘graphite’, kobal’tovyj → kobalt ‘cobalt’, šokoladnyj → šokolad ‘chocolate’, etc.
Interestingly, almost all loanwords in our lists, whether transliterated or translated, have widely known Russian equivalents (Krylosova and Tomachpolski 2017): e.g., bordovyj for burgundi; cvet morskoj volny for akva; izumrudnyj for emerald; zaŝitnyj for militari; or telesnyj for nûd. However, a borrowed neologism generally does not occupy the entire “niche” of the respective Russian CT: the alleged equivalents develop denotative nuances apparent to a Russian speaker (Marinova 2019, 2023a, b).
More importantly is axiological aspect of novel coloratives (Vasilevich et al. 2016): loanwords, viewed as original and fresh, in product marketing are intended to enkindle connotations of “prestige” and “elegance” of the advertised product, in order to motivate favourable consumers’ decisions (Krylosova 2013; Krylosova and Tomachpolski 2014, 2017; Marinova 2019, 2023a, b; Skorinko et al. 2006; Vasilevich 2003).
In our lists, we see novel metaphoric phrasal units of French origin, which entered Russian from fashion catalogues and internet-fora, e.g., cvet bedra ispugannoj nimfy (lit.) ‘colour of a frightened nymph’s thigh’, loan translation of the fancy French term cuisse de nymphe effrayée. Present also are idiosyncratic expressions like cvet vlûblennoj lâguški ‘colour of a frog in love’ (Kazan) or cvet lâguški v obmoroke ‘colour of a fainted frog’ (Smolensk). As observed by Biggam (2012) for English, such ‘fancy’ object-derived loanwords frequently occur in the younger generation’s parlance, with colour references transparent for this generation’s speakers.
Finally, in both samples, noteworthy is a group of novel phrasal units that reflect the tendency towards analyticity characteristic of the contemporary Russian language (Marinova 2023a; Masini and Benigni 2012; Ohnheiser 2019) and that are inspired by marketing and advertising narratives. Such nominations are multiword “creative” CTs, nuanced (frequently evaluative) descriptions of chromatic properties (cf. Biggam 2012; Casson 1994) or metaphors alluding to speakers’ visual environment (Kravcova 2017).
As argued by Rubio-Fernandez (2021), such “redundant colour words” facilitate the speaker’s discriminability of the referent colour in the entire visual context, which, in turn, enhances referential communication. In our lists, “redundant colour words” are exemplified by adjective chains, like amarantovyj gluboko-purpurnyj ‘amaranth deep purple’ or marmeladno-goluboj ‘marmalade goluboj’, and [ADJ, N] compositions, e.g., cvet mokroj sireni ‘colour of wet lilac’, staroe zoloto ‘old gold’ (Kazan); nočnoê nebo ‘night sky’, morskaâ sol’ ‘sea salt’, ržavaâ med’ ‘rusty copper’ (Smolensk).
Patterns of conceptual association of Russian colour terms
The patterns of conceptual closeness of Russian CTs reveal the great extent of commonality between Kazan and Smolensk samples, implying that these language communities impose similar patterns of associations and interrelationships upon their core colour lexicon.
The governing conceptual theme is the cognitive salience, reflected at the highest agglomerative level: cluster analyses reveal a clear separation between BCTs and non-BCTs. Within a BCT cluster, there is a distinction between subclusters of the high- and low-chromatic BCTs, and achromatic BCTs. The semantic maps identified, in addition, a refined inter-term distinction that reflects the salience gradient of individual CTs, in accord with findings for other languages (Bimler and Uusküla 2014; Del Viva et al. 2023; Moreira et al. 2024). Non-BCT clusters reveal varying grounds of conceptual distinction: some group CTs by their semantic similarity and contiguity, others by the number of features (colour attributes) the two CTs share (cf. Uusküla and Bimler 2016), or by cultural associations and collocations that are likely to contribute to mutual CT priming (cf. Ronga et al. 2014).
The parallelism of cluster bases reflects possible trajectories through the elicited names, where an individual participant’s chain of associations might jump between, say, more- and less-basic chromatic terms, or between lighter and darker concepts. Some participants list primary chromatic BCTs, progressing to secondary chromatic BCTs and chromatic non-BCTs, while others, after the ‘cardinal hues’, list ‘white’ and ‘black’ and from there, follow the semantic links to ‘grey’, and still others pursue a systematic attempt to exhaust all variants of (say) ‘blue’ before moving on to (say) ‘green’.
Ongoing lexical refinement of colour space in Russian
An abundance of hyponyms and modified terms indicates that certain colour space areas are particularly prone to further lexical differentiation in Russian: BLUE, BLUE-GREEN, GREEN, PURPLE, and YELLOW-ORANGE-PINK. (For details, see Supplementary Materials: ‘Russian non-BCTs as indicators of ongoing colour space lexical refinement’.) The far majority of colour names in these areas are traditional and were attested in previous (psycho-)linguistic studies of Russian CTs (Bimler and Uusküla 2014; Davies and Corbett 1994; Frumkina 1984; Morgan and Corbett 1989; Safuanova and Korz 2007). Many of the recorded novel terms indicate attention to distinctiveness and allow a higher degree of lexical precision, creating new reference points within these colour space areas (Tribushinina 2008).
Similar processes of lexical refinement are attested cross-linguistically. Across languages of highest communicative efficiency, lexical refinement is observed in three colour clusters – cool (blue, green), warm (red, orange, yellow, brown), and intermediate (purple, pink) (Conway et al. 2020). The semantic alignment in colour lexicon enrichment across languages (Thompson et al. 2020) is credited to globalised cultural transmission through travel, media, trade and technological developments (Mylonas et al. 2022; Xu et al. 2013).
Limitations and future directions
Our study is limited by geography (restricted to European Russia), gender imbalance, and disciplinary heterogeneity among participants. Future work could (i) expand to other regions, including Asian Russia, (ii) balance gender samples, and (iii) systematically contrast Humanities vs. Science students. Finally, grammatical innovation in Russian CTs, especially the rise of non-canonical nominal forms and growing analyticity in colour vocabulary, merits dedicated investigation. Our raw data are available (Tables S1 and S2), and we invite interested linguists to further examine novel phenomena using our data.
Conclusions
Using elicited lists from Kazan and Smolensk, we mapped the Russian colour lexicon assessed by the terms’ frequency, cognitive salience, and conceptual associations. Our findings confirm 12 Russian BCTs (including two ‘blues’ sinij and goluboj), identify birûzovyj ‘turquoise’ as an emerging basic term, and show strong consensus around a shared core colour inventory of about 25 terms. Richness of hyponyms, polylexemic terms and phrasal units, and morphological productivity reflect Russian speakers’ awareness of colour nuances and the need for efficient communication of these, while the recent colouristic neologism boom highlights the dynamism of Russian colour naming.
The Russian case exemplifies how universal perceptual and cognitive pressures interact with culture-specific histories, producing both stability (BCTs, core colour inventory) and innovations (neologisms, phrasal units, adjective truncation). The ongoing lexical refinement of colour space in Russian aligns with cross-language trends, suggesting that globalisation fosters semantic convergence while leaving space for population-specific and local variation.
Data availability
The datasets generated and analysed during the current study are available in the Figshare repository: https://doi.org/10.6084/m9.figshare.27663297. The following supporting information can be downloaded: - Original individual lists (N = 112) elicited in Kazan (in Russian); - Original individual lists (N = 143) elicited in Smolensk (in Russian); - Preprocessed data elicited in Kazan (in Russian); - Preprocessed data elicited in Smolensk (in Russian); - Kazan matrix of adjacencies for cognitively salient CTs (N = 36) (in English transliteration); - Smolensk matrix of adjacencies for cognitively salient CTs (N = 36) (in English transliteration); - Scripts used for data processing and analysis.
References
Alldrick M (2025) Colour terms and bilingualism: an experimental study of Russian and Tatar. Open Linguist 11:20250050
Androulaki A, Gômez-Pestaña N, Mitsakis C, Lillo Jover J, Coventry K, Davies I (2006) Basic colour terms in modern Greek: twelve terms including two blues. J Greek Linguist 7:3–47
Baddeley R, Attewell D (2009) The relationship between language and the environment: Information theory shows why we have only three lightness terms. Psychol Sci 20(9):1100–1107. https://doi.org/10.1111/j.1467-9280.2009.02412.x
Bergh G (2007) The semiosis of Swedish car color names: descriptive and amplifying functions. In: MacLaury RE, Paramei GV, Dedrick D (eds) Anthropology of color: interdisciplinary multilevel modeling. John Benjamins, pp 337–345. https://doi.org/10.1075/z.137.22ber
Berlin B, Kay P (1969/1991) Basic color terms: their universality and evolution. University of California Press
Biggam CP (2012) The semantics of colour: a historical approach. Cambridge University Press
Bimler D, Uusküla M (2014) Analysing listing data with multidimensional scaling to reconstruct the cognitive organization of the colour lexicon in 11 languages. In: Rossi M (ed) Colour and colorimetry: multidisciplinary contributions, vol X B. Associazione Italiana Colore, pp 179–190
Bimler D, Uusküla M (2018) Individual variations in color-concept space replicate across languages. J Opt Soc Am A 35(4):B184–B191. https://doi.org/10.1364/JOSAA.35.00B184
Bimler D, Uusküla M (2021) Two indices are better than one: building on Robbins, Nolan, and Chen (2017). Field Methods 33(1):42–51. https://doi.org/10.1177/1525822X20945063
Bochkarev VV, Shevlyakova AV, Solovyev VD, Rakhilina EV, Paramei GV (2023) Linguistic mechanisms of colour term evolution: a diachronic investigation of “Russian browns” buryj and koričnevyj. Diachronica 40(4):492–531. https://doi.org/10.1075/dia.22013.boc
Borgatti SP (1998) Elicitation techniques for cultural domain analysis. In: Schensul JJ, LeCompte MD (eds) The ethnographer’s toolkit, vol 3. Altimira Press, pp 1–26
Borgatti SP (1999) Elicitation techniques for cultural domain analysis. In: Schensul JJ, LeCompte MD, Nastasi BK, Borgatti SP (eds) Enhanced ethnographic methods: audiovisual techniques, focused group interviews, and elicitation techniques. Altimira Press, pp 115–151
Boynton RM (1997) Insights gained from naming the OSA colors. In: Hardin CL, Maffi L (eds) Color categories in thought and language. Cambridge University Press, pp 135–150
Boynton RM, Olson CX (1987) Locating basic colors in the OSA space. Color Res Appl 12(2):94–105. https://doi.org/10.1002/col.5080120209
Brown AM, Isse A, Lindsey DT (2016) The color lexicon of the Somali language. J Vis 16(5):14. https://doi.org/10.1167/16.5.14
Čislennost’ postoânnogo naseleniâ Rossijskoj Federacii po municipal’nym obrazovaniâm na 1 ânvarâ 2023 goda (s učëtom itogov Vserossijskoj perepisi naseleniâ 2020 g.). Federal’naâ služba gosudarstvennoj statistiki (18 avgusta 2023) [The size of the permanent population of the Russian Federation by municipalities as of January 1, 2023 (taking into account the results of the All-Russian Population Census of 2020). Federal State Statistics Service (August 18, 2023)] (in Russian). https://rosstat.gov.ru/compendium/document/13282. Accessed 15 Mar 2023)
Casson RW (1994) Russett, rose, and raspberry: the development of English secondary color terms. J Linguist Anthropol 4(1):5–22. https://doi.org/10.1525/jlin.1994.4.1.5
Conway BR, Ratnasingam S, Jara-Ettinger J, Futrell R, Gibson E (2020) Communication efficiency of color naming across languages provides a new framework for the evolution of color terms. Cognition 195:104086. https://doi.org/10.1016/j.cognition.2019.104086
Corbett GG, Davies IRL (1995) Linguistic and behavioural measure for ranking basic colour terms. Stud Lang 19(2):301–357. https://doi.org/10.1075/sl.19.2.02cor
Corbett GG, Davies IRL (1997) Establishing basic color terms: measures and techniques. In: Hardin CL, Maffi L (eds) Color categories in thought and language. Cambridge University Press, pp 197–223
Corbett GG, Morgan G (1988) Colour terms in Russian: reflections of typological constraints in a single language. J Linguist 24(1):31–64. https://doi.org/10.1017/S0022226700011555
Davidoff J (2015) Color categorization across cultures. In: Elliot AJ, Fairchild MD, Franklin A (eds) Handbook of color psychology. Cambridge University Press, pp 259–278
Davies I, Corbett GG (1994) The basic color terms of Russian. Linguistics 32(1):65–89. https://doi.org/10.1515/ling.1994.32.1.65
Del Viva MM, Castelotti S, Paramei GV (2023) The Italian colour lexicon in Tuscany: Elicited lists, cognitive salience, and semantic maps of colour terms. Humanit Soc Sci Commun 10:900. https://doi.org/10.1057/s41599-023-02393-4
Doherty G (2017) Paradoxes of green. Cambridge University Press, pp 63–65, 72
Eessalu M, Uusküla M (2013) The special case of beige: a cross-linguistic study. In: Rossi R (ed) Colour and colorimetry: multidisciplinary contributions, vol IX B. Maggioli Editore, pp 168–176
Epicoco D, Mohr C, Uusküla M, Quiblier M, Meziane MB, Laurent E, Spagnulo GFM, Jonauskaite D (2024) The PURPLE mystery: semantic meaning of three purple terms in French speakers from Algeria, France, and Switzerland. Color Res Appl 49(1):93–112. https://doi.org/10.1002/col.22908
Ferrer I, Cancho R, Solé RV (2003) Least effort and the origins of scaling in human language. Proc Natl Acad Sci USA 100(3):788–791. https://doi.org/10.1073/pnas.0335980100
Friendly ML (1977) In search of the M-Gram: the structure of organization in free recall. Cogn Psychol 9(2):188–249. https://doi.org/10.1016/0010-0285(77)90008-1
Frumkina RM (1984) Cvet, smysl, sxodstvo. Aspekty psixolingvističeskogo analiza [Colour, meaning, and similarity. Aspects of a psycholinguistic analysis]. Nauka (in Russian)
Frumkina RM, Mikhejev AV (1996) Meaning and categorization. Nova Science
Gibson E, Futrell R, Jara-Ettinger J, Mahowald K, Bergen L, Ratnasingam S, Gibson M, Piantadosi ST, Conway BR (2017) Color naming across languages reflects color use. Proc Natl Acad Sci USA 114(40):10785–10790. https://doi.org/10.1073/pnas.1619666114
Griber YA, Mylonas D, Paramei GV (2018) Objects as culture-specific referents of color terms in Russian. Color Res Appl 43(6):958–975. https://doi.org/10.1002/col.22280
Griber YА, Mylonas D (2015) Kartografija cveta: èmpiričeskij analiz cvetonaimenovanij russkogo jazyka [Colour mapping: an empirical analysis of Russian colour names]. Čelov Kul’t (in Russian). http://www.e-notabene.ru/ca/article_16636.html
Griber YA, Mylonas D, Paramei GV (2021) Intergenerational differences in Russian color naming in the globalized era: linguistic analysis. Humanit Soc Sci Commun 8:262. https://doi.org/10.1057/s41599-021-00943-2
Heard AS (2011) Leksičeskij atlas russkix narodnyx govorov [Lexical atlas of Russian dialects]. Nestor-Istoria (in Russian)
Jakovljev I, Zdravković S (2018) The colour lexicon of the Serbian language – a study of dark blue and dark red colour categories. Part 1: Colour-term elicitation task. Psihologija 51(2):197–213. https://doi.org/10.2298/PSI160521002J
Jameson KA, Komarova NL (2009) Evolutionary models of color categorization. II. Realistic observer models and population heterogeneity. J Opt Soc Am A 26(6):1424–1436. https://doi.org/10.1364/JOSAA.26.001424
Jones WJ (2013) German colour terms: a study in their historical evolution from earliest times to the present. John Benjamins. https://doi.org/10.1075/sihols.119
Josserand M, Meeussen E, Majid A, Dediu D (2021) Environment and culture shape both the colour lexicon and the genetics of colour perception. Sci Rep 11:19095. https://doi.org/10.1038/s41598-021-98550-3
Jraissati Y, Wakui E, Decock L, Douven I (2012) Constraints on colour category formation. Int Stud Philos Sci 26(2):171–196. https://doi.org/10.1080/02698595.2012.703479
Kemp C, Xu Y, Regier T (2018) Semantic typology and efficient communication. Annu Rev Linguist 4:109–128. https://doi.org/10.1146/annurev-linguistics-011817-045406
Kharchenko VK (2023) Bogatstvo cveta v russkom jazyke [Richness of colour in the Russian language]. INFRA-M, Moscow (in Russian)
Komarova NL, Jameson KA (2008) Population heterogeneity and color stimulus heterogeneity in agent-based color categorization. J Theor Biol 253(4):680–700. https://doi.org/10.1016/j.jtbi.2008.03.030
Kravcova LP (2017) Nejming cveta v jazyke mody [Naming of colour in the language of fashion]. Kommun issled 2(12):69–86. (in Russian)
Krimer-Gaborović S, Jakovljev I (2022) [A study of gender-related differences in colour naming in standard Serbian language]. Zb Matice Srp filol linguist 65(1):47–75
Krylosova S (2013) Nouveautés et éternels retours: quelques observations sur l’évolution actuelle du vocabulaire de la mode en russe. Slovo 37(38):245–263. https://hal.science/hal-01461560
Krylosova SG, Tomachpolski VI (2013) Buryj vs. koričnevyj v slovare i tekste [Buryj vs. koričnevyj in a dictionary and text]. In: Ékologija perevoda: perspektivy meždisciplinarnyx issledovanij. Materialy meždunarodnoj naučno-praktičeskoj konferencii, 4-5.10.2013, Tumen’ State University, pp 226–235. Šuklin & Aleksandrov (in Russian)
Krylosova SG, Tomachpolski VI (2014) Sinkretičeskie kolorističeskie neologizmy v russkom jazyke [Syncretic coloristic neologisms in the Russian language]. Ross Olomucensia LIII:113–117. (in Russian)
Krylosova SG, Tomachpolski VI (2017) Èlegantnost’ cveta ajvori: kolorističeskie neologizmy v sovremennom russkom jazyke [Elegance of the ivory colour: coloristic neologisms in the contemporary Russian language]. In: Materialy VI Meždunarodnoj naučno-praktičeskoj internet-konferencii, Ekaterinburg. Ural Federal University, pp 109–114 (in Russian)
Kul’pina VG (2001) Lingvistika cveta. Terminy cveta v pol’skom i russkom jazykax [Linguistics of colour. Colour terms in Polish and Russian]. Russkie slovari (in Russian)
Kul’pina VG (2007) Sistema cvetooboznačenij russkogo jazyka [The system of Russian colour terms]. In: Vasilevich AP (ed) Naimenovanija cveta v indoevropejskix jazykax: Sistemnyj i istoričeskij analiz [Colour terms in Indo-European languages: a systematic and historical analysis]. KomKniga, pp 126–184 (in Russian)
Kul’pina VG (2019) Lingvističeskaja cvetologija: ot istorii k sovremennosti cvetovyx konceptosfer [Linguistic colour theory: from history to modernity of coloristic conceptual spheres]. MAKS Press (in Russian)
Kuriki I, Lange R, Muto Y, Brown AM, Fukuda K, Tokunaga R, Lindsey DT, Uchikawa K, Shioiri S (2017) The modern Japanese color lexicon. J Vis 17(3):1. https://doi.org/10.1167/17.3.1
Laws G, Davies I, Andrews C (1995) Linguistic structure and non-linguistic cognition: English and Russian blues compared. Lang Cogn Process 10(1):59–94. https://doi.org/10.1080/01690969508407088
Lillo J, Prado-León L, González Perilli F, Melnikova A, Álvaro L, Collado J, Moreira H (2018) Spanish basic colour categories are 11 or 12 depending on the dialect. In: MacDonald LW, Biggam CP, Paramei GV (eds) Progress in colour studies: cognition, language and beyond. John Benjamins, pp 59–82
Linders GM, Louwerse MM (2023) Zipf’s law revisited: Spoken dialog, linguistic units, parameters, and the principle of least effort. Psychon Bull Rev 30:77–101. https://doi.org/10.3758/s13423-022-02142-9
Lindsey DT, Brown AM (2014) The color lexicon of American English. J Vis 14(2):17. https://doi.org/10.1167/14.2.17
Lindsey DT, Brown AM (2021) Lexical color categories. Annu Rev Vis Sci 7:605–631. https://doi.org/10.1146/annurev-vision-093019-112420
Lunkova ES (2020) K voprosu o vzaimodeistvii smolenskix govorov i belorusskogo yazyka [On the problem of an interaction of the Smolensk dialect and the Belarusian language]. Neophilology 6(24):669–685. https://doi.org/10.20310/2587-6953-2020-6-24-669-685 (in Russian)
Majid A, Roberts G, Cilissen L, Levinson SC (2018) Differential coding of perception in the world’s languages. Proc Natl Acad Sci USA 115(45):11369–11376. https://doi.org/10.1073/pnas.1720419115
Marinova EV (2019) “Nebo cveta stal'”: sostavnye naimenovanija cveta v grammatičeskom aspekte [“The sky of colour steel”: grammar aspects of composite colour terms]. Soc’nye transformacii 30:129–134
Marinova EV (2023a) Koloronimy kak fakty slovarja i reči [Coloronyms as the facts of a dictionary and the speech]. In: Kozlovskaja NV (ed) Neologija. Neografija. Institut lingvističeskix issledovanij RAN, pp 127–136 (in Russian)
Marinova EV (2023b) Plat’e cveta moloko: ob odnom ustojčivom tipe grammatičeskogo varjirovanija [A dress of colour milk: about a certain stable type of grammatical variation]. Russ Lang Sch 84(6):92–99. https://doi.org/10.30515/0131-6141-2023-84-6-92-99
Marjieh R, Sucholutsky I, van Rijn P, Jacoby N, Griffiths TL (2024) Large language models predict human sensory judgments across six modalities. Sci Rep 14:21445. https://doi.org/10.1038/s41598-024-72071-1
Martinovic J, Paramei GV, MacInnes WJ (2020) Russian blues reveal the limits of language infl uencing colour discrimination. Cognition 201:104281. https://doi.org/10.1016/j.cognition.2020.104281
Masini F, Benigni V (2012) Phrasal lexemes and shortening strategies in Russian: the case for constructions. Morphology 22:417–451. https://doi.org/10.1007/s11525-011-9200-y
Matschi M (2004) Color terms in English: onomasiological and semasiological aspects. Onomasiol Online 5:56–139. https://www1.ku.de/SLF/EngluVglSW/matschi1041.pdf
Mitchell WJT (2002) Showing seeing: a critique of visual culture. J Vis Cult 1(2):165–181
Mitzenmacher M (2004) A brief history of generative models for power law and lognormal distributions. Internet Math 1(2):226–251. https://doi.org/10.1080/15427951.2004.10129088
Moreira H, Alvaro L, Lillo J (2024) Color blindness and semantic knowledge: Cognition of color terms from elicited lists in dichromats and normal observers. Color Res Appl 49(5):420–432. https://doi.org/10.1002/col.22925
Morgan G (1993) Basic colour terms: comparative results for French and Russian. J Fr Lang Stud 3(1):1–17. https://doi.org/10.1017/S0959269500000326
Morgan G, Corbett GG (1989) Russian colour term salience. Russ Linguist 13(2):125–141. https://doi.org/10.1007/BF02551669
Muggeo VMR (2003) Estimating regression models with unknown break-points. Stat Med 22(19):3055–3071. https://doi.org/10.1002/sim.1545
Mylonas D, Caparos S, Davidoff J (2022) Augmenting a colour lexicon. Humanit Soc Sci Commun 9:29. https://doi.org/10.1057/s41599-022-01045-3
Mylonas D, MacDonald L (2016) Augmenting basic colour terms in English. Color Res Appl 41(1):32–42. https://doi.org/10.1002/col.21944
Nasibullina FF (2010) Kolorativnaja leksika i ee funkcionirovanie v reklamax inter’era (na russkom i nemeckom jazykax) [Colorative lexicon and its functioning in interior advertisements (in Russian and German)]. PhD thesis, TGGPU, Kazan (in Russian)
Ohnheiser I (2019) Compounds and multi-word expressions in Russian. In: Schlücker B (ed) Complex lexical units: compounds and multi-word expressions. De Gruyter, pp 251–278
Online Transliterator (n/d). https://translit.cc
Paramei GV (2005) Singing the Russian blues: an argument for culturally basic color terms. Cross-Cult Res 39(1):10–38. https://doi.org/10.1177/1069397104267888
Paramei GV (2007) Russian ‘blues’: controversies of basicness. In: MacLaury RE, Paramei GV, Dedrick D (eds) Anthropology of color: interdisciplinary multilevel modeling. John Benjamins, pp 75–106. https://doi.org/10.1075/z.137.07par
Paramei GV (2020) Color categorization: patterns and mechanisms of evolution. In: Shamey R (ed) Encyclopedia of color science and technology, 2nd edn. Springer. https://doi.org/10.1007/978-3-642-27851-8
Paramei GV, Bimler D (2022) Language and psychology. In: Steinvall A, Street S (eds) A cultural history of colour, Vol. 6, The Modern Age: from 1920 to present. Bloomsbury, pp 117–134
Paramei GV, Griber YA, Mylonas D (2018) An online color naming experiment in Russian using Munsell color samples. Color Res Appl 43(3):358–374. https://doi.org/10.1002/col.22190
Quinlan MB (2019) The freelisting method. In: Liamputtong P (ed) Handbook of research methods in health social sciences. Springer, pp 1431–1446. https://doi.org/10.1007/978-981-10-5251-4_12
Rakhilina EV (2007) Linguistic construal of colors: the case of Russian. In: MacLaury RE, Paramei GV, Dedrick D (eds) Anthropology of color: interdisciplinary multilevel modeling. John Benjamins, pp 363–377. https://doi.org/10.1075/z.137.24rak
Rakhilina EV (2008) Kognitivnyj analiz predmetnyx imen: semantika i sočetaemost’ [Cognitive analysis of object names: demantics and combinability], 2nd edn. Russkie slovari (in Russian)
Rakhilina EV, Paramei GV (2011) Colour terms: evolution via expansion of taxonomic constraints. In: Biggam CP, Hough CA, Kay CJ, Simmons DR (eds) New directions in colour studies. John Benjamins, pp 121–131. https://doi.org/10.1075/z.167.15rak
Regier T, Carstensen A, Kemp C (2016) Languages support efficient communication about the environment: words for snow revisited. PLoS ONE 11(4):e0151138. https://doi.org/10.1371/journal.pone.0151138
Ronga I, Bazzanella C, Strudsholm E, Salvati L (2014) Black as night or as a chimney sweep? Color words and typical exemplars. Intercult Pragmat 11(4):485–520. https://doi.org/10.1515/ip-2014-0022
Rubio-Fernandez P (2021) Color discriminability makes over-specification efficient: theoretical analysis and empirical evidence. Humanit Soc Sci Commun 8:147. https://doi.org/10.1057/s41599-021-00818-6
Safonova SS, Chupryakova O (2014) Substantivnye ekspressivy v âzykovom prostranstve russkix govorov Volgo-Kamskogo regiona respubliki Tatarstan [Substantive expressive phrases in the sprachraum of Russian dialects in the Volga-Kama region of the Republic of Tatarstan]. Philol Cult 3(37):19–26. (in Russian)
Safuanova OV, Korzh NN (2007) Russian color names. Mapping into a perceptual color space. In: MacLaury RE, Paramei GV, Dedrick D (eds) Anthropology of color: interdisciplinary multilevel modeling. John Benjamins, pp 55–74. https://benjamins.com/catalog/z.137.06saf
Saunders BAC, van Brakel J (1997) Are there nontrivial constraints on colour categorization? Behav Brain Sci 20(2):167–228. https://doi.org/10.1017/S0140525X97531426
Skorinko JL, Kemmer S, Hebl MR, Lane DM (2006) A rose by any other name…: color-naming influences on decision making. Psychol Mark 23(12):975–993. https://doi.org/10.1002/mar.20142
Sutrop U (2001) List task and a cognitive salience index. Field Methods 13(3):263–276. https://doi.org/10.1177/1525822X0101300303
Thompson B, Roberts SG, Lupyan G (2020) Cultural influences on word meanings revealed through large-scale semantic alignment. Nat Hum Behav 4:1029–1038. https://doi.org/10.1038/s41562-020-0924-8
Tribushinina E (2008) Cognitive reference points. Semantics beyond the prototypes in adjectives of space and colour. PhD thesis, University of Leiden, pp 92–99. https://scholarlypublications.universiteitleiden.nl/handle/1887/13224
Uusküla M, Bimler D (2016) From listing data to semantic maps: cross-linguistic commonalities in cognitive representation of colour. Folklore 64:57–90. http://folklore.ee/Folklore/vol64/colour.pdf
Uusküla M, Sutrop U (2007) Preliminary study of basic colour terms in Modern Hungarian. Linguist Ural 43(2):102–123
Vamling K (1986) A note on Russian blues. Slav Lundensia 10:225–233. https://www.academia.edu/999698/A_note_on_Russian_blues
van der Klis M, Tellings J (2022) Generating semantic maps through multidimensional scaling: linguistic applications and theory. Corpus Linguist Linguist Theory 18(3):627–665. https://doi.org/10.1515/cllt-2021-0018
Vasilevich AP (1983) Psixolingvističeskij podxod k ustanovleniju leksičeskix sootvetstvij (na materiale bolgarskix, russkix i anglijskix cvetonaimenovanij) [A psycholinguistic approach to estimation of lexical equivalents (on material of Bulgarian, Russian, and English colour terms)]. Sapostav Jezykosnanie (Sofia) 8:5–17. (in Russian)
Vasilevich AP (1988) Cvetonaimenovanija i problema perevoda teksta [Colour terms and the problem of text translation]. In: Komissarov VN, Chernjakhovskaja LA, Latyshev LK (eds) Text i perevod [Text and translation]. Nauka, pp 84–96 (in Russian)
Vasilevich AP (2003) Jazykovaja kartina mira cveta. Metody issledovanija i prikladnye aspekty [A language view of the colour world. Methods of investigation and applied aspects]. Post-doctorate dissertation. Institute of Language Studies (in Russian)
Vasilevich AP, Kuznecova SN, Mishchenko SS (2016) Cvet i nazvanija cveta v russkom jazyke [Colour and colour terms in the Russian language]. LCI Publisher (in Russian)
Vasilevich AP, Mishchenko SS, Kuznecova SN (2000) Cvet i ego nazvanie. Razvitie leksiki cvetooboznačenij v sovremennoj Rossii [Colour and its name. Evolution of the colour term lexicon in modern Russia]. Vestn RFFI 1:56–61. (in Russian)
Vendina ТI (2014) Russkoe dialektnoe slovo vo vremeni i prostranstve [Russian dialect word in time and space]. Nestor-Istoria (in Russian)
Vendina TI (2016) Russkie dialekty v sovremennoj ètnoâzykovoj i ètnokul’turnoj situacii [Russian dialects in the contemporary ethno-linguistic and ethno-cultural situation]. Curr Probl Slav Linguist Lit Stud Folk Stud 11:425–449. (in Russian)
Weller SC, Romney AK (1988) Systematic data collection. Sage Publications
Xu J, Dowman M, Griffith TL (2013) Cultural transmission results in convergence towards colour term universals. Proc R Soc B 280(1758):20123073. https://doi.org/10.1098/rspb.2012.3073
Xu M, Zhu J, Benítez-Burraco A (2023) A comparison of basic color terms in Mandarin and Spanish. Color Res Appl 48(6):709–720. https://doi.org/10.1002/col.22863
Zaslavsky N, Garvin K, Kemp C, Tishby N, Regier T (2022) The evolution of color naming reflects pressure for efficiency: evidence from the recent past. J Lang Evol. https://doi.org/10.1093/jole/lzac001
Zaslavsky N, Kemp C, Tishby N, Regier T (2019) Color naming reflects both perceptual structure and communicative need. Top Cogn Sci 11(1):207–219. https://doi.org/10.1111/tops.12395
Zollinger H (1984) Why just turquoise? Remarks on the evolution of color terms. Psychol Res 46:403–409. https://doi.org/10.1007/BF00309072
Acknowledgements
The authors thank Venera R. Bayrasheva and Yulia A. Volskaya (Kazan), Grigorij Elkind and Alyona Nankevich (Smolensk) for their help in data collection. We are grateful to Angela M. Brown and Delwin T. Lindsey for their valuable advice on piecewise polynomials applied to Zipf-functions. We thank David L. Bimler for help with the HCA and embedding HCA outcomes into the semantic maps. The authors are indebted to Artemiy Kulemin for the design of the map of European Russia and colour-coding of dendrograms. We are especially grateful to the reviewers for their many helpful comments and constructive criticisms. These contributed substantially to a much-improved article. We thank Rita Cavonius for proofreading the revised manuscript. All participants are gratefully acknowledged for their time and goodwill. Results of the study were partly presented at the 17th International Karskiy Readings “The Slavic world and national speech culture in modern communication” (16th-17th May 2024, Grodno, Belarus) and at the 10th International Conference on Cognitive Science (26th–30th June 2024, Pyatigorsk, Russia). At Kazan Federal University, data collection for the project was supported by the Strategic Academic Leadership Program “PRIORITY-2030” (Strategic Project № 5).
Author information
Authors and Affiliations
Contributions
YAG: investigation; data collection in Smolensk; methodology; data curation; project administration; writing – original draft. AII: investigation; software; data curation; formal quantitative analysis; visualisation. VDS: investigation; data collection in Kazan; writing – original draft; funding acquisition. GVP: conceptualisation; methodology; data curation; research coordination; resources; supervision; writing – original draft; writing - review and editing.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval
The study has been approved by the Ethics Committees of Smolensk State University (20th April 2023, No. 22-18-00407-2) and Kazan Federal University (15th February 2024, No. 46); it was conducted according to the guidelines of the Declaration of Helsinki.
Informed consent
All participants have signed the informed consent prior to data collection.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Informed consent
All participants have signed the informed consent prior to data collection.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Griber, Y.A., Ivleva, A.I., Solovyev, V.D. et al. The Russian colour lexicon and its diatopic variation: elicited lists, cognitive salience of colour terms, and neologism boom. Humanit Soc Sci Commun 13, 681 (2026). https://doi.org/10.1057/s41599-026-07021-5
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1057/s41599-026-07021-5









![Fig. 7: Scatterplots of log-transformed normalised frequency [−log10(Fi/N)] and mean position [−log10(mPi)] for salient colour terms elicited in Kazan (NK-F = 86) (left) and Smolensk (NS-F = 110) (right).](http://media.springernature.com/lw685/springer-static/image/art%3A10.1057%2Fs41599-026-07021-5/MediaObjects/41599_2026_7021_Fig7_HTML.png)
