Introduction

The Late Cenozoic marks a period of significant climatic change on Earth, during which the global climate transitioned from a greenhouse to an icehouse state1,2. This shift led to the gradual transformation of mid-latitude forests into grasslands, fostering the development of more open ecosystems3. In this context, the evolution and adaptive changes of mammals became a key scientific question. The uplift of the Tibetan Plateau is considered one of the major drivers of Late Cenozoic climate change, affecting global atmospheric circulation, forming the East Asian monsoon, and causing Central Asia’s aridification4,5,6. This geological event not only shaped the region’s climate but also significantly influenced mammalian evolution7,8.

The response of the mammals to the regional climatic change is an interesting question. Especially in the transition of the Mid-Miocene Climatic Optimum (MMCO) and the Mid-Miocene Climatic Transition (MMCT), which is a key period for the Late Cenozoic climatic change. The MMCO (17–14.5 Ma) was characterized by globally warm and humid conditions, leading to the expansion of forested environments and a diversification of mammalian taxa9,10. In contrast, the MMCT (14.5–13 Ma) marked a shift toward cooler and more arid conditions, driven by declining atmospheric CO₂ levels and Antarctic ice sheet expansion, leading to significant ecological restructuring1,11. Previously, the mammalian adaptation to this change has been studied in Europe and North America12,13,14,15,16. However, a comprehensive study of its impact on Late Cenozoic faunal changes in Asia, especially backdropped by the elevation of the Tibetan Plateau, is still lacking17.

The Junggar Basin and the Tongxin region in China, covering MMCO and MMCT, offer an ideal model for addressing these questions. These regions feature rich fossil records, well-preserved stratigraphy, and reliable paleomagnetic dating18. Recent studies on the local changes of vegetation and geochemical records have provided a strong geological and biological basis for studying Late Cenozoic climate change19,20. Wang et al.20 conducted a detailed study on the vegetation of the Junggar Basin, during the MMCT and found that, this region was dominated by xerophytic shrubs and Chenopodiaceae (> 50%), with an increased presence of cold-tolerant coniferous trees such as Pinus and Picea, indicating a transition to a cooler and more arid climate.

The primary aim of this study is to investigate paleoenvironmental changes and mammalian adaptations in northern China and Central Asia during the MMCO and MMCT. To achieve this, we analyze stable carbon and oxygen isotope data from mammals across various faunas, comparing isotope values from China with those from Europe and North America during the same periods. Additionally, we apply tooth hypsodonty and cenogram methods to explore the diversity of herbivorous mammals and examine the differences in feeding strategies and ecological adaptations among different taxa.

Geological settings

Geological information of Junggar basin and Tongxin region

The northern Junggar Basin is located in the Xinjiang Uygur Autonomous Regions, at approximately 46.5° latitude and 88.5° longitude, with an elevation of about 500 m. Situated in the non-monsoon zone of Central Asia, this region receives an average annual precipitation of 173.7 mm (Source: http://data.cma.gov.cn/). During the Cenozoic era, the continuous subduction of the Indian plate beneath the Eurasian continent led to the intense uplift of the Tibetan Plateau, significantly impacting northern regions. This process resulted in the formation of the Altai Mountains and the subsidence of the Junggar-Hami block, creating continuous sedimentary strata in the Junggar Basin18,21.

The Neogene strata in the northern Junggar Basin are well-exposed, with clear stratification and abundant fossils, making the region an ideal site for studying mammalian evolution and climate change. The Cenozoic stratigraphy of the northern Junggar Basin can be divided into six lithostratigraphic units, listed from bottom to top: Ulungur River Formation (Eocene to early Oligocene), Tieersihabahe Formation (late Oligocene), Sosoquan Formation (23.3–16.9 Ma), Halamagai Formation (16.9–15.5 Ma), Kekemaideng Formation (15.5–13.1 Ma), and Dingshanyanchi Formation (13.1–6 Ma)18,22,23. Each unit has yielded significant mammalian fossils. The Halamagai Formation, characterized by its green color, indicates a reducing sedimentary environment and contains the most abundant mammalian fossils, reflecting a relatively warm and humid environment during the MMCO21. In contrast, the Kekemaideng Formation, with its rusty yellow color and lower fossil diversity, suggests a relatively dry and cold environment, possibly corresponding to the MMCT.

The Tongxin, located in the Ningxia Hui Autonomous Regions at approximately 37° latitude and 106° longitude, has an elevation of about 1500 m. This region lies at the margin of the East Asian monsoon, with an average annual precipitation of 478 mm (Source: http://data.cma.gov.cn/). The Cenozoic subduction of the Indian plate beneath the Eurasian continent also triggered the Qilian-Liupan orogenesis, contributing to continuous sedimentation in the region24,25. The Cenozoic stratigraphy of the Tongxin region is divided into the following units: Sikouzi Formation (Paleocene to Eocene), Qingshuiying Formation (Oligocene to earliest Miocene), Zhangenbao Formation (18–13 Ma), and Ganhegou Formation (late Miocene)25. The Early to Middle Miocene Zhangenbao Formation consists of five stable cycles of sandstone-mudstone alternations, with abundant mammalian fossils discovered in each layer. According to Wang et al.19,20the lower A–C layers of the Zhangenbao Formation, dated to approximately 18.0–15.0 Ma, corresponding to the MMCO. The upper D and E layers, date to approximately 14.5–13.3 Ma, coincide with the MMCT.

Faunas examined and their age and geographic information

In this article, we present a broad comparison of mammalian faunas from mid-latitude regions of Eurasia and North America during the Miocene (Fig. 1). The new data primarily come from the faunas of the Junggar Basin and Tongxin region. Four faunas from these regions were carefully studied:

Fig. 1
figure 1

Geographic locations of different fossil sites in the Northern Mid-latitudes. Red circles represent early and middle Miocene study regions, while orange circles indicate late Miocene regions, as mentioned in the text (Table 1). The green dotted line marks the boundary between monsoon/non-monsoon regions. This geographic map was generated using GeoMapApp (V3.6.15) (https://www.geomapapp.org/).

Table 1 Geographical information of all faunas compared in this study.

Northern Junggar Basin:

1. Halamagai (mid-age, 16.2 Ma, MMCO); 2. Kekemaideng (14.3 Ma, MMCT).

Tongxin region:

3. lower Tongxin (16.5 Ma, MMCO); 4. upper Tongxin (13.9 Ma, MMCT).

The members of these faunas (including carnivores, herbivores, and small mammals) are listed in Data 2. The age of these faunas was determined using paleomagnetic dating18,22.

For comparison, we also examine the following faunas from previous studies:

  1. 1.

    Tunggur fauna, China, approximately N43.5° E113°, representing age, 12.6 Ma (13.6–11.6 Ma, MMCT)26.

  2. 2.

    Tuosu fauna (Qaidam Basin), China, approximately N37° E97°, representing age, 10.7 Ma (11.2–10.2 Ma, Late Miocene)24.

  3. 3.

    Lantian Fauna, China, approximately N34° E109.5°, representing age, 9 Ma (10.8–7.2 Ma, Late Miocene)27.

  4. 4.

    Baode Fauna, China, approximately N39° E111°, representing age, 6.8 Ma (7.2–6.4 Ma, Late MiTocene)28.

  5. 5.

    Sandelzhausen (North Alpine foreland basin), Germany, approximately N48.5° E12°, representing age, 16.3 Ma (16.5–16 Ma, MMCO)13.

  6. 6.

    Steinheim (North Alpine foreland basin), Germany, N48° E9°, approximately 14.3 Ma13.

  7. 7.

    Sansan, France, N43.5° E0.5°, approximately 15 Ma (MMCO/MMCT) (NOW database29).

  8. 8.

    Madrid Basin (“lower Madrid”), Spain, including sites such as Somosaguas, Arroyo del Olivar, Barajas 3 and 7, Paseo de las Acacias, Estación Imperial, PAR Peñuelas, San Isidro, Puente de Toledo, La Hidroeléctrica, and Casa de Campo, approximately N40° W4°, representing age, 15.2 Ma (16–14.5 MMCO)15.

  9. 9.

    Madrid Basin (“upper Madrid”), Spain, including sites such as Paracuellos 3 and 5, Alhambra-Túneles, Henares 1, and Puente de Vallecas, approximately N40° W4°, representing age, 13.8 Ma (14.5–13 MMCT)15.

  10. 10.

    Mascall Formation (“lower Oregon”), Columbia Plateau, Oregon, USA, approximately N44° W118°, representing age, 15.1 Ma (MMCO)30.

  11. 11.

    Sucker Creek, Quartz Basin, and Red Basin (“upper Oregon”), Oregon, USA, approximately N44° W118°, representing age, 13.6 Ma (MMCT) (14.8–12.5)30.

  12. 12.

    Myers Farm, Valentine Quarry, Norden Bridge, and Hottell Immense, Nebraska, Great Plains, USA, approximately N41° W100°, representing age, 14.2 and 13.1 Ma (MMCT)31.

These faunas are all located in the mid-latitude regions of Eurasia and North America, and primarily span the MMCO and MMCT (some extending into the late Miocene). This allows for a comparison of paleoenvironment and mammalian paleoecology in Central and East Asia with those in Europe and North America during major climate transitions in the Neogene. In Table 1, we provide the geographical information for all faunas included in this comparison.

Methods

Stable carbon and oxygen isotopes analysis of tooth enamel

We collected 187 enamel samples from fossil teeth of various herbivorous mammals in the Junggar Basin and Tongxin region for stable carbon and oxygen isotope analysis. All tooth specimens from the early and middle Miocene of northern China were housed at the Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing, China. Stable isotope analyses were conducted at the National High Magnetic Field Laboratory, Florida State University, USA.

For comparison, we analyzed previously published δ13C values from other mid-latitude regions (Table 1), including the Linxia Basin and Tunggur region in China, as well as data from Europe and North American, such as Sandelzhausen and Steinheim (Northern Alpine Foreland Basin, Germany), the Madrid Basin (Spain), the Columbia Plateau (Oregon, USA.), and the Great Plains (Nebraska, USA.)12,13,14,15,32,33,34,35. In timeline, we also extend the comparison to late Miocene localities in China, including the Qaidam Basin, Lantian, and Baode regions. For the detailed data sources, please see Data 1.

To extract enamel samples, small patches were cut from the tooth or drilled along the entire length using a rotary drill. The samples were ground into fine powder (2–3 mg). These were then pre-treated with 5% sodium hypochlorite (NaOCl) overnight to remove organic contaminants and cleaned with distilled water. The samples were further treated with 1 M acetic acid overnight to remove non-structural carbonates and cleaned again with distilled water. The samples were then freeze-dried32,36. The dried enamel samples were reacted with 100% phosphoric acid (H3PO4) at 25 °C for approximately 72 hours36,37.

Carbon and oxygen isotope data were measured using a Finnigan MAT Delta Plus XP stable isotope ratio mass spectrometer coupled with a Thermo Scientific GasBench II at Florida State University. The lab standards used included MERK, MB-CC, ROY-CC, and PDA. Results are reported in the standard delta (δ) notation as δ13Cenamel and δ18Oenamel values relative to the international carbonate standard VPDB (Vienna Pee Dee Belemnite). The diet δ13C values of proboscideans using an enrichment factor (ε*) of 13‰ for non-ruminants like proboscideans, and 14.1‰ for ruminants36,38. However, Tejada-Lara et al.39 demonstrated a strong and significant correlation between δ13C enrichment and body mass across herbivorous mammals. For example, ε*diet−bioapatite values vary from 12.89‰ in pigs (Sus scrofa) and 14.65‰ in cattle (Bos taurus) among Artiodactyla, to 13.2‰ in zebra (Equus burchelli) and 14.4‰ in black rhinoceros (Diceros bicornis) among Perissodactyla. Smaller-bodied species such as the house mouse (Mus musculus) show lower values (9.1‰), with species like the giant panda (Ailuropoda melanoleuca), koala (Phascolarctos cinereus), and three-toed sloth (Folivora: Xenarthra) clustering around 10–10.5‰. According to Tipple et al.40 estimates, the CO2 levels in the middle Miocene atmosphere was approximately 2–1.5‰ higher than those of today. When comparing the mean plant δ13C values from modern C3 biomes with those from the middle Miocene in Junggar and Tongxin, several factors were considered: Latitude. C3 vegetation receiving equivalent light and water has δ13C values that vary with latitude by ~ 0.3‰/10°41. Altitude. Carbon isotope values in C3 vegetation receiving equivalent light and water vary with altitude by ~ 0.65‰/1000 m41.

To assess potential diagenetic alteration, Fourier Transform Infrared Spectroscopy (FTIR) analysis was conducted on a subset of pre-treated samples. The aim was to determine the crystallinity of hydroxyapatite in the enamel, which helps evaluate diagenetic alteration and its impact on the isotopic composition. FTIR analysis was performed using a NICOLET 6700 infrared spectrometer, with the crystallinity of minerals in 20 samples analyzed under the following parameters: resolution of 4 cm⁻¹, 32 scans, spectral range of 4000–400 cm⁻¹. The analysis revealed that the primary component of the samples is apatite, and for most samples, the Phosphate Crystallinity Index (PCI)—which indicates that higher PCI values correspond to higher mineral crystallinity—fell within the range typical of modern dental enamel apatite (3.5–3.8). The Crystallinity Index (CI) measures the degree of the PO₄³⁻ band splitting and is defined by CI = (A₆₀₅ + A₅₆₅)/A₅₉₀, where Aₓ is the absorbance at wave number x42,43. The crystallinity of enamel apatite, along with the carbonate content, showed no significant changes in the samples. Therefore, it can be concluded that most of the mammalian enamel samples have retained their original isotopic information.

To assess regional differences in the carbon enamel isotopic composition of mammals, we conducted two-tailed t-tests on δ¹³Cenamel values between each pair of regions. Statistical significance was determined at the 95% confidence level (p < 0.05). All statistical analyses were performed using PAST software, and the detailed results, including mean differences, t-values, p-values, and significance levels, are provided in the Table 2.

Table 2 Statistical significance of differences in the mean carbon isotopic compositions of mammals between different regions as determined by two-tailed t-tests.

Cheek tooth hypsodonty

Analyzing hypsodonty (molar tooth crown height) in large herbivorous mammals from a specific region allows inferences about environmental openness and climatic conditions (e.g., wet or arid conditions) in that area7. Increased hypsodonty is an adaptive response to the growing need for wear resistance and functional durability. This adaptation arises from the prevalence of fibrous or abrasive vegetation in environments that are progressively more open and arid-adapted44. We examined and classified the hypsodonty of herbivorous taxa from several regions, including Halamagai, Kekemaideng, lower Tongxin, upper Tongxin, Tunggur, Tuosu, Lantian, Baode, Sandelzhausen, Steinheim, Sansan, “lower Madrid” and “upper Madrid”, and compared these data with previously published data from North America Great Plain45. Our data were primarily sourced from the NOW Database.

Taxa were categorized into three classes: brachydont (hypsodonty < 0.8), mesodont (between 0.8 and 1.2), and hypsodont (> 1.2). To determine the mean hypsodonty value for each locality, we calculated the average score across its herbivorous species. For genera not found in the database, we measured the crown height (H) and length (L) of the second upper or lower tooth (M2 or m2), and calculated the ratio of H/L (using any typical individual or the population mean. Stacked area graphs of faunal hypsodonty from China, Europe, and North America were plotted across the age sequence. For data sources, please see Data 2.

Cenogram analysis

Cenograms are a useful method for exploring the relationship between the faunal body mass distribution and environmental conditions46. We produced cenograms of the following faunas: Halamagai, Kekemaideng, lower Tongxin, upper Tongxin, Tunggur, Tuosu, Lantian, Baode, Sandelzhausen, Steinheim, Sansan, Sangmelina (Cameroon) and Lokori (Kenya). The latter two serve as modern savanna representatives46. Cenogram data for Sandelzhausen, Steinheim, and Sansan were taken directly from Costeur et al.47. For the remaining faunas, body mass data for each herbivore and small mammal was sourced from previous publications (primarily after NOW database) or estimated using linear regression methods.

For large herbivores, we use the regression equation from Janis45 as follows:

log10 (mass in kg) = intercept + slope × log10 (X in cm).

where X represents the m2 or M2 tooth length (using either a typical individual or the mean of a population), and the intercept and slope values were provided by Janis (1990).

For small mammals, we use the regression equation from Legendre48 as follows:

ln (mass in g) = intercept + slope × ln (X in mm2).

where X represents the length × width of m1 (using either a typical individual or the mean of a population), and the intercept and slope values were provided by Legendre48.

Detailed calculation for each taxon is listed in Data 2 and Data 3. To minimize the influence of different sample sizes across the faunas, we replaced the rank order on the x-axis with the rank percentile.

Results and discussion

δ13C and δ18O of tooth enamel of various mammals from Junggar Basin and Tongxin region, northern China

The mean δ13Cenamel value (± standard deviation [SD]) for all Junggar Basin mammals, irrespective of taxon or level, is −9.1‰ ± 1.3‰, ranging from − 11.5‰ to −4.9‰ for 97 samples (n = 97) (Fig. 2). Within this fauna, giraffoid with high tooth crowns, possibly belonging to Climacocerotidae (cf. Vittoria sp.), has the highest mean δ13C value at −6.7‰ ± 0.2‰, although it is represented by only 2 samples. Proboscidea also have a relatively high mean δ13C value at −8.9‰ ± 1.3‰ (n = 33), as well as the widest range in values, from − 11.5‰ to −4.9‰. Among Proboscidea, Choerolophodon having the lowest and Platybelodon having the highest mean δ13C values. Differences in δ13C values among species likely reflect variations in their habitats and feeding habits. In Junggar Basin, the majority of the fossil mammals exhibited δ13C values consistent with C3 feeding, this suggests an open habitat primarily dominated by C3 plants. Examining proboscidean material in the Junggar Basin, the δ13C values exhibit a notable variation, particularly Platybelodon with the highest δ13C values, which suggests variable diets, ranging from a pure C3 diet to diets consisting of either mixed C3-C4 vegetation or primary C3 plants under water-stress conditions, and thus an expansion into more open habitats23. The taxon displaying the lowest mean δ13C value is Giraffidae, which has a mean δ13C value of −10.4‰ ± 0.3‰ (n = 2). Cervidae have the second lowest mean δ13C value at −10.1‰ ± 0.4‰, ranging from − 10.5‰ to −9.3‰ (n = 6). These values suggest that both Giraffidae and Cervidae may have had an affinity for forested habitats and a preference for pure C3 diets.

Fig. 2
figure 2

Plot of δ13Cenamel values vs. δ18Oenamel values of various mammals from Junggar Basin and Tongxin Region. Each point represents the δ13C and δ18O values based on bulk samples from one individual tooth.

The mean δ18Oenamel value of all Junggar Basin mammals, irrespective of taxon or level, is −10.0‰ ± 2.4‰, with a range from − 15.8‰ to −4.2‰. The highest mean δ18O value among the different taxa occurs in the Moschidae, which has a mean value of −7.6‰ ± 2.2‰, This implies that Moschidae primarily sourced their body water from leaf water or significantly evaporated lake/pond water. The proboscidean taxon has the most negative mean δ18O value at −11.2‰ ± 1.2‰, with a range in values from − 12.4‰ to −7.5‰ (n = 33), suggesting reliance on relatively stable and less evaporated water sources such as perennial rivers or large lakes. This interpretation is supported by modern isotopic data, which show that surface waters in the Junggar Basin today typically have δ¹⁸O values between − 12‰ and − 6‰49. The mean δ18O value of Discokeryx xiezhi is −7.8‰ ± 3.6‰, its δ18O values are significantly different from those of all other taxa due to the widest range in values, from − 14.8‰ to −4.0‰ (n = 9), exhibiting the greatest overall isotopic variability.

The mean δ13Cenamel value of the Tongxin region mammals was determined to be −10.2‰ ± 1.1‰, closely resembling that of the mammals analyzed from the Junggar Basin (Fig. 2). Notably, Rhinocerotidae displays the highest mean δ13C value of −9.1‰ ± 0.6‰, with values ranging from − 10.0‰ to −8.2‰ (n = 9). Suidae, on the other hand, exhibits the most negative mean δ13C value of −11.0‰ ± 0.9‰, with values ranging from − 12.8‰ to −9.6‰ (n = 13). Regarding the oxygen isotope data, the mean δ18Oenamel value of all Tongxin Region mammals was determined to be −6.4‰ ± 2.5‰, with a broad range from − 12.1‰ to 1.5‰ (n = 90). The Moschidae group display the highest mean δ18O value of −2.7‰ ± 4.2‰, with values ranging from − 10.7‰ to 1.5‰ (n = 7). Palaeomerycidae has the most negative mean δ18O value of −7.4‰ ± 0.4‰ (n = 2). Proboscideans have the second most negative mean δ18O value of −7.2‰ ± 1.5‰, encompassing a range from − 10.3‰ to −5.2‰ (n = 26).

The isotopic differences among species may indicate resource partitioning. Previous studies have shown that the δ18O of enamel in obligate drinkers generally reflect the δ18O of local water sources50,51whereas non-obligate drinkers are more strongly influenced by the δ18O of food plants. Besides, in the Junggar Basin, the low δ18Oenamel values likely reflect consumption of cold, high-altitude waters with limited seasonal variation, due to low summer rainfall and minimal evaporation. In contrast, higher δ18Oenamel values observed in Tongxin specimens suggest a mixed water sources and strong seasonality. During the summer, monsoonal rainfall and evaporation likely enriched surface water δ18O. For a paleoenvironmental perspective, the δ18O contrast implies that the Miocene climate of Junggar Basin was characterized by a cooler, more continental and non-monsoonal regime with less evaporative moisture loss. In comparison, Tongxin region likely experienced warmer, more seasonal conditions, influenced by summer monsoon incursions and greater aridity.

Stable isotopes in tooth enamel from different regions during the miocene

Figure 3 clearly show that the δ13Cenamel values from the Junggar Basin are generally less negative compared to those from other regions, particularly during the MMCO, a globally warm and humid period. The δ13C values from the Halamagai in the Junggar Basin are significantly higher than those from Tongxin and Linxia (monsoon margin), suggesting a more arid environment. The higher δ13C values in enamel from the Junggar Basin indicate either severe water-stressed conditions or the consumption of small amounts of C4 grasses by some animals32. Wang et al.19 identified C4 components in both pollen and paleosols from the strata of Halamagai, indicating that some mammals may have partially consumed C4 plants, although C4 plants were not a major part of their diet. The δ13Cenamel values of the Kekemaideng fauna are also relatively high, approaching those of Halamagai, and being less negative than those of the Tongxin and Linxia in the monsoon region, as well as the Tunggur fauna in the marginal monsoon region. Notably, only the δ13C range from Nebraska, in the Great Plains of the USA, approaches that of the Kekemaideng fauna (Fig. 3; Table 2). Nevertheless, the δ13C values from the Junggar Basin suggest the most open ecological environment and driest climate conditions across the mid-latitudes of the Northern Hemisphere. Kaya et al.52 based on the genus-level faunal resemblance index and paleoclimate modeling, proposed that the Baode fauna belongs to the Old World savanna palaeobiome of Central Asia. In the Junggar Basin, there were noticeably higher δ13C values during the Mid-Miocene, ranging from − 11.5‰ to −6.5‰, with an average of −9.1 ± 1.3‰. This range is closely similar to that of the Baodean fauna (−11.5‰ to −4.4‰, average − 8.6 ± 1.5‰), reflecting a relatively open or arid paleo-environment in Junggar Basin, similar to open and savanna habitats in the Baode region. The fauna of the Junggar Basin points to an open savanna-like environment.

Fig. 3
figure 3

Plot of δ13Cenamel values of various mammals across different regions from 18–5 Ma. The green and yellow curves represent the average and maximum enamel δ13C values for C3 diet, respectively, adjusted for past changes in the δ13C of atmospheric CO2 (Tipple et al., 2010). Data source: Sandelzhausen (Tütken et al. 2009); Steinheim (Tütken et al., 2006); Madrid Basin (Domingo et al., 2012); Oregon (Drewicz, 2018); Nebraska (Nguy and Secord, 2022); Linxia Basin (Wang and Deng, 2005); Qaidam Basin (Zhang et al., 2012); Tunggur region (Zhang et al., 2009); Baode and Lantian regions (Passey et al., 2009); others, this study.

The Junggar Basin (46°–47°N) and the North Alpine Foreland Basin (48°–49°N) have similar latitudes, but they diverge in terms of aridity and openness, clearly revealed by the tooth enamel δ13C data (Fig. 3; Table 2). While the North Alpine Foreland Basin was relatively humid during the Miocene, benefiting from moisture brought by westerlies from the Atlantic, the Junggar Basin, and more broadly the Central Asian region, experienced a markedly arid climate53. This aridity is thought to have been driven by several factors during the Miocene. First, the uplift of the Tibetan Plateau has contributed to the arid conditions during the late Cenozoic54,55. Second, the retreat of the Paratethys Sea has led to reduced rainfall, further intensifying aridity56. Last, global cooling has resulted in reduced water vapor and increased drought in Central Asia57. In contrast to the North Alpine Foreland Basin, where westerly moisture from the Atlantic provides humidity, the moisture reaching the Junggar Basin from the west is significantly depleted. Only the Indian Ocean monsoon contributes some summer precipitation to the region. Because of these influences, the Junggar Basin remains relatively arid overall.

The MMCO and MMCT represent key climatic events during the Middle Miocene. Around 14 million years ago (Ma), global temperatures began to decline, coinciding with the expansion of the Antarctic ice sheet, marking the onset of the MMCT58. In China, several indicators, such as pollen records, chemical weathering patterns, and anisotropy of magnetic susceptibility (AMS), demonstrate changes between the MMCO and MMCT19,57. However, tooth enamel stable carbon and oxygen isotope data from different regions do not show significant changes around the MMCT, which may suggest that despite environmental transformations, faunal dietary preferences in mid-latitude Eurasia remained largely unchanged. This implies that these mammals did not rapidly adjust their diets in response to the evolving climate. Both the northern Junggar Basin and the Tongxin region experienced a marked decline in biodiversity (Fig. 4), indicating heightened competition and ecological pressures driven by deteriorating climatic conditions. These pressures likely forced species to either adapt, migrate, or face extinction (see below).

Fig. 4
figure 4

Stacked area plots representing the diversity of large herbivores in representative mid-latitude faunas, categorized into three groups: hypsodonts (yellow), mesodonts (light cyan), and brachydonts (dark green). Data for the U.S.A. are from Janis (2000).

Herbivorous feeding ecology changes during the miocene

The feeding ecology of large herbivores (e.g., proportion of browsers, mixed feeders, and grazers) can reflect local paleoenvironment conditions. In the present analysis, we counted the numbers of brachydont, mesodont, and hypsodont taxa and their proportions within relevant faunas to roughly access their feeding ecology, following the method of Janis45. In Fig. 4, European faunas show very high ratios of brachydont taxa, with mesodonts and hypsodonts being very rare in both in MMCO and MMCT. Europe remained wet and maintained closed habitats during the MMCT, resulting in minimal changes in herbivore feeding ecology despite the global cooling event. In Central and East Asia and North America, the proportions of the three categories are similar, with high proportions of brachydont and mesodont taxa, and few hypsodont taxa. Interestingly, during the MMCT in Central and East Asia, the total number of herbivores decreased greatly, with a considerable loss of brachydonts and mesodonts. In contrast, North America showed an obvious increase in brachydonts and overall herbivores during the MMCT, even though both Asia and America displayed open woodland and savanna-like ecosystems. This suggests that the global cooling of the MMCT had an impact on mammal diversity in Central and East Asia, possibly because these faunas were located in non-monsoon or monsoon margin regions, which are more sensitive to global climatic changes. Given that the δ13Cenamel and δ18Oenamel show little differences between MMCO and MMCT in Halamagai and Tongxin faunas, it could be inferred that large herbivores continued to utilized similar plant sources, although the global cooling event may have led to a decline in plant. This, in turn, likely intensified interspecific competition and caused a loss in large herbivore diversity. The Tuosu fauna of Qaidam Basin at 11–10 Ma shows the lowest ratio of brachydonts and highest ratio of hypsodonts, indicating dry conditions in the non-monsoon region of Central Asia (Fig. 4), possibly due to the high elevation of northern Tibetan Plateau during this period54. In the Tuosu fauna, the genera present are entirely new to Central and East Asia (either migrated from elsewhere, like Hipparion, or newly evolved, see Data 2). However, in the Late Miocene of East Asia, compared with the faunas of the Great Plains of North America, the numbers of brachydonts and mesodonts are relatively high, potentially due to the relatively wetter conditions caused by the intensification of East Asian monsoon system19,20.

Mammalian adaptation to open environments

To examine the adaptation of various mammals to open environments, we compared δ13Cenamel values across different mammalian groups from Junggar Basin, Tongxin region, Germany, Spain, and the USA during the MMCO and MMCT (Fig. 5). The groups analyzed include elephantiforms, rhinos, horses, large and small size ruminants, and camelids. Each group likely occupied similar ecological niches within their respective fossil communities. Biome classification follows Nguy and Sceord31.

Fig. 5
figure 5

δ13Cenamel values of various herbivorous mammals from MMCO and MMCT localities across Junggar Basin, Tongxin region, North Alpine foreland Basin (Germany), Madrid Basin (Spain), Oregon and Nebraska (U.S.A.), illustrating the adaptation of different taxa to open environments. (a) Elephantiformes (non-deinothere proboscideans), (b) Rhinoceratidae, (c) Equidae, (d) Giraffomorpha (Giraffidae, Prolibytheriidae, Climacoceratidae, and Palaeomerycidae), Dromomerycidae, and Camelidae (tall ruminants and camelids), (e) Bovidae and Antilocapridae (high-crowned ruminants), (f) Cervidae, Moschidae, Blastomerycidae, and Leptomerycidae (low-crowned ruminants). The color spectrum in each panel represents different biomes (Nguy and Secord, 2022): blue, closed canopy; dark green, possible riparian within open canopy; light green, open canopy; yellow, water-stressed C3 biomes; orange, mixed C3/C4 biomes. Gray circles represent individual sample points, the stars represent means δ13C values, error bars show the standard deviation, and boxes represent the medians and quartiles for each group.

Elephantiformes (non-deinothere proboscideans) (Fig. 5a): Proboscideans in the Junggar Basin exhibit the highest δ13C ranges across all regions, even compared with any other mammalian groups in any regions. As the largest members of these communities, proboscideans showed a strong tendency to expand into open environments, particularly in the Junggar Basin and Tongxin region, where Platybelodon played a significant role in this expansion23. Platybelodon, adapt at consuming grass in open habitats, utilizing its well-developed trunks to gather grasses and then its shovel-shaped lower jaws to cut them23. In Europe and North America, proboscideans were mainly represented by Gomphotherium59 which lived in relatively closed environments (e.g., woodland or grassland margins) than Platybelodon.

Rhinoceratidae (Fig. 5b): As the second largest mammalian group after proboscideans, rhinoceroses displayed δ13C patterns similar to those of proboscideans during the MMCO period, with the highest mean δ13C values observed in Junggar Basin and the lowest in Germany. Rhinocerotids also showed a clear tendency for expansion into open environments across all regions, except in Germany. In China and Spain, elasmotheriines (Hispanotherium, Tongxinotherium and Caementodon) were prominent components60. These taxa developed high-crowned cheek teeth with substantial cementum, which allowed them to feed on grasses in more open habitats. In contrast, German rhinos were primarily represented by the low-crowned species Brachypotherium and Lartetotherium61 which likely inhabited relatively closed environments or riparian zones within more open landscapes31. In North America, the typical rhino was Teleoceras, a hippo-like amphibious mammal62 which also exhibited adaptations for expansion into more open environments.

Equidae (Fig. 5c): Horses exhibited significant diversity in North America, encompassing both low-crowned Anchitheriinae and high-crowned Equinae (Hipparioni and Equini). These groups occupied a wide range of ecological niches63. The equines, in particular, demonstrated strong adaptations to open environments31. Notably, δ13C values from North American horses show an increasing shift between MMCO and MMCT, indicating a significant adaptive transition. The rapid evolution and spread of high-crowned Equinae during MMCT likely facilitated their adaptation to the increasingly open habitats, setting the stage for their eventual migration into Eurasia in the late middle Miocene. In Eurasia, Anchitherium was the only equids genus64. Despite their low-crowned teeth, Anchitherium also exhibited adaptations to more open habitats, particularly in Junggar and Spain.

Tall ruminants and Camelidae (Fig. 5d): In large ruminants, Giraffoids were confined in Eurasia, while dromomerycids were found in North America. Although camelids of North America are not ruminants, their similar body type suggests that they likely occupied niches comparable to those of giraffoids65. The giraffoids in the Junggar Basin showed a strong adaptation to open environments, particularly the high-crowned taxa such as Discokeryx xiezhi22 and Vittoria sp. In contrast, in other Eurasia regions, low-crowned palaeomerycids inhabited relatively closed environments. Camelids from North America also indicate adaptations to open environments, though their adaptations were not as pronounced as those of giraffoids in the Junggar Basin.

Small-sized ruminants (Fig. 5e, f): Bovids (in Eurasia) and antilocaprids (in North America) are small-sized, high-crowned ruminants66. Across all regions, they exhibited stronger adaptations to open environments compared to the small, low-crowned ruminants (such as cervids and moschids in Eurasia, and blastomerycids and leptomerycids [Pseudoparablastomeryx] in North America)67 although the differences are not large. Interestingly, despite having higher tooth crowns, the δ13C values of bovids were lower than those of the giraffoids in the Junggar Basin. This suggests that body size may have been a more significant factor in determining the ability to expand into open environments. As bovids generally increased in size during the late Miocene, they demonstrated an enhanced ability to occupy more open habitats.

Environmental changes during the Miocene likely influenced mammalian evolution, with different animals exhibiting varying abilities to expand into open environments in China. Large mammals, such as Rhinocerotidae and Proboscidea, showed a greater tendency to spread into more open environments. In contrast, medium-sized to small mammals, such as cervids, tended to prefer riparian areas within woodlands or relatively closed forests. This difference in habitat preference suggests that factors beyond just environmental changes played a role in shaping mammalian distribution. Among these, size and body mass appear to be one of the primary factors influencing the ability to expand into more open environments.

Cenograms reveal a savanna environment since the late early miocene of central Asia

The cenogram is an effective method for estimating the structure of the mammalian community, especially for distinguishing between closed and open environments (Fig. 6a). It was observed that Sandelzhausen and Sansan (MMCO) had more medium-sized mammals and a smaller break, which may indicate a more closed environment than that of China (Fig. 6b, c). This is consistent with stable carbon and oxygen isotope data, which reflect a relatively closed environment and humid climate in Europe. In contrast, a break appears in the cenogram of Steinheim fauna, indicating the emergence of a relatively open habitat in the North Alpine Foreland Basin during MMCT.

Fig. 6
figure 6

Cenograms of mammalian assemblages from different regions. (a) modern faunas: Sagmelina, Cameroon (forest), and Lokori, Kenya (savanna) (data from Legendre 1989), (b) Junggar Basin, (c) Tongxin region, (d) Sansan (data from Costeur et al., 2012) and Tunggur, (e) North Alpine foreland Basin (data from Costeur et al., 2012), (f) Lantian and Baode. The blue solid and red open circles represent MMCO and MMCT mammalian assemblage, respectively.

In Central and East Asia, the records of the Halamagai and upper Tongxin faunas (MMCO) are relatively complete (Fig. 6d, e), revealing a noticeable gap in the cenogram. This gap occurs between ln6.6–ln9 (body mass between 0.76 kg and 10 kg) (Data 3), indicating the absence of medium-sized herbivores and cenograms suggest open environment. This pattern of weight distribution is similar to that of a modern savanna climate but with a higher prevalence of extremely large animals (greater than ln12 or 178 kg). This suggests a climate potentially moister than the contemporary savanna, which could support more megafauna. The number between ln16–ln14 in the Kekemaideng and upper Tongxin faunas was greater than in modern Lokori.

The mammalian diversity of the Kekemaideng and upper Tongxin faunas (MMCT) is significantly reduced. Although the fossil records are incomplete, the gap of medium-sized herbivores is remains present (Fig. 6d, e), indicating an open environmental structure. In contrast, the cenogram of Tunggur region (MMCT, monsoon margin) become more continuous, possibly associated with increased ecological abundance due to the strengthened East Asian monsoon (Fig. 6c). By the late middle Miocene, in Lantian and Bode faunas, the gap in medium-sized herbivores further increased, reflecting a more intensified open environment (Fig. 6f). Estimating body mass and distinguishing foregut from hindgut digesters can enable more refined paleoecological reconstructions68 particularly by applying different isotopic enrichment factors to better estimate the proportion of C₄ and C3 plants in paleodiets68. In future studies, we plan to incorporate body mass and digestive physiology into our analyses to improve the accuracy of paleoecological reconstructions. Collectively, these results indicate that open savanna-like environments appeared in northern China and Central Asia as early as the late early Miocene.

Conclusions

This study, through a comprehensive comparison of stable carbon and oxygen isotope data, cenogram analysis, and hypsodonty, reveals the evolution of open habitats and the adaptive strategies of mammals in northern China and Central Asia during the Miocene.

By comparing isotope data across different regions, we observed relatively high δ13C values in the Junggar Basin, indicating either a more open habitat or water-stressed conditions. Cenogram analysis, along with tooth hypsodonty, also supports the emergence of open and savanna habitats in northern China as early as the late early Miocene. This pattern became more pronounced in the Kekemaideng, Lantian, and Baode faunas of the middle to late Miocene, indicating that open savanna-like habitats widespread in northern China and Central Asia from the middle Miocene to late Miocene.

In terms of mammalian adaptation to open environments, large herbivores such as proboscideans, rhinocerotids, and giraffoids were the first to expand into open landscapes. Platybelodon played a key role in this transition in the Junggar Basin and Tongxin region, demonstrating a high degree of adaptation to open, savanna-like habitats. Meanwhile, bovids and antilocaprids showed potential for adaptation to open environments, but their expansion was less pronounced compared to larger taxa. In North America, equids exhibited a significant ecological shift, particularly during the MMCT, as high-crowned Equinae rapidly diversified in response to increasingly open habitats. In contrast, small-sized ruminants tended to prefer riparian areas within woodlands or relatively closed forests. Despite significant environmental changes, δ13Cenamel and δ18Oenamel data suggest that mammalian dietary preferences remained relatively stable across the MMCT, with many species continuing to rely on their original plant resources. This likely intensified interspecific competition, ultimately leading to a decline in species diversity. This trend ended in the early late Miocene, when high-crowned taxa became dominant, marking a further stage of mammalian adaptation to open environments.

In conclusion, our findings suggest that open savanna habitats emerged in northern China as early as the late early Miocene and expanded further during the late Miocene. Mammalian adaptation to these changes exhibited a size-dependent pattern, with large-bodied taxa leading the initial expansion, followed by high-crowned taxa, which became dominant in the late Miocene. This ecological transition not only highlights parallel evolutionary trends between East Asia and North America during the Miocene but also provides key paleoecological insights into the formation of modern grassland ecosystems.