Introduction

Predictions suggest an intensifying global evaporation-minus-precipitation pattern due to ongoing global warming, implying an overall trend of increasing aridity in already dry regions1. For the Atacama Desert, greenhouse warming projections suggest a 15–30% decrease in rainfall by the end of the 21st century2. Despite this, El Niño Southern Oscillation (ENSO) is expected to continue delivering intense rainfall events and storms to this arid region2,3. However, the future behavior of ENSO and the tropical Pacific circulation remains a subject of debate4,5, creating uncertainty about their impact on South America. Strong model biases in the South-Eastern Pacific6,7, uncertainty in South American monsoon simulations8,9, and long-term vegetation feedbacks on regional hydrology influenced by CO2 physiological effects10 further add complexity to predictions about the future of the arid Pacific coast of South America—stretching from the northern Peruvian coast to Argentinean Patagonia—and its habitability. Here, we present a unique paleontological record from the Coastal Peruvian Desert (Fig. 1), arguably the northern extension of the Atacama Desert, preserved from the Late Miocene (8 to 6 million years ago (Ma)). With atmospheric pCO2 levels estimated between 350 and 450 ppm, this period of global warmth is considered, to some extent, as a useful analog for an equilibrated greenhouse climate11,12.

Fig. 1: Fossil site location and stratigraphic position.
Fig. 1: Fossil site location and stratigraphic position.
Full size image

A Mean annual precipitation (MAP) along the Peruvian western Andean slopes, showing location of the Late Miocene fossil site (yellow filled star), Holocene pollen samples from Paracas Bay and the G-10 core (pentagon), and modern leaf herbarium samples (circles) used for recalibrating and validating MAP equations. B Simplified lithostratigraphic log for the Pisco sediments at the Aguada de Lomas Locality (AGL), indicating existing chronostratigraphic framework for the AGL, as well as stratigraphic position of studied pollen samples (solid circles), wood and leaf-bearing bed (leaf icon), and U-Pb dated ash layers (solid blue stars). c clay, s silt, ss sandstone, g gravel. C Panoramic view of the present-day landscape at the Aguada de Lomas Locality.

The Peruvian coast ranks today among the Earth´s driest regions (annual P < 15 mm)13, with fragmented ecosystems and vast landscapes devoid of vascular plants. This hyperaridity results from the combined influence of the southeastern Pacific anticyclone and the Andean cordillera14. Coastal winds driven by the anticyclone and latitudinal surface temperature gradients promote the upwelling of cool subsurface waters along the South-American Pacific coast, suppressing atmospheric convection15. The Andes Mountains further contribute to extreme aridity by creating a rain-shadow effect that limits westward moisture advection from the Amazon and the Atlantic14. However, modeling studies with reduced Andes heights16,17 and wet episodes during the Plio-Pleistocene, evidenced by large wetland and lake deposits18,19, suggest that global radiative forcing and regional atmospheric circulation play a more significant role in shaping the area’s precipitation regime20. Thus, the region´s extreme aridity is rooted in both its geological and oceanographic characteristics, but is heavily influenced by regional atmospheric dynamics16,17,21.The onset of hyperaridity and the process that drove the desertification of western South America remain highly controversial. One hypothesis proposes that hyperaridity started when the Andes reached ~2500 m height during the Middle Miocene uplift phase (12–10 Ma)22,23,24. However, estimates for the onset of Atacama hyperaridity in Chile range from the late Jurassic to the Late Pliocene25,26,27, a large uncertainty reflecting the diversity of geomorphological and geochemical indicators, and the complexity of interpretations28,29. A well-constrained paleoclimate record from the hyperarid Peruvian coast, arguably the northern extension of the Atacama, could offer insights into the region’s climate evolution. However, this history remains virtually unknown. Understanding past climate and biome dynamics during the Neogene greenhouse period is also critical for assessing the future habitability of this region, now home to ~17 million people.

We present a palynological and paleobotanical Miocene record from the Coastal Peruvian Desert (15.5°S; 74.8°W; Fig. 1), which offers rare insights into a region primarily known through marine deposits. Fossil leaves, wood fragments, and palynological samples (Fig. 2), dating from 8 to 6 Ma, were recovered from the Pisco Formation (“Methods,” Supplementary Figs. 1, 2). These fossils provide direct evidence of Late Miocene coastal biomes. The Pisco Formation, in part, composed of shoreface to inner shelf sediments deposited in semi-protected bays along the Peruvian shelf30, is renowned for the excellent preservation of marine vertebrates31,32,33. The chronology of the 200 m thick sedimentary sequence at the locality Aguada de Lomas (AGL), where botanical fossils were collected, is controlled by biostratigraphy and four stratigraphically consistent radiometric ages (“Methods”; Supplementary Figs. 1, 2). Through palynological and paleobotanical analyses (Supplementary Figs. 35), leaf-based precipitation estimates (Supplementary Fig. 6), and evaluation of climatic niches (Supplementary Figs. 7, 9), we discuss the response of western South American coastal biomes and climate to the Late Miocene greenhouse conditions. Our results offer new perspectives on rainfall patterns in a warmer world, while contributing to the assessment of regional boundary conditions, data/model calibrations, and continental climate variability.

Fig. 2: Late Miocene fossil leaves, wood, and palynomorphs from the Pisco Formation.
Fig. 2: Late Miocene fossil leaves, wood, and palynomorphs from the Pisco Formation.
Full size image

A Wood fragment with Teredolites. B Silicified wood anatomy of Neltuma (ex Prosopis). B1 Cross-sectional view showing axial parenchyma vasicentric and confluent with sclerotic tyloses (arrows). B2 Radial longitudinal section with procumbent cells and sclerotic tyloses. B3 Tangential longitudinal section with 4 to 10 seriate and short vessel element length. B4 Alternate and small intervessel pits. C–P Selected macrofossil plant remains. C Undetermined leaves as found in the field. D Poaceae flower. E Poaceae. F–F1 Arecaceae. F1 closeup of (F). G Rosaceae (Sanguisorbeae, possibly Polylepis). H Salicaceae (Salix cf. humboldtiana). I–L Undetermined leaves. M Myrtaceae. N-N1 Capparaceae (Capparis). N1 closeup of (N). O–Z Selected Late Miocene palynomorphs. Scale bars for palynomorphs is 10 μm. O, P Polylepis-Acaena. Q Bombacidites sp.1, Malvaceae. R Bombacidites sp.2, Malvaceae. S Malvacipolloides maristellae; Malvaceae. T Echiperiporites germeraadii. U Caesalpinoid (Fabaceae). V Polygalaceae. W Araceae. Ceratopteris sp. Y cf. Huperzia. Z Granulatisporis sp.

Fossil evidence for a vegetated desert in the Late Miocene

The plant macrofossils, comprising 55 fossil leaves and three wood fragments, include the genera Capparis, Salix (cf. humboldtiana), and members of the Myrtaceae, Rosaceae (Sanguisorbeae?), Asteraceae, Poaceae, and Arecaceae families (Fig. 2; Supplementary Data S2). The leaves are predominantly non-monocotyledonous, featuring entire margins (53.8%), acute apices (30.8%), and nanophyll leaf sizes (32.7%, range: 25–225 mm2). Silicified wood indicates the presence of drought-resistant Neltuma (ex Prosopis) trees (Fig. S5; Supplementary Data S4). Nine well-preserved palynological samples show a diverse Late Miocene assemblage, dominated by Polylepis-Acaena (35.4%), various Asteraceae morphotypes (19.1%), and the evergreen Podocarpus (2.6%) (Fig. 2; Supplementary Data S2).

These terrestrial macrofossils, found within shallow-marine sediments (Supplementary Figs. 1, 2), could have been transported from the Andes by rivers. However, the associated sedimentary facies show no evidence of alluvial, fluvial, or estuarine deposits (Supplementary Figs. 1, 2), nor have any relict geomorphological features of fluvial transport or riverine incision been identified in the area30,32,34. The well-preserved state of the fossil leaves precludes a long-distance fluvial transport, and longshore currents as a long-distance transport mechanism are also unlikely, given the relatively protected embayment of the coast at that time30,33, suggesting a local origin for the fossil leaves. While long-distance pollen deposition is possible, the prevalence of Polylepis-Acaena grains (over 30%), which have weak dispersal and limited deposition35, further supports a local origin for at least the Polylepis-Acaena component of the pollen record. This new fossil record very likely provides direct evidence of vegetation cover in the Late Miocene lowlands, where a non-vegetated landscape exists today (Fig. 2A).

A diverse yet non-analogous flora

The fossil palynoflora was compared against Holocene pollen assemblages to assess the characteristics of past vegetation. Nine Holocene samples [0–9.6 ky B.P.], collected from sedimentary settings similar to those of the Pisco Formation, served as a modern vegetation benchmark (Fig. 1; Supplementary Figs. 3, 4; “Methods”). The modern pollen assemblage, primarily wind-transported, reflects a flora dominated by alders (Alnus), grasses (Poaceae), sedges (Cyperaceae), goosefoot (Chenopodioideae, Amaranthaceae), and asters (Asteraceae). We found no significant variance in species abundance distribution within assemblages (p value = 0.22, df = 9.56; Fig. S3). However, Miocene palynoflora showed significantly higher diversity, both within-sample and standing, compared to the Holocene (Fig. 3; p value = 0.030, df = 4.1; Supplementary Fig. 3; Supplementary Data S2), along with distinct compositional differences (Fig. 3).

Fig. 3: Diversity and compositional comparison of Miocene and Holocene palynological assemblages.
Fig. 3: Diversity and compositional comparison of Miocene and Holocene palynological assemblages.
Full size image

A Coniss Cluster showing two distinct clusters, Miocene (yellow) and Holocene (blue). B First DCA axis, note Miocene samples are distinct from Holocene ones. C Species accumulation curves per period, boxplots indicate a range of accumulated number of individuals as samples are added to the sample pool. D Chao Dissimilarity Index (CDI) within Miocene (yellow) and Holocene (blue) palynofloras, and dissimilarities between both groups (gray). E Pollen-inferred plant habits. Note significant differences on herbs (p value: 4.114e-05; w = 0), shrubs (p value: 4.114e-05; w = 81), and trees (p value: 4.114e-05; w = 81).

In addition to a higher diversity, the Miocene palynoflora exhibits a higher proportion of Polylepis-Acaena (35.4%), alongside Podocarpus trees, palms, bromeliads, cactoids, caesalpinoids, bombacoids, and ferns. This contrasts with the herbaceous-dominated Holocene assemblage (Fig. 3E; Supplementary Data S2), characterized by grasses, sedges, goosefoot, and asters, typical of short-cycle annual successions36,37,38. We interpret the presence of Alnus (7.2%), a pioneer and prolific pollen producer, as evidence of wind transport from Holocene Andean biomes. The prevalence of trees and shrubs is further supported by the discovery of Neltuma (ex Prosopis) silicified wood (Supplementary Fig. 5) and leaves from Capparis, Salix, and Myrtaceae. The assemblage from this paleobotanical record starkly contrasts with the modern flora and the xeric landscape of the Coastal Peruvian Desert36,37,38.

The Miocene palynoflora of the southern Peruvian coast, with its diverse composition, represents a stable, slow-growing community. While its wooded landscape may resemble local riparian floras or dry forest found north of 8°S today36,37,38, it features a distinct mix of xerophilous and Andean taxa that are currently not found in association with each other. The co-occurrence of lowland elements, such as Neltuma, bombacoids and palms, with abundant modern high-Andean groups, like Polylepis-Acaena and Valeriana, indicates either a mixed lowland-Andean signal or a biome without a modern analog. Such non-analogy would result from a combination of factors, including shifted ecological habitats and competition due to Miocene atmospheric pCO2 levels (400 ppm)12, different climatic conditions17, varying pressures from grazing mammals30,32, plant evolutionary changes, and the lower Andean topography of the Late Miocene28.

Late Miocene coastal precipitation estimates

We reconstructed mean annual precipitation (MAP) for the Late Miocene Peruvian lowlands using the physiognomic characters of 47 fossil leaves (Supplementary Tables S1S3; Supplementary Data S5). We tested and refined existing global relationships between leaf size and MAP39,40,41,42, by adding 366 modern leaf specimens representative of the Peruvian coast (Supplementary Table S2). These adjustments made linear models more accurate for hyperarid settings like those in the Peruvian-Chilean region (Supplementary Fig. 6; Supplementary Data S5). We further introduced secondary vein density (2Dv) as an additional measure of leaf size, acknowledging uncertainty in leaf size estimates due to fragmentary fossil material41,43. Precipitation reconstruction uncertainty was estimated via Monte Carlo resampling (Supplementary Fig. 6; Supplementary Table S3). The reconstructed Late Miocene MAP value of 36.2 ± 4 (1σ) mm significantly exceeds the historical MAP of 9 mm ± 9 (1σ) (interquartile range (IQ): 3–11 mm) recorded along southern Peruvian lowlands (14–17°S, <1500 masl) (Fig. 4A). The Miocene MAP is even 75% higher than historical maxima of 23.7 mm associated with strong ENSO precipitation events (ENSO mean 9.8 ± 7.0 mm) in the area44,45. Comparable MAP values are currently observed above 1500 masl on the western slope of the Andes (Fig. 4B), 30–50 km inland from the present-day coastline (Fig. 4B).

Fig. 4: Late Miocene and modern climate variability along the central Peruvian coast and slopes (14–16°S).
Fig. 4: Late Miocene and modern climate variability along the central Peruvian coast and slopes (14–16°S).
Full size image

A Altitudinal distribution of modern Mean Annual Precipitation (MAP), indicating lowlands (0–1500 masl), slopes (1500–3500 masl), and altiplano settings (>3500 masl). Brown horizontal bar represents maximum paleoelevation values reconstructed for the Central Andes during the Late Miocene46,48. Note change of MAP scale at 100 mm. Boxplots indicate pollen-based western slope MAP estimates during the Late Miocene (yellow) and Holocene (blue). B Climate space, defined in terms of MAP (0–100 mm) vs. mean annual temperature (MAT; 19–24 °C), for present-day coastal Peruvian areas classified as belonging to the Dry Forest Biome (green) and Coastal Desert Biome (blue). Biome classification according to the Peruvian National Service for Natural Areas Protected-SERNANP. C Proportion of present-day Dry Forest Biome coverage across climate space, estimated on a grid with 10 mm precipitation and 1 °C temperature intervals. Gray and yellow boxes show modern (19–20 °C; 9 ± 9 mm) and Miocene (21–24 °C; 36.2 ± 4 mm) coastal conditions, respectively.

Pollen spectra reflect conditions across a broader spatial scale, including the western Andean flanks. MAP estimates for Miocene and Holocene pollen assemblages were derived using climate envelopes of the nearest living relatives (NLR) of palynomorphs (80 morphotypes, 54.1% of the count) and a quantitative coexistence analysis46 (see “Methods,” Supplementary Data S6S8; Supplementary Table S4). The pollen-based Miocene MAP was estimated at 772.2 mm ± 81.82 (1σ) (623.5–867.1 mm), slightly higher but not significantly different from the Holocene MAP value of 745.1 mm ± 95.4 (1σ) (558.6–826.2 mm; p value = 0.527). Compared to the Holocene, the Miocene exhibited enhanced seasonal rainfall fluctuations. Rainfall increased significantly during the austral summer rainfall season (p value = 0.003) and decreased during the austral winter (p value = 0.00009; Supplementary Fig. 7). Holocene pollen-derived MAP values are presently observed above 3000 masl in regions covered by grasslands and shrublands36,38. In contrast, the amount of Late Miocene precipitation, derived from a wooded-dominated pollen assemblage, is observed today above 3500 masl, but occurred necessarily at a much lower altitude at that time when maximum Andean height was estimated to be around 2800–3000 masl46,47,48. These pollen-based quantifications should be interpreted with caution. In arid environments, plants occupy the drier end of their climatic niche, potentially leading to overestimated rainfall when using their climatic optima. Nevertheless, multiple lines of evidence support the qualitative robustness of our results. Enhanced Andean MAP values are in agreement with Middle to Late Miocene climate reconstructions from the western Chilean Altiplano49, northern Central Andean Plateau46, and northern Bolivian Altiplano50, supporting substantially increased rainfall during the Late Miocene, both on the western slope of the Andes and in the lowlands (Coastal Peruvian Desert), as indicated by fossil pollen and leaves.

Our paleobotanical evidence shows that hyperaridity in the southern Coastal Peruvian Desert, and likely across the Atacama Desert was reduced during the Late Miocene (8–6 Ma), suggesting a later onset of extreme coastal aridity. Indeed, although coeval lowland paleobotanical remains have not been reported yet elsewhere, fossil fauna strongly suggests a similar environment in northern Chile during the Late Miocene. Two crocodilian species, indicative of warm and wet conditions, were common along the Peruvian coast and as far south as 27°S in Chile51, near the southern limit of the modern Atacama Desert. The Late Miocene coastal marine ecosystems of Peru and Chile, which included aquatic sloths, marine mammals, and mollusks, were similar52,53, suggesting that the coastal region, from ~12–27°S experienced similar oceanographic and climatic conditions. Our findings provide evidence enabling model-proxy data comparisons for the Coastal Peruvian Desert, which are essential for updating previous vegetation reconstructions from Global Circulation Models that suggest increased aridity north of 25°S during the Tortonian (11–7 Ma)54.

Surplus moisture origin

Three moisture sources may account for additional rainfall in the Coastal Peruvian Desert during the Late Miocene, supporting vegetation expansion and landscape transformation (winter coastal fog, moist air masses advected from the Amazon Basin, and local convection of warm Eastern Tropical Pacific waters). These sources underscore the interplay between coastal landscapes, vegetation dynamics, and atmospheric and oceanic processes linked to the Eastern Tropical Pacific.

Today, winter coastal fogs occur during austral winters along the Peruvian and Chilean coast and are the primary moisture source for “lomas” vegetation36,55 extending from sea level to 1400 m on the Andean foothills. Some Miocene plant genera, such as Neltuma and Capparis, are indeed typical of these “lomas,” potentially representing relicts of ancient vegetation cover. However, fog moisture, favored by cool ocean conditions during the coastal upwelling seasonal maximum, was likely infrequent under the Late Miocene warm climate. In addition to global greenhouse warming, the southeastern tropical Pacific was then 2–4 °C warmer, as a result of a weakened Walker circulation and a reduced East-West sea-surface temperature gradient56,57.

The second potential source involves westward advection of moist air masses from the Amazon Basin, particularly during the austral summer, when the South American monsoon is active14,58. The lower elevation of the Central Andes during the Late Miocene –1.0–1.5 km lower than today46,47,48—could have facilitated this airflow, potentially increasing rainfall and enhancing the western drainage network16,59. However, the impact that the Andean uplift imprinted on the aridity of the Peruvian-Chilean coastal region remains debated [e.g., refs.17,27].

The third, and most likely moisture source is local convection in the warm Eastern Pacific. A permanent El Niño-like state for the entire tropical Pacific during the Late Miocene57,60 has been associated with an El Niño-like precipitation pattern in regions known to be influenced by atmospheric teleconnections across the Tropical Pacific61. This means increased precipitation likely occurred on the Peruvian and Chilean coasts, as it occurs today during strong El Niño events45. This effect, amplified by higher atmospheric vapor content in a greenhouse climate62 and increased local evaporation and convection from a shallow-water sea along the southern Peruvian coasts30,32,33 likely drove the greening of the Coastal Peruvian Desert during the Late Miocene. We consider this mechanism also to be the primary moisture contributor to the western-flowing fluvial drainages.

Climate-driven biome shift, lesson from the past

The Late Miocene species community lacks a modern analog, making vegetation changes best characterized in terms of biome shifts. Modern climatic envelopes, defined using historical climate data from 1970 to 2000, were used to model the range of climatic conditions in which current biomes can persist (See “Methods”). Today, the northern Peruvian coast hosts a dry forest biome (DFB) from 3.4 to 7.8°S, thriving in precipitation ranges from 140 to 305 mm/yr (all climate ranges are given as IQ) and a mean annual temperature (MAT) range of 23–25.3 °C (Fig. 4). The DFB disappears when rainfall drops below 30 mm/yr. The coastal desert biome (CDB), extending from 6.8 to 27°S, exists under a precipitation range of 22–29 mm/yr and MAT of 21.4–22.4 °C (Fig. 4; Supplementary Fig. 9). Between 6.5 and 8°S, a transitional zone emerges, displaying a mosaic of vegetation patches that reflect a southward-decreasing rainfall gradient. Today, the CDB is the sole biome in the Pisco Formation area (Fig. 4B; 13–16.5°S; 0–700 masl), with a MAP of 10 mm (IQ: 2–11 mm) and MAT from 19 to 20 °C44. Considering a 2–4 °C temperature increase in the Late Miocene, as indicated by past SST reconstructions56, and the precipitation range reconstructed from fossil leaves, the coastal climate around the Pisco area covered an expanded climatic space (32–40 mm/yr; 21–24 °C). Based on its modern biome distribution, the likelihood of CDB occurrence in such a climatic space is less than 25% (Fig. 4C). Notably, with 2–4 °C warmer MATs, precipitation would only need to reach 45–60 mm/yr to sustain a permanent wooded greenery, which is consistent with our leaf-based MAP reconstruction (Fig. 4C).

The paleobotanical record reveals that increased rainfall allowed the development of a different continental landscape in the southern Peruvian lowlands. Additional moisture enabled the expansion of a diverse and wooded vegetation, with no present-day analog, supporting sub-xeric to mesic groups, such as Podocarpus, Polylepis-Acaena, Neltuma (ex Prosopis), bombacoids, palms, and various other taxa (Fig. 5). Such climate conditions facilitated both the southward expansion of the coastal DFB and possibly the survival of Andean groups, such as the Polylepis-Acaena clade. This flora then provided sustenance and suitable habitats for a variety of continental mammals (Fig. 5), including the hystricognath rodent Tetrastylus, medium-sized macraucheniid litopterns, small marsupials33, and the carnivoran procyonid Cyonasua63, while warm marine conditions were suitable for sea grass feeders like the aquatic sloth Thalassocnus64.

Fig. 5: Artistic representation of the Central Peruvian coastal landscape during the Late Miocene.
Fig. 5: Artistic representation of the Central Peruvian coastal landscape during the Late Miocene.
Full size image

This illustration emphasizes the primary floristic components identified in the fossil record, including Neltuma, Podocarpus, Salix, palms, and bombacoids, along with known continental vertebrates, such as the hystricognath rodent Tetrastylus, the procyonid Cyonasua, marsupials and macraucheniid litopterns. The depiction is intended to convey a generalized view of the landscape rather than a precise reconstruction, as the ecological affinities and habitats of many palynofloral and paleobotanical elements remain unresolved.

Climate models simulations of future greenhouse climate consistently predict a circulation change in the tropical Pacific toward El Niño-like mean conditions62, which, in addition to the global warming trend, may induce a warming of 2–4.5 °C in the Humboldt Current System65. The Late Miocene warming is similar, qualitatively and quantitatively, to the projected future warming, and may thus hold the clues to future changes in the hyperarid western South America. Due to high uncertainties in future precipitation regimes worldwide66, and to the complex interactions between climate, vegetation, and soil moisture67,68, the future response of vegetation in western South American Desert cannot be robustly predicted. The Late Miocene greening of the coastal desert, however, suggests (i) that changes in the tropical Pacific temperature and dynamics in greenhouse climate conditions may increase convective precipitation and in this region counterbalance the global strengthening of the evaporation-precipitation pattern1, and (ii) that this increased moisture source may be sustained by climate–vegetation feedbacks. Our results thus support the possibility of an increase in rainfall and vegetation cover in the Peruvian Desert, which, if well managed, may benefit the living conditions of millions.

Methods

Study material and chronology

We recovered 55 leaves, three wood fragments and nine palynological samples from sediments of the coastal to shallow-marine Pisco Formation, that crop out in the AGL Locality within the Sacaco area (15.5°S,74.8°W). The AGL occurs 2 km off the Panamerican Highway and about 10 km east of the village of Puerto Lomas (Supplementary Fig. 1).

The Pisco Formation is interpreted to have been deposited across open or protected shoreface and offshore shelf environments affected by waves and coastal upwelling30,69. The succession at AGL consists of alternations of gray and yellow sandstones, locally interbedded with shell banks, chert, and diatomaceous layers. Strata are mostly (sub)horizontal (So = 120/7, n = 20) with little to no structural deformation, resulting in a ~ 200 m-thick composite section34,70. The AGL includes five well-known fossiliferous horizons, namely, from oldest to youngest, Pascual, Sirenios, Hueso Blanco, the Water Wells, and Cerro Vildoso32. Studied samples were collected from sediments associated with the Hueso Blanco Zone and the Water Wells Zone (Supplementary Figs. 1, 2).

The sedimentary facies associated with the AGL lack any signs of alluvial, fluvial, or estuarine deposits (Supplementary Fig. 2). Sedimentary structures exhibit characteristics consistent with coastal environments, including evidence of bioturbation and sedimentary structures indicative of coastal to shallow-marine settings33,34,70, such ripple marks, cross-beddings, and occasional hummocky cross-stratification (Supplementary Fig. 2). These features, along with the extensive marine (vertebrate and invertebrate) paleontological record support a depositional environment influenced by marine processes rather than local riverine or alluvial processes30,32,69,70. Additionally, there are no relict geomorphological features, such as incised paleovalleys, paleochannels or deltas, that indicate past fluvial transport or riverine incision.

The lowermost Pisco strata at the AGL locality have been radiometrically dated as Tortonian (9.2 ± 0.08 Ma; 59), while the uppermost Pisco beds are Messinian in age based on diatom-derived biostratigraphic data (younger than 6.3 Ma but older than 5.6 Ma70). Two K-Ar radiometric dates, associated with the guide beds LM6 and LM7 of Brand et al.34, underlie the leaf-bearing levels and provide a maximum age for the collected material between ~8.8 and 7.9 Ma (samples 44 and 45 in ref.71. The youngest age for the studied material is estimated from a U/Pb radiometric age from a 50 cm-thick white ash layer located toward the upper part of the Water Wells Zone, about 5 m below the uppermost pollen sample (6.1 Ma; sample MG6-24; Supplementary Fig. 1).

Palynological analyses

Eighteen palynological samples were studied, nine of Miocene age and nine of Holocene. Miocene pollen samples were retrieved from leaf-bearing levels and upper AGL beds. Five of these samples were collected from the same levels as the macrobotanical remains, while the remaining four were taken from the Water Wells Zone in the AGL (Supplementary Fig. 1).

Holocene samples were used as benchmark of present-day vegetation. Holocene material originates from recent shallow-water sediments from the Paracas Bay and the G-10 gravity core drilled along the Peruvian continental margin72. Two samples correspond to the oldest sediments recovered from a 40 cm box core from the Paracas Bay (−13.81, −76.28; water depth: 10 m) dated around 1970–1975 A.D. The remaining seven Holocene samples were collected from the G-10 gravity core (−14.3833; −76.3983; 312 m water depth) during the Galathea-3 expedition72. All palynological samples were prepared using standard palynological techniques73. Whenever possible, at least 200 palynomorph grains were counted per sample (see Supplementary Data S2).

The final palynological dataset included 4860 occurrences of terrestrial palynomorphs (2413 of Miocene age and 1906 of Holocene age), classified into 288 morphospecies across 18 palynological samples (Fig. 2; Supplementary Data S2). Palynomorphs were identified by referring to regional pollen atlases and reference collections74,75,76,77. Where possible, updated botanical affinities at the family or genus level were conservatively assigned to the identified palynomorphs using the Rpackage taxize78 (last update 09/2024). After assigning botanical affinities, the identified palynomorphs were categorized into broad taxonomic categories (i.e., angiosperms, gymnosperms, pteridophytes, and unknown ID), by photosynthesis pathway (i.e., C3, C4, CAM, and mixed), and by their growth habit (i.e., arboreal, shrubs, herbs, ferns, and unknown habit; See Supplementary Fig. 3 and Supplementary Data S2).

Diversity and palynofloral composition in comparison with Holocene samples

Standing diversity values per sample were estimated after performing a range-through assumption, and reassessed following elimination of singletons; Fig. S3). Palynofloral standing diversity shows significantly higher diversity values in the Miocene than in the Holocene (Fig. 3A). A similar pattern is observed in within-sample diversity, using rarefied grain counts. At a count of 150 grains (Fig. S3), Miocene samples have a significantly higher number of species (mean = 42.8) compared to those from the Holocene (mean = 28.3, p value = 0.009, df = 6.56). However, the Simpson Index, which assesses the likelihood of randomly selecting two individuals belonging to the same species79, shows no difference between the Miocene (mean = 0.9) and the Holocene (mean = 0.89, p value = 0.36, df = 11.291), indicating an even distribution of species abundance in both assemblages.

The Chao dissimilarity index (CDI), cluster analysis, and detrended correspondence analyses reveal distinct palynofloral compositions between Miocene and Holocene samples (Fig. 3A, B; Fig. S4). Furthermore, the CDI median between the Miocene and Holocene floras, with a value of 0.49 (Fig. 3D), significantly differs from the within-mean values of the Miocene (0.35; 95% CI [0.2782, 0.4276], p value <0.001) and the Holocene (−0.43; 95% CI [−0.5016, −0.3522], p value <0.001).

Fossil wood anatomy

We collected three wood fragments, two 10 cm silicified pieces from the water wells zone and a 20 cm fragment containing Teredolites bioerosion from the leaf-bearing beds (MG8-01; −15.5050, −74.8154; Fig. 2). Transverse, radial longitudinal, and tangential longitudinal petrographic thin sections were prepared from the silicified wood pieces for taxonomic identification.

Taxonomic descriptions followed IAWA terminology and ID numbers80. Quantitative characteristics were evaluated measuring a minimum of 25 random vessels per cross and tangential section (Supplementary Data S4). Vessel and ray density were calculated as the average of ten field view samples (at 10 and 20× magnifications. Other anatomical characters, such as ray length or fiber wall thickness, were assessed from ten to 25 random measurements.

A Neltuma (ex Prosopis) botanical affinity was defined based on results from the inside wood database as follows: 25p, 30p, 42p, 47p, 66p, 79p, 83p, 98p, 104p, and 143p81. Diagnostic characters observed in the fossil Neltuma (ex Prosopis) include distinct wood growth rings marked by marginal bands of parenchyma, presenting a diffuse-porous pattern. Solitary vessels with occasional 2–3 radial multiples; simple perforation plates; intervessel pits alternate, ranging from minute to small; non-septate fibers; paratracheal vasicentric, slightly aliform and confluent; non-storied rays, strongly homocellular, and 2–4 seriate, and 2–4 cells per parenchyma strand. Common sclerotic tyloses and prismatic crystals were present and abundant in chambered axial parenchyma cells (Fig. 2; Fig. S5; Supplementary Data S4).

Botanical affinity was compared with Neltuma (ex Prosopis) and other Fabaceae genera including Caesalpinia, Inga, Parkinsonia, Senna, Tara, and Vachelia, using modern reference samples and photographs from the Inside Wood database. Non-metric multidimensional scaling analyses (Euclidean distance) and cluster analyses (Ward´s method, bootstrap n:100) were used to investigate taxonomic distances and relationships between fossil and reference material. A full wood description, taxonomic measurements, and comparisons are provided in Supplementary Data S4. Petrographic thin sections of silicified wood are housed at the Universidad Peruana Cayetano Heredia (Lima, Peru).

Angiosperm leaves

We collected 55 leaf specimens, comprising compressions and impressions, from AGL (MG8-01; −15.505, −74.815). Leaf morphology descriptions follow the terminology of Ellis et al.82. No evidence of insect herbivory was observed. Most specimens retained details of leaf shape and venation up to the fourth order (n = 47). We determine a few reliable, yet conservative, botanical affinities based on unique foliar characters corresponding to the living family assigned. These affinities are considered conservative estimates due to the limited number of observed characters (Supplementary Data S5). Because of the unresolved taxonomic affinities of the macrofossil material, no floristic composition analysis was performed based on macrofloral remains. Collected leaves are housed at the Universidad Peruana Cayetano Heredia (Lima, Peru). Each specimen retains a collection number using the prefix “MG8-01-XX”, where the last two digits represent the specimen number.

MAP estimates based on leaf physiognomic characteristics

Leaf size as MAP predictor

(Method 1—“observed leaf size”) The natural log of leaf sizes (MlnA) is positively correlated with the natural log of MAP42,83. To apply this relation, a summatory of proportional sizes by discrete leaf size categories must be first calculated (MlnA = Σaipi). To do so, all dicot specimens are classified into five discrete leaf size categories: leptophyll (<25 mm2), nanophyll (25–225 mm2), microphyll (225–2025 mm2), notophyll (2025–4500 mm2), and mesophyll (4500–18,225 mm2). The number of leaves per category is counted, and the leaf size proportion for each category (pi) is calculated based on these counts. This proportional value is then multiplied by the mean natural log areas (ai) of the respective five leaf size categories (i.e., ai: 2.12, 4.32, 6.51, 8.01, and 9.11; see Supplementary Table S1). The resulting values are summed to calculate the mean natural log of leaf size (MlnA = Σaipi). MAP estimates are then derived using regional (Tropical Africa-Bolivia, Tropical Africa-Bolivia + (sub)tropical Western Hemisphere39,42 and global regression models between MlnA and MAP41.

Vein density as leaf size predictor

(Method 2—“predicted size”) Estimating leaf sizes from fragments can introduce bias due to incomplete material, potentially leading to errors in the MAP estimation. To mitigate this, we also used 2Dv, a visible trait on the leaf laminae that is correlated with leaf size84. This trait allowed us to objectively estimate fossil leaf sizes by calculating mean 2Dv values. When possible, up to three circular regions of known area (a), ranging from 0.023 to 0.1 cm2, were selected in each laminae to calculate an average 2Dv value per specimen. Each circular region was photographed, and all veins present were manually traced. Their total length (L) was measured using the Topview software, and 2Dv average values were calculated as 2Dv = L/a85. The leaf size per specimen (cm2) was then predicted using the Eq. (1) developed by Sack et al.43.

$${Ln}\left({\mathrm{leaf\; area}},\,{{\mathrm{cm}}}^{2}\right)=1.96\,+\,2.04* {\mathrm{ln}}\,(2{\mathrm{Dv}})$$
(1)

Predicted sizes were used to calculate a new mean natural log of leaf size (MlnA) value as described above, which was subsequently applied to assess MAP values using regional and global equations39,41,42.

Performance of equations in hyperarid settings

We first evaluated the effectiveness of regional and global equations in estimating present-day MAP values in (hyper) arid environments, acknowledging that extremely low rainfall values may fall at the limit or even outside the range of these equations. For this, we utilized 366 georeferenced modern leaves from 33 coastal species living in xerophytic biomes (i.e., desert and dry forest) along the Peruvian coast (5–1068 masl). Leaf specimens were accessed from the National Herbarium of the Universidad Nacional Mayor de San Marcos in Lima, Peru (Supplementary Data S5). The selection of plant specimens was based on the availability of complete metadata, including taxonomic id, coordinates, altitude, site collection description, as well as the presence of at least ten well-preserved leaves. Sampling excluded individuals that grew in association with natural water bodies (rivers, streams) or in cultivated areas. Of the species sampled, 23 occur in habitats from the central Peruvian region (14–16°S), geographically close to our fossil site. The remaining 11 species belong to the DFB of northern Peru (4–7°S), which exhibits a climatic arid niche, somewhat similar to that of the Central Peruvian coastal region37. Leaf sizes and the mean natural log of leaf size (MlnA = Σaipi) were calculated using Method 1 (as described above), and modern MAP estimates were obtained again using the mentioned regional and global equations39,41,42. These values were then compared to historic MAP data at 5 min resolution ( ~ 10 km2) retrieved from the WorldClim database86. In all cases, the regression models overestimated modern MAP values, by up to two orders of magnitude (Supplementary Table S2).

Equation recalibration, validation, and testing

To develop a better-fitted regression suitable for (hyper)arid environments, we integrated the modern Peruvian leaf-size data—comprising 366 leaves from 33 species (previously described)—into regional and global regression models39,41,42. Although the recalibrated regression models exhibited low R-squared values, they still showed significant correlations between Ln (MAP) and Ln (leaf-size) (Supplementary Table S3). The effectiveness of these (hyper) arid-adapted equations was further tested by using a new set of georeferenced leaf-size observations from six species (52 new leaves) retrieved from the Field Museum’s online Botanical Collections Database (Fig. S6; Supplementary Data S5). These specimens were also collected from coastal Peruvian settings (8–13°S), and, as in the previous case, did not include individuals collected in proximity to water bodies or cultivated areas. The resulting MAP estimates agreed with historical MAP values retrieved from the WorldClim database (Supplementary Table S2), demonstrating that the (hyper) arid-recalibrated regression models are effective for inferring MAP in extremely dry settings (Fig. S6). Equation 2 describes the best-performing hyperarid-recalibrated regression model (r2: 0.2332, p value: 1.5e-12) relating MLnA (=Σaipi) and Ln (MAP), which was further used to assess paleoprecipitation estimates.

$${Ln}\left({MAP}\right)=-0.2357015+0.3098442* {MLnA}$$
(2)

Miocene MAP estimates

Miocene MAP values were thus estimated using two methods: observed leaf size (Method 1) and the 2Dv-predicted leaf-size (Method 2). 47 fossil leaves were used for Method 1, while only 36 fossil leaves were used Method 2, as vein density of 11 specimens (out of 47 samples) could not be measured due to their reduced size or poor preservation. Because of this, size of those specimens could only be assessed based on direct observation (Supplementary Table S1).

The best-performing hyperarid-recalibrated regression model (Eq. 2) was used to estimate the Miocene MAP (Supplementary Table S2). However, to consider uncertainty, the final MAP value was calculated by averaging the results obtained from the two methods (Table S3), integrating a bootstrapped value (n = 10,000, utilizing 95% of the data at a time) derived from Method 1 (i.e., direct size observations) with the bootstrapped value derived from Method 2 (i.e., 2Dv-predicted size). The combined result provides a robust estimate of the Miocene MAP by accounting for incompleteness, variability and uncertainty of the fossil data (Fig. S6). Leaf physiognomy, vein density measurements, leaf-derived sizes, and regression models for predicting Ln (MAP) based on leaf size are available in Supplementary Data S5.

MAP based on the climatic tolerance of coexistent NLR of identified palynofloral elements

We estimated pollen-based paleoMAP ranges following the methodology and sample criteria proposed by Martinez et al.46. Palynological relative abundances were converted to presence-absence to make Holocene and Miocene samples comparable. Only palynomorphs with known botanical affinities below the family level were chosen for this analysis (Supplementary Data S2). Georeferenced occurrences of the NLR, established based on known taxonomic affinities, were used to define climatic envelopes. Occurrences were retrieved from the BIEN database (downloaded 23/04/2024)87 using the BIEN R-package88. Due to the poor representation of biomes in the western Andean lowlands and slopes, occurrences were supplemented with over 37,000 records available in worldwide herbaria and more than 3500 published reports (Supplementary Data S6). Taxonomical status of each added record was reviewed and updated using the taxize R-package78. Historical climate information (MAP, MAT), and precipitation of the driest and wettest quarters; Supplementary Data S7) for each occurrence was obtained from the WorldClim database, using a 5 min resolution grid86.

The distribution of taxa with known affinities from the palynological dataset was described through bivariate probability density distributions (PDF) of modern taxon occurrence, using different combinations of the retrieved climatic variables (MAP, MAT, and precipitation of the driest and wettest quarters). Distributions were described through a bivariate Gaussian kernel density estimator (KDE), and reference libraries of bivariate environmental distributions per taxa (48 NLR affinities; see Supplementary Data S7) and temporal groups (Miocene versus Holocene; see Supplementary Data S8) were built. In order to have robust climate probabilities, we generate 100 subsampled KDE´s, each derived from a random 50% subset of the data, to capture variability and reduced noise. PDF retrieved significantly similar MAT and MAP the Miocene and the Holocene; while precipitation estimates for the wettest and driest quarter were found to be significantly different (Fig. S7; Supplementary Table S4). Comparison of Miocene and Holocene MAP values, however, requires caution given reduced Andean elevations during the Late Miocene (1.0–1.5 km lower46,48). Nonetheless, as occurs with the leaf physiognomy analysis, the pollen-based MAT estimate suggests enhanced Miocene precipitation.

Climatic envelopes of modern biomes

Climatic biome envelopes were defined based on the modern geographic occurrence of xeric coastal biomes. These envelopes were then used as models to predict the climatic variation range at which biomes can persist. Georeferenced occurrences were extracted from the biome distribution map (Ecoregion shapefile; Ecoregiones_SERNANP.shp) published by the Peruvian National Service for Natural Areas Protected (SERNANP) and used to define climatic envelopes for the DFB and CDB (Fig. S9). The climatic variables of each biome were extracted from the WorldClim database using a 5 min resolution grid86.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.