Introduction

Permafrost, ground (soil or rock, including ice and organic materials) remaining at or below 0 °C for at least two consecutive years1,2, underlies approximately 16% of Earth’s exposed land surface and a quarter of the Northern Hemisphere3. In recent decades, accelerated warming and intensified anthropogenic activity, particularly within Arctic and high-mountain environments, have driven widespread permafrost degradation, which is manifested by rising ground temperatures, deepening active layers, the formation of taliks, and the disappearance of sporadic or isolated permafrost4,5,6,7,8,9. The consequences of permafrost thaw are profound, threatening local-to-global systems by releasing vast quantities of greenhouse gases10,11,12, triggering abrupt landscape changes13,14,15, altering hydrological and ecological regimes16,17, and damaging critical infrastructure18,19. Mitigating these impacts requires a predictive understanding of the permafrost thermal regime, which depends on the complex interplay between atmospheric conditions, surface characteristics, and subsurface heat transfer20.

A precise characterization of the permafrost thermal state hinges on distinguishing among key near-surface temperatures: (1) the near-surface air temperature (Ta), typically measured at a screen height of 2.0 m above the ground surface; (2) the land-surface temperature (LST) measured on the canopy surface (including bare ground or bed rocks, vegetation canopy in summer, and snow surface in winter); and (3) the ground surface temperature (GST) measured at 0–5 cm depth just beneath the surface cover21. Of these, GST is the most critical as it directly governs ground thermal dynamics and serves as the upper boundary condition for subsurface heat-flow models. While Ta is a primary climatic driver in permafrost models22,23,24, its influence is modulated by the surface characteristics. Similarly, LST, increasingly available via remote sensing, is not a direct proxy for GST, and converting between them requires detailed, site-specific information that is often unavailable.

Systematic differences between the mean annual values of these temperatures give rise to two critical phenomena: the surface offset (SO) and thermal offset (TO). These offsets are fundamental for understanding permafrost-climate relationships and for accurately modelling permafrost distribution and evolution20,25,26. Conceptually, the SO bridges the gap from mean annual near-surface air temperature (MAAT), or the mean annual land-surface temperature (MAST), to the mean annual ground surface temperature (MAGST) in two steps: from near-surface air to the land surface, and from the land surface to the ground surface. In practice, however, the SO is calculated as a single, integrated metric: SO = MAGST - MAAT27,28.

The SO arises because the ground surface is decoupled from atmospheric temperature extremes by an array of moderating factors, such as snow cover insulation in winter and evaporative cooling or vegetation shading in summer20,25. Consequently, the spatial variability of the SO is fundamentally controlled by the surface energy balance, which is strongly influenced by site-specific characteristics, including vegetation, snowpack properties, organic layer thickness, soil moisture, surface water, hydrological processes, and microtopography29. To parameterize this complex relationship in models, the N-factor, the ratio of ground surface to air thawing/freezing indices, is commonly used. This factor provides a practical tool for linking MAAT and MAGST by capturing how surface characteristics modulate thermal energy transfer in permafrost regions.

The TO, originally described by Burn and Smith30 and Romanovsky and Osterkamp31, represents the phenomenon where the temperature at the top of the permafrost (TTOP) is typically lower than MAGST (TO = TTOPMAGST)32. This offset is primarily caused by the pronounced difference between the thermal conductivity of frozen (Kf) and thawed (Kt) soils in the active layer. Because Kf typically exceeds Kt, particularly in moist soils where ice is more conductive than water31, heat is conducted away from the permafrost table more efficiently in winter than it is conducted toward it in summer. This asymmetry results in a net cooling effect at the bottom of the active layer and thus at the permafrost table itself33,34. However, this equilibrium-based explanation is insufficient under modern climate change. Under such disequilibrium conditions, the observed TO also reflects transient thermal lags, complicating the direct inference of permafrost presence from MAGST alone. The magnitude of the TO is therefore highly variable, depending on soil-moisture content, ground-ice content, organic content, and the overall thermal regime. This leads to distinct TO values in different permafrost environments, ranging from cold-humid (ice-rich and cold) permafrost31,35 to warm, ice-rich permafrost; and from ultraxerous (ice-poor but cold) permafrost35,36 to warm and relatively dry permafrost25,37,38. These types manifest differently across the three poles: for instance, the cold-humid type is widespread in Arctic tundra lowlands, while warm, ice-rich permafrost is common in sub-Arctic regions. The ultraxerous permafrost in the surrounding Antarctica is cold but with limited ground-ice content, while the warm-dry permafrost in the Third Pole has relatively high temperature and less ground ice content25,35,39,40.

Despite the recognized importance of the SO and TO, a comprehensive, cross-polar synthesis assessing the universality of their drivers remains elusive. Research has often been confined to specific regions or permafrost types, leaving a critical knowledge gap: it is unknown whether the established drivers operate uniformly across disparate climate envelopes or if their relative importance shifts under the non-equilibrium conditions of rapid warming. This uncertainty fundamentally limits our ability to build robust, globally applicable permafrost models. Here, we address this gap by presenting the first comprehensive, data-driven assessment of SO and TO across the Arctic, Antarctica, and the Third Pole. Specifically, our aims are to: (1) establish a global baseline for SO and TO variability across these contrasting polar environments; (2) quantify and compare the relative importance of key environmental drivers, identifying both common principles and critical regional differences that may challenge existing paradigms; and (3) use these findings to highlight the limitations of current equilibrium-based assumptions and inform the development of next-generation permafrost models.

Results and discussion

A global baseline reveals contrasting polar offsets

The SO is highly heterogeneous across the selected Arctic and sub-Arctic permafrost sites, ranging from −0.5 to 8.4 °C with a mean of 3.1 ± 0.3 °C (CV = 0.7) (Figs. 1 and 3). While the warm-dry permafrost of the Third Pole exhibits a similar mean SO of 3.2 ± 0.2 °C but with markedly lower spatial variability (CV = 0.3). The distinct pattern of SO differs considerably from those typically found in high-latitude Arctic environments41, and it likely reflects the Third Pole’s mid-latitude setting and consistent dwarf vegetation or even bare ground, which lacks the high latitudes’ extreme seasonal contrasts in daylight and annual air temperature range. Antarctica’s ultraxerous permafrost presents a third distinct regime; the combination of the lowest MAAT (−11.1 ± 1.9 °C) and MAGST (−10.2 ± 1.9 °C) results in a much smaller mean SO of 0.9 ± 0.2 °C, which is nonetheless the most variable relative to its mean (CV = 0.8) (Figs. 1 and 2).

Fig. 1: Permafrost thermal state profiles across the three poles.
figure 1

This figure presents the permafrost thermal state profiles (numbers represent the mean ± standard error), with example sites shown: (a) Arctic (e.g. Arctic Village site, Forest tundra); (b) Third Pole (e.g. CLP site, Grassland); and (c) Antarctica (e.g. Scott Base site, No vegetation). ALT, MAAT, MAGST, TTOP, SO, and TO denote active layer thickness, mean annual air temperature, mean annual ground surface temperature, mean annual temperature at the top of permafrost, surface offset, and thermal offset, respectively. The photograph in (a) is courtesy of the Global Terrestrial Network for Permafrost (GTN-P). The photograph in panel (b) was taken by the authors. The photograph in panel (c) is provided by the USDA Natural Resources Conservation Service and is credited to Cathy Seybold.

Fig. 2: Surface offset (SO), thermal offset (TO), mean annual temperature at the top of permafrost (TTOP), mean annual ground surface temperature (MAGST), and mean annual air temperature (MAAT) in different permafrost regions across the Earth.
figure 2

The forest plot shows the mean (dot) and 95% confidence interval (horizontal line) for each sub-region.

Fig. 3: Spatial variability of surface offset (SO) in permafrost regions across the three poles.
figure 3

Each point represents a monitoring site of permafrost thermal state (see Fig. 8 for locations) in the Arctic, Antarctica, and the Third Pole. Point color indicates the SO (°C). Point transparency is used to reduce visual overlap in areas with high site density. The map uses a Plate Carrée projection.

These pan-polar patterns are further modulated by regional climate and landscape characteristics. For example, high SO is characteristic of forested areas of Alaska and Canada compared to the high Arctic tundra of Greenland or Svalbard (Fig. 2). The highest regional mean SO in cold-humid Arctic permafrost regions were observed in Alaska (4.6 °C), likely driven by strong surface insulation from thick winter snowpacks and dense vegetation, which create a significant contrast between a low MAAT of −7.1 °C and a considerably higher MAGST of −2.8 °C. Conversely, small SO is characteristic of maritime climates, such as those in Iceland, Norway, and Svalbard, where MAAT and MAGST are more closely coupled. It is important to note that while Antarctica exhibits the lowest mean temperatures, similarly cold and even colder thermal regimes exist in non-Antarctic regions, particularly at high elevations and in the high-latitude continental interiors of the Arctic.

The TO demonstrates even more pronounced divergence among the three poles. The cold-humid Arctic is characterized by a predominantly negative mean TO of −0.5 ± 0.2 °C (ranging from −4.2 to 2.4 °C) and high variability (CV = 2.3), with the most negative values (−0.6 °C to −1.0 °C) concentrated in continental North America, Russia, and Svalbard sites (Figs. 2 and 4). This classic negative offset, however, is absent in the other polar regions. Sites in Greenland and the Nordic region show mean values clustering near zero. While the warm-dry Third Pole exhibits a mean TO near zero (−0.2 ± 0.1 °C), this represents a reversal of the typically negative TO found in many Arctic regions. The TO range is also much narrower, from −1.5 °C to 2.0 °C, than in the Arctic. Notably, 72% of the study sites in the Third Pole exhibit positive TO, representing a reversal of the classic TO signal, which is a phenomenon consistent with observations from the cold, dry (ultraxerous) permafrost of East Antarctica42. Despite a slightly positive mean TO of 0.1 ± 0.1 °C, Antarctica exhibits the highest spatial heterogeneity of TO (CV = 8.0). This extreme variability is closely linked to the continent’s prevailing conditions of low air temperature (−11.1 ± 1.9 °C), thin snow depth (0.03 ± 0.02 m), and negligible soil moisture (0.13 ± 0.03 m3/m3). Collectively, this tri-polar grouping, from the strong negative TO in the continental Arctic to the near-zero or positive values in the Third Pole and Antarctica, powerfully underscores the sensitivity of subsurface heat transfer to both large-scale climate and local substrate properties34.

Fig. 4: Spatial variability of thermal offset (TO) in permafrost regions across the three poles.
figure 4

Each point represents a permafrost thermal state monitoring site, corresponding to locations presented in Fig. 8 and distributed across the Arctic, Antarctica, and the Third Pole. Point color indicates the TO (°C), revealing the direction of the offset: positive values signify that the permafrost top is warmer than the ground surface on average, while near-zero and negative values indicate it is colder. Simultaneously, point size corresponds to the absolute magnitude, |TO|, thereby illustrating the intensity of the subsurface thermal gradient deviation from a simple conductive regime. Point transparency is used to reduce visual overlap in areas with high site density. The map uses a Plate Carrée projection.

Surface offset is governed by large-scale climate forcing

SO is independent of permafrost presence, although the ground thermal regime influenced by permafrost can affect SO, which is strongly mediated by surface characteristics that control the energy exchange between the atmosphere and the ground25,36,43,44. To untangle the hierarchy of controls on the SO, we employed a machine learning approach quantified by SHAP values. Our analysis reveals that total annual precipitation is the overwhelmingly dominant global driver of SO (mean absolute SHAP = 0.81), highlighting the primary role of large-scale water input in setting the ground’s thermal state (Fig. 5). Geographic location, evaporation, and vegetation density serve as significant, albeit secondary, predictors.

Fig. 5: Influence of environmental drivers on the thermal offset (TO) and surface offset (SO), quantified using SHAP values.
figure 5

The violin plots show the distribution of SHAP values for the primary drivers influencing (a) the TO and (b) the SO. Each point represents a single observation; its position on the x-axis is its SHAP value, indicating the direction and magnitude of that driver’s contribution to the model prediction. The violin shape visualizes the probability density of SHAP values for each driver, with vertical dashed lines marking the 25th, 50th (median), and 75th percentiles. The drivers (features) are ranked vertically by their global importance, calculated as the mean absolute SHAP value across all observations. The color of each point corresponds to the feature’s value for that observation, with darker blue indicating lower feature values and lighter purple/white indicating higher values, as shown in the color bar. Feature abbreviations are as follows: SNOW_DENSITY, annual snow density; WIND_SPEED, 10 m annual wind speed; ALBEDO, annual surface albedo; ALT, active layer thickness; ELEVATION, site elevation; SWC1-4, annual volumetric soil water content at different depths (1: 0–10 cm, 2: 10–40 cm, 3: 40–100 cm, 4: 100–200 cm); EVAP annual evaporation from vegetation transpiration, LAI_HV annual leaf area index of high vegetation, LON site longitude, LAT site latitude, PRECIP total annual precipitation, SNOW_DEPTH annual snow depth, SNOW_COVER annual snow cover fraction.

While large-scale climate sets the baseline, local surface characteristics, most notably vegetation cover and snow regimes, ultimately modulate the magnitude of the SO by controlling the surface energy balance20,25. This results in a thermal decoupling between the atmosphere and the ground surface, a process strongly amplified by local conditions. In the Arctic, this heterogeneity is driven by the complex interplay of vegetation, snow, and soil properties. For example, ecosystems like coniferous forests generally enhance summer shading and trap insulating snow, which keeps the ground warmer than the near-surface air on an annual basis45. These responses are not unique to specific regions, as similar SO magnitudes have been reported in parts of the western Canadian Arctic46,47. In contrast, ecosystems with low-stature or sparse vegetation like tundra or polar deserts often display small or even negative SO (e.g., −0.1 °C at Chukochya River site, Russia) (Fig. 3), primarily driven by efficient radiative cooling and the reduced insulating effect of wind-redistributed snow. Furthermore, the insulating capacity of the snowpack is highly dependent on its physical structure; the light, low-density snow characteristic of the Arctic provides more effective insulation than the denser, more compacted snowpack often found in boreal regions. The ecosystems like forest-tundra also show smaller SO (e.g., 0.4 °C at Kevo (Valsjeaggl 1) site, Nordic region), particularly in deciduous stands where canopy loss after leaf fall reduces snow interception and winter insulation, while summer evapotranspirative cooling can also lower ground surface temperatures. Moreover, soil moisture and water table levels affect ground temperatures, with warmer conditions typically observed in drier soils and cooler temperatures in wetter soils due to differences in thermal conductivity. This thermal effect of soil moisture is less pronounced in conifer-dominated stands, where canopy structure is largely retained through winter.

In the Third Pole, vegetation is a primary local factor influencing the thermal regime of the ground surface. The magnitude of SO is a function of vegetation structure (cover density and height), which modulates the surface energy balance48. Despite the often-uniform appearance of sparse dwarf vegetation, the effect of vegetation on SO is complex and varies with microenvironment. For instance, the SO in sparse alpine steppe (e.g., 3.1 °C at ELH site) is notably higher than in alpine swamp meadow (e.g., 2.2 °C at CLP2 site) (Fig. 3). Similarly, grassland-covered sites have a larger SO (3.2 °C) compared to sparsely vegetated dwarfk surfaces (SO = 2.9 °C) or bare ground sites (SO = 2.5 °C). A key feature distinguishing the Third Pole from the Arctic, however, is the diminished insulating role of its typically thin and ephemeral snow cover, a result of strong winds and high solar radiation promoting sublimation49. This, in turn, reduced snow influence amplifies the importance of other subsurface processes, particularly seasonal freeze-thaw cycles. In the region’s widespread discontinuous and sporadic permafrost, where ground temperatures are close to 0 °C, these phase transitions involve substantial latent heat exchange. This process effectively buffers ground temperature fluctuations and thus strongly influences the SO magnitude. The strength of this buffering, in turn, depends on active layer thickness, substrate type, and ice content50,51. The spatial variability of this latent heat buffering, therefore, helps explain the large-scale patterns observed in SO. For example, lower SO (e.g., 1.3 °C at CLP3 site, 1.2 °C at ZLH1 site) is observed in the discontinuous permafrost of the Heihe and Yellow River source areas. In contrast, significantly larger SO (≥3.0 °C) is found in the colder, continuous permafrost of the Yangtze River source area (e.g., 5.8 °C at the Kunlun Basin site, 4.4 °C at TSH site) (Fig. 4).

In Antarctica, the SO is generally small, and its magnitude is strongly linked to the ambient thermal regime. For instance, extremely cold interior sites like Mt. Fleming and Victoria Valley 1 sites (near −23.0 °C) show minimal SO of 0.5 °C and 0.3 °C, respectively, while peripheral maritime environments like Limnopolar Lake site (1.6 °C) exhibit relatively larger SO. The small mean SO in the Antarctica is attributed to its unique seasonal thermal dynamics. During the austral summer, high insolation and thin snow cover allow the ground surface temperature to warm well above air temperature, creating a positive SO. Conversely, during the wintertime, a thin or non-existent snowpack provides minimal insulation, facilitating strong ground surface cooling. This can lead to a negative SO at some sites, including Bull Pass, MarblePoint, and Rothera, where the ground surface becomes colder than the overlying near-surface air. This winter cooling is intensified by a thin or absent snowpack, offering negligible thermal insulation due to strong winds and sublimation, a condition paralleling the limited insulating capacity on the Third Pole. The snowpack’s persistence is itself constrained by strong winds driving sublimation and high solar radiation promoting melt, often rendering winter-spring accumulation ephemeral44. The influence of vegetation on the ground thermal regime is minimal due to its extreme sparsity, with mosses and lichens covering < 5% of the ice-free ground surface52. This lack of plant cover prevents significant snow trapping and results in minimal thermal differences between land covers, as evidenced by the similar SO at Granite Harbour site (bare ground, 1.6 °C) and Low Head site (tundra, 1.6 °C). The absence of vegetation also means the ground moisture balance is almost entirely governed by ice and snowmelt dynamics. Therefore, this minimal biophysical regulation makes Antarctica a unique environment for studying abiotic controls on permafrost thermal states53.

A particularly striking result is the low predictive power of traditionally emphasized snow variables. This finding is counterintuitive, given that snow cover is considered a primary driver of SO differentiation in polar environments due to its insulating properties34,54,55. Conceptually, the thermal effect of snow on the SO is often quantified through the ‘nival offset’, which models how snow’s insulation modifies the surface freezing index20. However, this approach requires detailed parameters that are often unavailable at a global scale. Furthermore, our machine learning results suggest the snow-SO relationship is more complex and context-dependent than captured by a single index. To investigate this complexity, we conducted a targeted meta-analysis of the statistical relationship between key snow variables (snow depth, snow cover, and snow density) and SO across distinct biomes (Fig. 5a).

This synthesis reveals that the net effect of snow on SO is consistently negative in vegetated terrains, a relationship that is spatially variable and dependent on vegetation type. The effect was significant in both the vegetated Arctic (slope = −0.59, 95% CI: −1.04 to −0.15) and the Third Pole’s grasslands (slope = −0.45, 95% CI: −0.64 to −0.27) (Fig. 6). This pervasive negative correlation, where denser snow leads to a lower SO, indicates that snow’s direct insulating effect is often outweighed by a suite of interacting biogeophysical processes, such as summer shading from vegetation, altered melt dynamics, and modified subsurface thermal properties6,56,57. In contrast, non-vegetated terrains exhibit a different dynamic. While the relationship in Antarctica was non-significant, the non-vegetated Arctic showed a strong, significant negative effect (slope = −1.17, 95% CI: −2.12 to −0.21), notably stronger than in its vegetated counterpart. This suggests that in the absence of complex buffering from vegetation, the thermal effect of snow reverts to a more direct insulating role.

Fig. 6: Forest plot from the meta-analysis examining the relationship between snow variables and the surface offset (SO), with results grouped by vegetation type and polar region.
figure 6

The total sample size displayed in the forest plot represents only the samples from region-vegetation type combinations where a valid regression of snow-related variables (snow depth, density, and cover) on SO could be performed, excluding samples with missing data or insufficient variation within their group. Grey squares indicate the effect size (slope) from individual studies, with horizontal lines representing the 95% confidence intervals (CI). The size of each square is proportional to that study’s weight in the meta-analysis. The blue diamond shows the pooled summary effect and 95% CI for each vegetation-region subgroup and the overall meta-analysis. The vertical dashed line at zero represents the line of no effect. The red horizontal bar shows the 95% prediction interval for the overall summary effect. Heterogeneity statistics (I² and p-value) are provided for each subgroup.

Our dual-scale analysis thus reveals that while large-scale precipitation governs the overall SO, its final magnitude is finely tuned by the context-specific interactions between snow and local surface characteristics. This interplay drives substantial regional variability, reflecting critical differences in vegetation, such as shrub height in tundra areas and the contrasting thermal effects across treeline boundaries58. Fundamentally, these multi-faceted biogeophysical processes observed in vegetated terrain, where surface heterogeneity can amplify complex feedbacks between the atmosphere and ground thermal regime54,59, contrast with the more direct insulating role of snow on non-vegetated surfaces. The resulting complexity presents a significant challenge for modeling permafrost thermal regimes, as subsurface conditions, particularly latent heat effects during freeze-thaw cycles, also significantly influence how snow cover affects MAGST50,51. Therefore, improving projections of permafrost thermal response to environmental change will require integrated approaches that explicitly account for the coupled dynamics of snow distribution, vegetation, and subsurface thermal and hydrological properties54.

Thermal offset is fine-tuned by local substrate properties

The magnitude and sign of the TO are governed by the asymmetry of heat conduction during seasonal freeze-thaw cycles, a process primarily controlled by the ratio of thawed to frozen thermal conductivity (rk = Kt/Kf)50. A negative TO develops when frozen soil is more thermally conductive than thawed soil (rk < 1), a condition typical for most organic-rich soils such as peat. Organic matter enhances water retention, leading to an ice-rich, highly conductive frozen state (e.g., with a Kt of 0.6 W/mK and a Kf of 1.2 W/mK, the resulting rk is 0.5). This asymmetry causes winter heat loss through the frozen layer to be more efficient than summer heat gain through the less conductive thawed layer31,34,60, resulting in a net cooling effect at depth and causing the TTOP to be lower than the MAGST (negative TO). Consequently, sites with thick organic layers typically yield large negative TO61,62. This pattern is geographically widespread, with examples from peat sites in Kashin_01k site, Russia (−0.5 °C), Tavvavuoma site, Nordic (−1.2 °C) region, and Nunk site, Greenland (−1.9 °C), and comparable TO magnitudes reported from peatlands in Canada46,63.

Conversely, a minimal TO (near zero) occurs when thawed soil is comparably conductive to frozen soil (rk ≈ 1). This is characteristic of coarse-grained, well-drained materials like gravel, where low moisture content ensures that the thermal properties are dominated by the solid matrix20,38, a condition exemplified by the Clyde River site in Canada (TO = −0.04 °C). However, this simple conductive framework is frequently modulated by other factors. Moisture-rich, fine-grained mineral soils (e.g., silts and clays) can exhibit rk < 1 behavior similar to peat, while seasonal variations in soil moisture can alter or even reverse the expected offset37. Furthermore, non-conductive processes such as subsurface water flow can lead to positive TO, even in soil types where rk typically favors a negative offset, as observed in Greenland (Fig. 2) and at the organic-rich Azarova-1 site in Russia (1.8 °C). The ALT further modifies heat transfer by amplifying the thermal conductivity contrast, as a deeper ALT allows the seasonal temperature signal to penetrate further and magnify the net effect. This may partially explain why deep ALT in the Nordic region (350.9 ± 114.4 cm) is associated with a larger TO magnitude, whereas the shallower mean ALT in Alaska (60.0 ± 5.3 cm) and parts of Russia (78.9 ± 11.4 cm) correspond to lower TO.

Our SHAP analysis identifies a clear hierarchy of controls for TO. Local physical factors dominate, led by snow density (SHAP = 0.14), wind speed (SHAP = 0.13), and albedo (SHAP = 0.12) (Fig. 5b). Notably, soil moisture at greater depths (100-200 cm) serves as an important secondary predictor (SHAP = 0.07), whereas the direct influence of large-scale precipitation is negligible (SHAP = 0.03). This dominance of local substrate and surface properties creates distinct regional thermal regimes.

The principle that the magnitude of TO is primarily governed by the ratio of thawed to frozen thermal conductivity within the active layer20 explains the patterns observed in the Third Pole as well. Here, the prevalence of coarse-grained soils with limited organic matter and low moisture content results in comparable thermal conductivities between the frozen and thawed states, causing the rk ratio to approach unity25,37,38. Consequently, in line with the governing theory, the TO in this region is characteristically small. In contrast to moisture-rich Arctic sites where a large negative TO enhances permafrost thermal stability64, the smaller offset on the Third Pole provides less buffer against summer warming, contributing to a substantially thicker ALT (mean 157.7 ± 13.8 cm). Within this region, soil texture is a key control on spatial variability of TO. At 47% of sites dominated by sandy soils, the mean TO is small (−0.1 °C), but can become significantly positive, consistent with the ‘balch effect’ in coarse debris65. For example, the TO is 0.7 °C at the ELH site (coarse sand) but only 0.3 °C at the MDX3 site (fine-grained sand) (Fig. 4). In contrast, sites with clay soils exhibit a consistently negative TO (mean: −0.7 °C). The high water-retention capacity of clay leads to a significantly higher thermal conductivity when frozen than thawed, resulting in a large negative offset. Organic soils like peat introduce further complexity, where non-conductive processes such as subsurface water flow can counteract or even reverse the expected conductive offset. This is exemplified by the peat-covered CLP1 site, which has a positive TO of 0.4 °C (MAGST = − 2.3 °C, TTOP = − 1.9 °C), highlighting these complex interactions.

In the ultraxerous permafrost of Antarctica, local substrate properties exert a more pronounced control over the TO. Here, in hyper-arid environments where soils are predominantly coarse-grained mineral substrates, subtle variations in soil composition become a primary determinant of subsurface thermal dynamics. For example, minor differences in particle size distribution, such as the inclusion of fine sandy loam within a coarse gravelly matrix, can drive an order-of-magnitude difference in the observed TO (e.g., 1.4 °C versus 0.2–0.3 °C). Furthermore, the expected relationship between TO and ALT breaks down in this extreme environment. Although Antarctica has the lowest mean ALT (41.3 ± 5.5 cm), there is no consistent correlation with the TO. The Mt. Fleming site exemplifies this deviation: it exhibits one of the coldest thermal states globally (TTOP: −23.3 °C; MAGST: −23.5 °C) and the thinnest ALT (7.7 cm), yet its TO is slightly positive (0.2 °C). This pattern indicates that ALT alone is insufficient to explain TO variations here. Instead, Antarctic permafrost thermal dynamics appear to be governed more fundamentally by an interplay of substrate composition, the surface radiative balance, and non-conductive heat transfer within features like ice-rich debris or moraines.

While this conductive framework is well-established, its limitations in fully explaining the complexity of TO have been recognized20,66. Our data-driven analysis further confirms this, showing that its predictive power is highly context-dependent and often superseded by other processes. The meta-analysis reveals a weak, non-significant overall trend (slope = −0.007) that masks strong, opposing patterns when stratified by soil type (Fig. 7). The clearest alignment with conductive theory appears in the gravelly soils of the Third Pole, which exhibit a strong, significant negative slope (−0.50). This robustly confirms that higher moisture enhances winter heat loss as expected30. This contrasts sharply with the behavior of loamy soils, which, though theoretically prone to a negative TO, show a diametrically opposed response: a strong, significant positive slope in Antarctica (slope = 0.25) versus a weak negative trend in the Arctic. This reversal powerfully demonstrates that regional climatic regimes can override the expected thermal behavior of a given soil, likely due to the dominance of non-conductive heat transfer, such as the thermally-driven convection observed in coarse materials at Mt. Fleming site, Antarctica. Sandy soils further challenge a purely conductive view, with a consistently positive trend that suggests non-conductive processes like vapor diffusion may be a common, underappreciated mechanism in coarse-grained soils. Collectively, these findings demonstrate that the influence of moisture on TO is not monolithic. Instead, it is governed by an intricate interplay between substrate properties and the prevailing regional energy regime, which demands soil-specific parameterizations in permafrost models.

Fig. 7: Forest plot from the meta-analysis of the relationship between moisture and the thermal offset (TO), with results grouped by soil type and polar region.
figure 7

The total sample size shown in the forest plot includes only samples from groups defined by region and soil texture for which a valid regression of moisture variables (total precipitation and volumetric soil water at four depths) on TO could be performed, excluding those with missing data or insufficient variation. Grey squares indicate the effect size (slope) for individual studies, with horizontal lines representing the 95% confidence intervals (CI). The size of each square is proportional to that study’s weight in the meta-analysis. The blue diamond represents the pooled summary effect and 95% CI for each soil-region subgroup and the overall analysis. The vertical dashed line at zero represents the line of no effect. The red horizontal bar shows the 95% prediction interval for the overall summary effect. Heterogeneity statistics (I² and p-value) are provided for each subgroup.

Taken together, our analysis demonstrates that the partitioning of energy fluxes within the active layer is governed by an intricate interplay between soil texture, dynamic moisture content, and the prevailing regional energy regime. While these mechanisms are often framed in terms of steady-state conductive balance, it is equally important to recognize that many observed TO values may reflect transient, non-equilibrium conditions. In such cases, permafrost temperatures can lag behind changes in surface conditions, resulting in the persistence of warmer permafrost than predicted by equilibrium models46. Such disequilibrium effects likely contribute to the regional and soil-type variations in TO identified here. This underscores the critical need to incorporate transient thermal dynamics and phase-change lag effects into next-generation permafrost modeling frameworks to improve projections of permafrost response to environmental change67,68.

In conclusion, our cross-polar synthesis reveals that the permafrost thermal state is governed by a fundamental paradigm of scale-dependent decoupling of controls. We demonstrate that while the SO is controlled by large-scale climatic forcing like precipitation, the subsurface TO is fine-tuned by a context-dependent interplay of local substrate properties, where the influence of moisture can even reverse its thermal effect depending on the regional climate and soil texture. Failing to account for these substrate-specific, non-linear processes will lead to significant errors in projecting permafrost degradation, with major implications for global carbon cycling, hydrological connectivity, and infrastructure stability. Therefore, advancing predictive understanding of permafrost thermal dynamics requires a targeted, multi-pronged effort. This includes: (1) expanding high-resolution monitoring networks, particularly in data-sparse regions, and crucially, maintaining long-term observations to quantify interannual variability; (2) improving the process-based representation of coupled snow-vegetation-soil thermal dynamics in numerical models, including transient responses to disturbances like wildfire in relevant biomes; and (3) integrating these models with remote sensing observations to upscale site-level understanding to regional and global scales. Ultimately, bridging the gap between large-scale climatic drivers and micro-scale substrate responses is not merely an academic challenge; it is critical for accurately forecasting the fate of permafrost in a warming world and its profound consequences for the global climate, ecosystems, and human societies.

Methods

Study sites and data compilation

This study synthesizes ground temperatures across 117 sites in the Arctic, Antarctica, and the Third Pole (Fig. 8). These sites represent diverse permafrost conditions categorized for regional comparison as cold-humid (Arctic), warm-dry (Third Pole), and cold-arid/ultraxerous (Antarctica). The Arctic dataset includes sites from the north slopes of the Alaska Range (n = 18), Canada (n = 14), Russia (n = 15), Nordic countries (n = 11), Svalbard (n = 4), and Greenland (n = 4) (Fig. 8a). The Third Pole representation comprises 32 sites that exhibit warm-dry permafrost characteristics (Fig. 8b). The Antarctic dataset includes 19 sites characterized by ultraxerous permafrost conditions (Fig. 8c). We compiled data for all sites of hourly GST (0–5 cm), hourly Ta, and ALT. Additionally, to represent the thermal conditions near the permafrost table, we also included hourly ground temperature from depths selected based on the site-specific ALT. These datasets were primarily compiled from documentations, the Global Terrestrial Network for Permafrost (GTN-P) dataset, and the Circumpolar Active Layer Monitoring (CALM) program archives (see Table 1 for a complete list of data sources).

Fig. 8: Overview of permafrost distribution and selected sites.
figure 8

This figure displays the global permafrost distribution (derived from the International Permafrost Association) and highlights selected study sites across the Arctic (a), Third Pole (b), and Antarctica (c). All maps are presented using polar projections, with geographic coordinates indicated by the grid lines.

Table 1 Site information across the three poles

Temperature offsets are defined by MAGST, MAAT, and TTOP. To investigate potential drivers of variability in temperature offsets, a comprehensive set of ancillary data was collected for each site. A total of sixteen variables were compiled as predictors for our machine learning models. Twelve of these predictors were derived from the ERA5-Land daily aggregated product, for which we calculated the long-term mean over a 10-year period (2014–2023). These variables represent key environmental processes, encompassing the cryosphere characteristics (e.g., snow depth, snow density, snow cover), hydrosphere components (e.g., 0–10, 10–40, 40–100, and 100–200 cm depth soil moisture, total precipitation, evaporation from vegetation transpiration), and the land-atmosphere interface (e.g., albedo, leaf area index, and 10 m wind components). The ERA5-Land dataset, which contains a total of 50 variables describing the global water and energy cycles over land hourly at a spatial resolution of 9 km69, was accessed and processed using the Google Earth Engine (GEE) platform. Site-specific metadata provided the remaining quantitative predictors and the qualitative variables required for stratified analysis. The four quantitative predictors included the site’s geographic coordinates (latitude and longitude), elevation, and the measured ALT.

Concurrently, qualitative information on vegetation type and soil texture was compiled specifically to facilitate the subsequent meta-analysis by serving as grouping variables. To ensure consistent analysis, original site descriptions were harmonized into standardized categories. For instance, sites described as polar desert, alpine desert, or barren land were grouped into a single “No vegetation” category. Similarly, detailed soil classifications were aggregated into five primary texture classes: loam (e.g., sandy loam, silt loam), sand (e.g., loamy sand, clay sand), gravel (e.g., sandy gravel, gneiss), peat, and clay (e.g., silty clay, sandy clay).

Temperature offsets

The thermal regime of permafrost is characterized by distinct temperature offsets between the atmosphere, the ground surface, and the top of the permafrost. SO is defined as the difference between MAGST and MAAT:

$${SO}={MAGST}-{MAAT}$$
(1)

SO reflects the net thermal influence of the ground surface energy balance components, primarily modulated by surface characteristics20. In winter, insulating layers like snow cover and organic soil horizons decouple the ground surface from cold air temperatures, typically leading to MAGST > MAAT. The thickness, density, and duration of snow cover are major controls on ground surface thermal regimes. In summer, factors like vegetation cover (shading, altered albedo, evapotranspiration) and the thermal properties of the surface organic layer influence energy absorption and partitioning, further contributing to the offset.

TO refers to the difference between MAGST and TTOP:

$${TO}={TTOP}-{MAGST}$$
(2)

The temporal coverage of the data used to calculate TO and SO varied among sites and regions (Supplementary Figs. 16). TO is primarily influenced by variations in thermal conductivity between the thawed (humid) and frozen (ice-rich or dry) states of the active layer soils70. Typically, thermal conductivity is higher in the frozen state (Kf) than in the thawed state (Kt), especially in unsaturated soils. This asymmetry (Kf > Kt) leads to more efficient heat loss from soils during winter compared to heat gain during summer, resulting in a colder TTOP than MAGST. Latent heat effects associated with the phase change of water within the active layer also contribute significantly to this offset, damping temperature fluctuations near 0 °C.

Descriptive analysis of temperature offsets

To characterize the spatial distribution and variability of temperature offsets across different permafrost regions and spatial scales, several descriptive analyses and visualizations were conducted. To synthesize the thermal characteristics and their variability across different permafrost regions, descriptive statistics (mean ± standard error) were calculated for MAAT, MAGST, TTOP, TO, and SO, for sites grouped into defined sub-regions (e.g., Alaska, Russia, Greenland, Svalbard, Tibetan Plateau, Antarctica, Nordic, Canada) within the three major permafrost regions. These regional summaries were visualized using forest plots generated with the ggforestplot package (v0.1.0) to allow inter-regional comparison of mean offset magnitudes and associated temperatures. The spatial distribution of mean site-specific TO and SO values (mean across all available years per site) was illustrated on global maps, with marker size representing the absolute magnitude of the offset (|TO| or |SO|) and marker color indicating the signed value (positive or negative offset).

Furthermore, representative mean annual temperature-depth profiles from select sites within the Arctic, Antarctic, and the Third Pole were presented to visualize the vertical thermal regime and illustrate the differences in key thermal parameters (MAAT, MAGST, TTOP, ALT, SO, and TO) across the three poles. To quantify the relative variability of TO and SO within each of the three major permafrost regions, the coefficient of variation (CV) was calculated. The CV, defined as the ratio of the standard deviation (σ) to the mean (μ), quantifies relative variability and is expressed as a percentage71:

$${CV}=\frac{\sigma }{\mu }\times 100 \%$$
(3)

XGBoost and SHAP analysis of multivariate drivers

To quantify the influence of multiple environmental factors on TO and SO, we employed the XGBoost algorithm72 coupled with Shapley additive explanations (SHAP)73 for model interpretation. The cleaned and preprocessed dataset was randomly split into training (75%) and testing (25%) subsets to facilitate model development and evaluation. The training data were used to develop the models, while the testing set served as a holdout for monitoring performance and preventing overfitting via early stopping. Separate XGBoost regression models were developed to predict TO and SO using the 16 predictive variables detailed in the preceding section as input features. Models were trained using the “reg: squarederror” objective and “rmse” evaluation metric, with hyperparameters (e.g., eta = 0.1, max_depth = 6) tuned to optimize performance. Early stopping (patience = 10) was employed over up to 100 boosting rounds to prevent overfitting. An XGBoost model, as an ensemble of \(K\) decision trees (\({f}_{k}\)), maps feature \(x\) to a prediction \(\hat{y}\):

$$\hat{y}=f\left(x\right)=\mathop{\sum }\limits_{k=1}^{K}{f}_{k}(x)$$
(4)

To interpret the model predictions, SHAP values were calculated for all training samples in the training set using the predict () function with the predcontrib = TRUE argument, which implements the TreeSHAP algorithm. These values attribute the difference between a prediction \(f\left(x\right)\) and the expected output \(E\left[f\left(X\right)\right]\) to each feature \(i\), satisfying the additivity property:

$$f\left(x\right)=E\left[f\left(X\right)\right]+\mathop{\sum }\limits_{i=1}^{M}{\varphi }_{i}(x)$$
(5)

where \(M\) is the number of features and \({\varphi }_{i}(x)\) is the SHAP value for feature \(i\) and input \(x\). \({\varphi }_{i}(x)\) represents the contribution of feature \(i\) to the prediction for instance \({x}\) relative to the mean prediction. SHAP values were computed using the TreeSHAP algorithm in the xgboost package.

To assess feature importance and typical influence direction, SHAP values were summarized across training samples. Global importance of feature \(i\) was quantified by the mean absolute SHAP value:

$$Importanc{e}_{i}=\frac{1}{N}\mathop{\sum }\limits_{j=1}^{N}|{\varphi }_{i}\left({x}_{j}\right)|$$
(6)

The typical directional influence was assessed by the mean signed SHAP value:

$$Directional\,Influenc{e}_{i}=\frac{1}{N}\mathop{\sum }\limits_{j=1}^{N}{\varphi }_{i}\left({x}_{j}\right)$$
(7)

where \({N}\) is the number of training samples. These importance scores, along with mean feature values and mean signed SHAP values, were compiled to evaluate the contribution of each predictor to both the TO and SO models.

All modeling and SHAP analyses were conducted in R (v4.3.2) using the xgboost, dplyr, ggplot2, and patchwork packages.

Meta-analysis

We employed a meta-analysis approach to examine the influence of key environmental drivers on the changing magnitudes of TO and SO across different permafrost sites. Two separate analyses were conducted: one for the effect of snow-related variables (snow depth, density, and cover) on SO, and another for the effect of moisture-related variables (total precipitation and volumetric soil water at four depths) on TO. Each unique combination of site characteristics (vegetation type for the snow analysis; soil texture for the moisture analysis) within a specific geographic region was treated as an individual study in the meta-analysis. For each study, we first quantified the local driver-response relationship. Recognizing that these related predictors are often collinear, we used a principal component analysis (PCA) to derive a single composite predictor representing their dominant mode of variation. The relationship was then quantified by fitting a simple linear regression model:

$${Off\,{set}}_{j}={\alpha }_{j}+{\beta }_{j}{\rm{\cdot }}{{Predict}}_{j}+{\varepsilon }_{j}$$
(8)

where \({{Predict}}_{j}\) is either the first principal component (PC1) score or the single varying driver for study \({\rm{j}}\). In all cases, the estimated regression slope (\({\hat{\beta }}_{j}\)) and its standard error (\({\rm{se}}({\hat{\beta }}_{j}))\) were extracted to serve as the effect size and its variance estimate, respectively. Groups with insufficient data points for a stable regression were excluded from the meta-analysis.

Given the expected considerable spatial heterogeneity in permafrost thermal regimes and environmental controls across diverse regions and surface characteristics, a random-effects model was chosen for the meta-analysis. This model assumes that the true effect size (slope, \({\beta }_{j}\)) for each group \(j\) is not identical but varies around an overall mean effect (μ) with a between-group variance (\({\tau }^{2}\)), reflecting true heterogeneity beyond sampling error. The observed effect size (\({\hat{\beta }}_{j}\)) for group \(j\) is therefore modeled as:

$${\hat{\beta }}_{j}={\beta }_{j}+{\varepsilon }_{j}$$
(9)

where \({\beta }_{j} \sim N(\mu ,{\tau }^{2})\) represents the distribution of true effects, and \({{\rm{\varepsilon }}}_{j} \sim N(0,{{\rm{se}}(\hat{\beta })}^{2})\) represents the sampling error from the local regression. The random-effects model provides a pooled estimate (\(\hat{\mu }\)) that is a weighted mean of the true effects, accounting for both within-study sampling variability and between-group heterogeneity.

Heterogeneity among the true effects (\({\beta }_{j}\)) was assessed using Cochran’s Q statistic and formally quantified by the \({I}^{2}\) statistic, which estimates the percentage of total variation across groups attributable to true heterogeneity rather than sampling error. The formula for \({I}^{2}\) is:

$${I}^{2}=\frac{Q-(k-1)}{Q}\times 100 \%$$
(10)

where \({Q}\) is the Q-statistic and \(k\) is the number of groups included in the analysis.

Subgroup analysis was performed by geographic region to explore regional differences in the pooled effects for both models. All meta-analyses were conducted using the metagen function from the meta package in R (v4.3.2). Forest plots were generated to visualize the estimated effect size and confidence interval for each group, along with the pooled estimates for each regional subgroup and the overall random-effects pooled estimate and its prediction interval.