Abstract
Nitrous oxide is a potent greenhouse gas and the primary ozone-depleting agent of the 21st century, but marine emissions of nitrous oxide remain difficult to constrain due to their spatiotemporal variability. In the Southern Ocean, where extratropical cyclones create conditions conducive to air-sea gas flux, shipboard measurements are unlikely to capture the full extent of nitrous oxide emissions due to the impracticality of sampling said storms. Here, we use machine learning to derive nitrous oxide observations from biogeochemical Argo floats, revealing that low-pressure storms amplify air-sea gradients and create hotspots of emissions. Taking these low-pressure storms into account, rather than assuming 1 atmosphere (the standard condition outside of storms), increases the net annual Southern Ocean nitrous oxide flux by 88%. Our results imply that the Southern Ocean plays a significant role in the global nitrous oxide cycle, and may be a weaker overall sink of greenhouse gases than previously thought.
Similar content being viewed by others
Introduction
Nitrous oxide (N2O) is the leading ozone-depleting agent of the 21st century1,2,3 and a potent greenhouse gas, with a global warming potential 273 times that of carbon dioxide (CO2) on a 100-year timescale1. The ocean, a net source, plays a critical role in the global N2O budget, yet the magnitude of marine N2O emissions remains highly uncertain, particularly in dynamic, historically undersampled regions such as the Southern Ocean4,5,6. This uncertainty limits our ability to assess the impact of marine N2O emissions on climate, because even trace amounts of N2O emitted from the ocean may play a major role in offsetting marine CO2 uptake in terms of radiative forcing7,8. Furthermore, this counteracting effect may be directly stimulated by enhanced biological activity, such as that triggered by natural or intentional ocean iron fertilization7,8.
The Southern Ocean is a major contributor to global marine N2O emissions, yet estimates of its contribution have varied widely. Initial studies suggested that the Southern Ocean accounted for 24–43% of total marine N2O emissions4,5,9, but recent work revised the Southern Ocean flux down to only 19% of global emissions6. These inconsistencies leave a wide range of estimates of the Southern Ocean N2O source (0.8–1.6 Tg N/year; Table 1), making it difficult to assess its significance as a greenhouse gas source to the atmosphere. Furthermore, previous estimates relied on sparse sampling, spatial averaging, and/or climatological monthly means of air-sea N2O disequilibrium, and are thus unlikely to capture the sub-seasonal scales of variability that are important to Southern Ocean biogeochemistry and air-sea gas exchange10,11,12.
Here, we apply machine learning models to high space- and time-resolution Biogeochemical Argo (BGC-Argo) in situ sensor data to investigate a key driver of sub-seasonal variability: the passage of strong storms through latitudes 50–65°S every 4–8 days13,14,15. The barometric pressure in the centers of extratropical cyclones can drop to as much as 8% below 1 atm (Fig. S1–S12; S2,4), creating conditions favorable for N2O outgassing. Recent work on storm-driven CO2 fluxes demonstrated that storms significantly enhance air-sea gas exchange, driving high CO2 fluxes through turbulent entrainment of the deeper inorganic carbon reservoir16,17,18. Under-sampling storm-driven CO2 fluxes, which occur on timescales of days, can distort seasonal and annual air-sea CO2 flux estimates11,12. Previously impossible due to the lack of observational coverage, we can now quantify the effect of storms on Southern Ocean N2O fluxes using BGC-Argo data, and thus better constrain the role of the Southern Ocean in greenhouse gas exchange with the atmosphere.
Results and discussion
Float-based observations of N2O flux
To project sparse N2O measurements onto distributed sensor data, we trained machine learning models on the Global Ocean Ship-based Hydrographic Investigations Program (GO-SHIP) N2O dataset (n = 1968 profiles; Fig. S13), then applied these models to predict N2O concentrations from parameters measured by profiling BGC-Argo floats (n = 28,112 profiles; Fig. S14). BGC-Argo floats provide year-round, distributed measurements of ocean biogeochemistry, and, along with gliders, have transformed our understanding of Southern Ocean CO2 emissions16,17,19. Because BGC-Argo floats cannot measure N2O directly, we turned to supervised learning algorithms, which have been used to map chemical distributions of iodide20, methane21, pCO222, and surface N2O saturation6. Supervised learning algorithms are a natural framework for modeling the sharp gradients in marine N2O concentrations because, unlike multiple linear regression, they are robust to non-linearity and perform well on the smaller datasets inherent to labor-intensive ocean sampling23. We found that dissolved N2O is highly predictable from dissolved oxygen, conservative temperature, absolute salinity, and nitrate concentration (Fig. S15). Based on a test dataset comprised of profiles withheld from model training, our models predicted the seawater partial pressure of N2O (\(p{N}_{2}{O}_{{sw}}\)) with R² ≥ 0.91, and root mean squared error ≤ 21 natm in the Southern Ocean (Fig. S15).
Storms drive hotspots of N2O flux
Applying these models to BGC-Argo float profile data revealed that high sea-to-air N2O fluxes tend to be collocated with coherent low-pressure centers with high wind speeds (Fig. S1–S12). The air-sea partial pressure disequilibrium of N2O is defined as:
where \(p{N}_{2}{O}_{{sw}}\) is the partial pressure of N2O dissolved in surface seawater, \(p{N}_{2}{O}_{{atm}}\) is the atmospheric partial pressure of N2O, P is the barometric pressure, and \(\chi {N}_{2}{O}_{{atm}}\) is the dry mixing ratio of N2O in air. Low barometric pressures in the centers of storms reduce \(p{N}_{2}{O}_{{atm}}\), widening \(\triangle p{N}_{2}O\) (Fig. 1b) and driving greater flux out of the water (Fig. 1c). In other words, the low barometric pressure in storms acts to suck gas out of the water; we term this “the Hoover Effect.”
a Barometric pressure in atmospheres (atm) corresponding to each of 28,112 float profiles. The barometric pressures displayed are the mean of ERA5 and NCEP reanalysis products that have been interpolated to the location and time of float profiles. b Air-sea nitrous oxide (N2O) pressure disequilibrium, \(\Delta p{N}_{2}O\), defined as the partial pressure of N2O in seawater (\(p{N}_{2}{O}_{{sw}}\)) minus the partial pressure N2O in the atmosphere (\(p{N}_{2}{O}_{{atm}}\)), such that positive values indicate \(p{N}_{2}{O}_{{sw}} > p{N}_{2}{O}_{{atm}}\). c Air-sea N2O flux in micromoles/meter squared/day (µmol/m2/day). Positive values indicate flux out of the water. In all subplots, points are scaled by the magnitude of air-sea N2O flux (a–c).
To quantify the integrated effect of low barometric pressure in storms on air-sea N2O fluxes, we calculated air-sea N2O fluxes with the global mean barometric pressure (1 atm) in place of the observed barometric pressures (with wind and all other inputs being equal). We compare against fluxes calculated with 1 atm, as opposed to the mean Southern Ocean barometric pressure, because the characteristically low mean Southern Ocean pressure is itself a product of frequent storm activity13. In particular, storms tend to converge at 60° S18,24, creating the upward motion that dynamically maintains the mean low pressure of the Subpolar Low25,26,27.
The difference between the fluxes based on true barometric pressure and 1 atm is of a similar order of magnitude to the flux itself (Fig. S16), indicating that barometric pressure is a primary driver of high fluxes in the centers of these storms. This result implies that low-pressure storms create conditions that promote hotspots of N2O flux, and conversely, that estimates based on smoothed barometric pressure fields or 1 atm barometric pressure may be missing a substantial proportion of the net annual Southern Ocean N2O flux. When we take these low barometric pressures, or the Hoover Effect, into account, we find that the net annual Southern Ocean N2O source is 1.6 ± 0.3 Tg N2O-N/year (Fig. 2). In contrast, when we calculate the net annual Southern Ocean N2O source with 1 atm barometric pressure, the net annual flux drops to 0.9 ± 0.3 Tg N/year, or by a margin of 50 ± 20% (Fig. 2). This observation suggests that roughly half of the total flux is driven by deviations in barometric pressure from the global mean.
Cumulative curves of the area- and time-integrated air-sea N2O fluxes associated with each BGC-Argo data point in teragrams of nitrogen per year (Tg N yr−1), sorted by barometric pressure in atmospheres (atm), and calculated with: a the observed barometric pressure (blue) and b 1 atmosphere barometric pressure (black). Cumulative curves are the average of fluxes calculated from four different combinations of wind product and gas exchange parameterization (shades of green). Small envelopes of uncertainty around each wind product-parameterization pairing (green) reflect uncertainties propagated from \(p{N}_{2}{O}_{{sw}}\), atmospheric \(\chi {N}_{2}{O}_{{atm}}\) with a Monte Carlo simulation; the overall uncertainty (light blue, gray) reflects the combined Monte Carlo uncertainties, standard deviation of the four pairings, and uncertainty from the bootstrap analysis. The histogram of barometric pressures is shown at the top of the figure.
Furthermore, low barometric pressures and high wind speeds act synergistically in storms, creating a nonlinear response. The mixed partial derivative term of the Taylor expansion of \({F}_{W14}\) (equation S4) with respect to wind speed and barometric pressure, centered on the median wind speed \({U}_{{med}}\) and 1 atm barometric pressure \({P}_{1{atm}}\), has the form:
Where U10 is the wind speed at 10 meters, \({U}_{{med}}\) is the median Southern Ocean wind speed, \({P}_{1{atm}}\) is a barometric pressure of 1 atm, \(s\) is the Henry’s Law constant, and Sc is the Schmidt number. Given that, in storms \(\left({U}_{10}-{U}_{{med}}\right) > 0\) and \(\left(P-{P}_{1{atm}}\right) < 0\), the sign of this second-order mixed partial derivative is positive, indicating that the combination of high winds and low pressure in storms produces more flux than would be expected from their individual effects alone. The Taylor expansion of \({F}_{L13}\) (eqns. S21–S23) is more complex but yields a similar result (Supplementary Text S2).
To obtain a discrete approximation of the interaction effect, we can compare the observed flux to 1) a “baseline” case calculated with 1 atm barometric pressure and median piston velocities (k); 2) a “wind effect” case calculated with 1 atm barometric pressure and observed piston velocities; and 3) a “pressure effect” case calculated with observed barometric pressures and median piston velocities (Supplementary Text S2). This empirically calculated interaction effect accounts for a significant proportion of the overall flux (Fig. 3). This result implies that the interaction of low barometric pressures and high wind speeds in storms could lead to a nonlinear response of air-sea N2O flux to increasing storm frequency28.
Cumulative curves of the area- and time-integrated air-sea N2O fluxes associated with each BGC-Argo data point in teragrams of nitrogen per year (Tg N yr−1), sorted by barometric pressure in atmospheres (atm). Fluxes are calculated with median piston velocities and 1 atm (“baseline”, dark purple) and the observed piston velocities and barometric pressures (black line). The pure pressure effect corresponds to \({F}_{{median\; k}}-{F}_{{baseline}}\) (medium purple); the pure wind effect corresponds to \({F}_{1{atm}}-{F}_{{baseline}}\) (light purple); and the interaction (palest purple) is the remainder.
The importance of storms is also evident in the small number of profiles that contribute disproportionately to the net annual flux. For example, only 5% of the float profiles comprise 34 ± 9% of the net annual flux; 10 ± 3% of profiles make up 50% of the total flux (Fig. S17). Together, these results imply that brief, intense flux events in the middle of Southern Ocean storms account for a disproportionate amount of the net annual Southern Ocean N2O emissions. Previous estimates5,6 do not capture these brief, intense flux events in storms.
Biotic vs. abiotic drivers
It is a surprising result that a difference of only up to 8% in barometric pressure can drive an 88% increase in the net annual Southern Ocean N2O flux. Carbon studies show that air-sea CO2 flux is strongly mediated by the imprint of biology, with outgassing associated with upwelling/entrainment of remineralized carbon. In contrast, for N2O, we hypothesized that storms modulate sea-to-air flux primarily via lowering the barometric pressure, as opposed to entraining the deep N2O reservoir, due to the relatively low, uniform \(p{N}_{2}{O}_{{sw}}\) in the Southern Ocean (Fig. S18). To test this hypothesis, we performed a sensitivity test holding \(p{N}_{2}{O}_{{sw}}\) constant; this test produced a much smaller change than holding the barometric pressure constant at 1 atm (Fig. S19). The importance of barometric pressure in dominating flux also means that our results are somewhat agnostic to uncertainties in the machine learning prediction: because the effect is driven by low atmospheric pressure, not high surface \(p{N}_{2}{O}_{{sw}}\), the result doesn’t depend on perfectly capturing high \(p{N}_{2}{O}_{{sw}}\) in the centers of storms.
The large impact of variations in barometric pressure on N2O flux arises from the relatively small N2O partial pressure disequilibrium in much of the Southern Ocean (Fig. 1b): the median \(p{N}_{2}{O}_{{sw}}\) in the Southern Ocean is 334 natm, while the median partial pressure disequilibrium is only 11 natm. This means that small changes in barometric pressure represent a large percentage change in pressure disequilibrium, and thus large changes in air-sea flux. In the Antarctic Southern Zone, the surface water pN2O is close enough to equilibrium with the atmosphere that the pressure effect accounts for 100%—or even reverses the direction—of air-sea flux (Figs. 5, S20).
Seasonal cycles in N2O flux
While the magnitude of air-sea flux is driven by barometric pressure, the seasonality of flux is driven by the seasonal cycle in surface seawater dissolved N2O (\(p{N}_{2}{O}_{{sw}}\)), and, to a lesser extent, wind speed. The overall seasonal cycle suggests an autumnal peak in Southern Ocean integrated flux (Fig. 4), matched by April-June flux maxima in four out of five zones (Fig. 5). Drivers of this autumnal maximum are evident from sensitivity tests performed by calculating fluxes with 1 atm barometric pressure, median piston velocities (k), and median \(p{N}_{2}{O}_{{sw}}\) (Fig. 4). Fluxes calculated with 1 atm barometric pressure have a lower magnitude but the same seasonal cycle as the observed fluxes (Figs. 4a, 5); in contrast, fluxes calculated with median k values display a muted autumnal maximum (Fig. 4b), and fluxes calculated with median \(p{N}_{2}{O}_{{sw}}\) barely contain any seasonal cycle at all (Fig. 4c). This effect likely arises from the prominent seasonal cycles in wind speed and \(p{N}_{2}{O}_{{sw}}\) in each zone (Fig. S21). In the Subantarctic Zone and Polar Frontal Zone, there are particularly strong peaks in \(p{N}_{2}{O}_{{sw}}\) in April through June (Fig. S21f, i), which correspond to the autumnal entrainment maximum in these zones29. Future work should investigate the role of circulation in bringing N2O produced at depth to the surface.
The seasonal cycle in observed net Southern Ocean N2O flux in teragrams of nitrogen per year (Tg N yr−1), integrated by month. The observed flux (solid pink line) is compared to the seasonal cycle in fluxes calculated with: a 1 atmosphere (atm) barometric pressure (dashed line in a), b median piston velocities k (dashed line in (b), c the median seawater partial pressure of N2O (\(p{N}_{2}{O}_{{sw}}\); dashed line in (c), and d without the capping effect of sea ice (dashed line in (d)). Shading around each curve indicates the uncertainty in the area- and time-weighted fluxes, which is driven primarily by the uncertainty in the choice of gas exchange parameterization. Vertical gray shading indicates winter months (March-November).
\(\Delta p{N}_{2}O\) in nanoatmopheres (natm; a, c, e, g, i) and sea-to-air flux in teragrams of nitrogen per year (Tg N/yr; b, d, f, h, j) in the Subtropical Zone (green, STZ; a, b), Subantarctic Zone (red, SAZ; c, d), Polar Frontal Zone (blue, PFZ; e, f), Antarctic-Southern Zone (orange, ASZ; g, h), and Southern Ice Zone (purple, SIZ; i, j), calculated with the observed barometric pressure (bold solid line), 1 atmosphere (atm) barometric pressure (dashed line), and without the capping effect of sea ice (thin line; does not affect \(\Delta p{N}_{2}O\)). Shading indicates the standard error of the mean \(\Delta p{N}_{2}O\) in each zone and month (a, c, e, g, i) or the uncertainty in the area- and time-weighted fluxes, which is driven primarily by the uncertainty in the choice of gas exchange parameterization (b, d, f, h, j). Note varying y-axis scales to better visualize seasonal cycles.
There is no seasonal cycle in barometric pressure in our dataset, but rather a large degree of spread throughout the year, reflecting the fact that floats encounter low-pressure centers associated with storms and intervening high-pressure ridges (Figs. S21, S22). The distribution shift towards lower pressures in the southernmost zones (Fig. S21) reflects the increasing storm density from north to south, which has been previously observed in the Southern Ocean18. Averaging over these points produces a general decline in barometric pressure moving from north to south, with the lowest barometric pressures in the Antarctic Southern Zone and Southern Ice Zone (Fig. S21j, m).
Sea ice causes fluxes to decrease over the course of austral winter in the southern ice zone; in the absence of sea ice limiting exchange, N2O fluxes in the southern ice zone could reach a September maximum of > 0.1 Tg N/year, more than twice as high as the peak flux in any other zone (Fig. 5j). Estimating the net annual flux based on peak months (April-June) vs. trough months (October-December) leads to statistically significant differences in each zone (Fig. S20), indicating the importance of sampling the full seasonal range of sea-to-air N2O fluxes in the Southern Ocean. Thus, extrapolating from summertime shipboard measurements may lead to an underestimate of the total annual flux, especially if those measurements take place during the early part of austral summer.
A higher estimate of the annual Southern Ocean N2O flux
Our float-based estimate of the net annual Southern Ocean N2O source (1.6 ± 0.3 Tg N/year) represents a significant upward revision from earlier work6 (0.8 ± 0.2 Tg N/year; Table 1). The discrepancy may be explained by the pressure effect: when using 1 atm barometric pressure, our estimate (0.9 ± 0.3 Tg N/year) is within error of previous work6 (Table 1). Previous work6 focused on predicting monthly mean N2O disequilibria (Δ\(\chi {N}_{2}O\)) with monthly mean World Ocean Atlas30 data, which may have smoothed over low-pressure centers in the centers of storms. In contrast, the BGC-Argo in situ observations used here allowed us to predict pN2O, and, thus, calculate fluxes on sub-daily timescales. Furthermore, our models were trained on both surface and subsurface data, which may have allowed them to better capture Southern Ocean winter conditions (i.e., source waters).
Our improved quantification of N2O flux suggests that the Southern Ocean is a more important component of the global N2O cycle than previously thought. Given a global N2O flux of 4.2 ± 1 Tg N/year6, our estimate indicates that the Southern Ocean represents 38 ± 12% of all marine N2O emissions, which represents a significant upward revision. An important caveat is that this estimate of the global source (4.2 ± 1 Tg N/year)6 is subject to the same uncertainties and potential aliasing as the corresponding earlier estimates of Southern Ocean N2O emissions. Our estimate also implies that N2O flux from the Southern Ocean offsets a greater proportion of CO2 uptake by the same region: using a N2O global warming potential of 2731, our updated flux predicts that Southern Ocean N2O outgassing offsets 218 ± 41 Tg CO2 uptake per year, twice the 109 ± 27 Tg CO2/year suggested previously6. Assuming an annual net Southern Ocean CO2 uptake of 0.87 Pg C/year31, this means that N2O outgassing from the Southern Ocean offsets 7% of the annual CO2 uptake by the same region.
Limitations of this analysis and avenues for future study
There are several key limitations of this analysis that could be improved with further work. Firstly, BGC-Argo oxygen measurements tend to be biased low32. A first-order correction for this bias resulted in a ~ 10% reduction in the net annual N2O flux, which is within our range of uncertainty (Supplementary Text S3). Correcting for float oxygen biases is still an area of active research32, however, so no correction was applied here. The observed reconstruction bias (“Methods”, vide infra) may have resulted in part from float oxygen bias, in addition to spatiotemporal sampling differences. Secondly, given typical ocean decorrelation length and time scales of ~160 km and ~60 days, respectively33,34, float profiles every 10 days may not be entirely independent, leading to an underestimate of the overall uncertainty.
Third, the parameterization of N2O air-sea gas exchange is still an active area of research, and the subjectivity in the choice of parameterization is one of the leading sources of uncertainty (Fig. 2). Since CO2 and N2O have similar solubilities, eddy covariance CO2 flux studies35,36,37,38 could help provide insight into N2O air-sea gas exchange, following previous work using eddy covariance to better constrain bubble fluxes of CO239,40. Future work should attempt to use the insights from these eddy covariance studies to help narrow the uncertainties due to the subjectivity of parameterization of air-sea gas exchange. Relatedly, the interactions between air-sea gas exchange and sea ice are more complex than the simple fractional correction applied here would suggest41,42,43. More work is needed on biogeochemical N2O cycling within and around sea ice itself, as well as a more nuanced treatment of how different types of sea ice (pancake ice, leads in sea ice) modulate air-sea gas exchange.
While BGC-Argo data improve spatial and temporal coverage compared to shipboard data, there are still locations where no float profiles exist, and additional data would likely improve our estimate. Although we perform a bootstrap analysis to assess uncertainty due to sparse coverage (Eq. 15), the estimated absolute flux values presented here apply only to the regions and seasons where float data are available. Missing coastal and shelf processes may be an especially important source of potential bias in our N2O fluxes. BGC-Argo provides good coverage of the open ocean but is missing coastal and shelf regions, where both emissions and uncertainties are high44; future work could take advantage of other autonomous technologies, such as gliders and moorings, to better constrain N2O emissions in the near-coastal regions of the Southern Ocean and elsewhere. There is also a need for more high-quality shipboard N2O measurements, especially in coastal regions, to improve the representativeness of the training dataset. Comparing artificially-seeded, autonomously-measured N2O fluxes to a model ground truth45 could help evaluate where additional N2O measurements would be most impactful.
Future work could also situate the air-sea N2O fluxes calculated here in the context of storm geometry. A recent study found weaker storm activity in the Atlantic sector of the Southern Ocean18; the storm masks from that study could be used to quantify the percentage of high-N2O-flux profiles that occur within defined storms, and break this down by sector to quantify whether there are indeed fewer low-pressure points in the Atlantic sector. Visually, the highest N2O fluxes appear to occur on the northern edge of each low-pressure center (Fig. S1–S12); this aligns with the same recent study, which showed that the highest wind speeds tend to occur on the northern edge of the low-pressure center with respect to storm geometry18. An analysis of N2O fluxes with respect to storm geometry could be used to quantitatively evaluate this pattern.
Potential climate feedbacks
While changes in N2O flux under different warming scenarios are difficult to predict, we can perform sensitivity tests to demonstrate the potential response of N2O flux to different climate forcings such as increasing wind stress46,47,48, decreasing barometric pressure47,49, and increasing storm frequency28. The mean barometric pressure over the Southern Ocean is predicted to change by only 0.002 atm by 210026, not enough of a change to generate a significant increase in the annual net N2O flux, but a more drastic shift in the mean barometric pressure of 0.01 atm increased the annual net N2O flux by 31% to 2.1 ± 0.4 Tg N/year (Table 1). Models tend to disagree on both the amount and distribution of Southern Ocean sea ice loss50,51, so we performed a sensitivity test removing the capping effect of sea ice in its entirety. This increased the overall flux by 25% to 2.0 ± 0.4 Tg N/year (Table 1; Fig. S23), driven primarily by strong outgassing in the Southern Ice Zone (Fig. 5). Increasing wind speeds by 25%, which is the predicted increase in Southern Ocean wind stress under RCP8.525, increased N2O flux by 38% 2.2 ± 0.7 Tg N/year (Table 1; Fig. S23). The combined effects of increasing wind speeds by 25% and removing the sea ice barrier increased the net annual flux by 81% to 2.9 ± 0.8 Tg N/year (Table 1; Fig. S23). Finally, as shown above, low barometric pressures and high wind speeds have an interaction effect in storms, which could lead to a nonlinear response of flux to increasing storm frequency. While these sensitivity tests are not simulations of future climate scenarios with associated feedbacks, they demonstrate that the Southern Ocean N2O source is a dynamic system that is poised to experience a significant ramp-up in N2O emissions as the region changes.
Methods
Datasets
We compiled the Random Forest model training dataset from N2O measurements collected during CLIVAR (Climate and Ocean: Variability, Predictability and Change) and GO-SHIP (Global Ocean Ship-Based Hydrographic Investigations Program). The GO-SHIP dataset provides full water column, quality-controlled N2O data from 2005 to 2023 (Fig. S13; Supplementary Table 3). We restricted our training dataset to the GO-SHIP program due to the high level of quality control applied to the N2O measurements and accompanying hydrographic and nutrient measurements taken from the same Niskin bottle. GO-SHIP data were filtered to retain only samples for which the CTD pressure, temperature, salinity, and oxygen were flagged as good (flags 2, 6, and 7), the nitrate and N2O water samples were flagged as good (flags 2, 6, 7, and 8), and the Niskin bottle was flagged as good (2). This filtering left 35,664 data points from 1968 unique profiles from 24 cruises in the training dataset. GO-SHIP data from the full water column (as opposed to just the surface) were included in model training to better capture the conditions and water masses from which N2O in the Southern Ocean originates.
BGC-Argo is a global, autonomously drifting float array with in situ sensors for temperature, salinity, dissolved oxygen, nitrate, and other biogeochemically relevant parameters52. BGC-Argo floats sample for these parameters from 2000 m depth to the surface every 10 days, then transmit the data back to data assembly centers on land52. For this study, BGC-Argo float data were obtained from a snapshot of the BGC-Argo dataset from October 9th, 202453. The float dataset was limited to data points where [O2] and nitrate concentration ([NO3−]) were both available; excluding [NO3−] from the predictor features would increase the number of float profiles available for this analysis, which could be explored in further studies. Float data were filtered to retain only delayed-mode-adjusted pressure, temperature, salinity, [O2], and [NO3−] with quality flags 1 (“good”), 2 (“probably good”), 5 (“value changed”), and 8 (“interpolated, extrapolated, or other estimation”). Float data were further restricted to points south of 30°S. While the full water column was considered in model training, only surface points were retained in the BGC-Argo dataset because the parameter of interest was air-sea flux. To handle the ice avoidance algorithm, which prevents floats from rising shallower than 15–20 m when under ice, surface points were extracted by taking the shallowest point in each profile that was also shallower than 30 m. This left 28,112 surface data points from an equivalent number of unique profiles from 325 total floats in the Southern Ocean, starting in 2008 (Fig. S14). Floats were assigned to zones based on climatological mean locations of Southern Ocean fronts19,54 (Fig. S14a). The effective decorrelation length scales, calculated on a monthly mean basis, ranged from 160 to 240 km per float, with the best coverage in the Southern Ice Zone (Fig. S14b). Effective decorrelation time scales ranged from 0.04 to 0.06 days per float. While each float samples every 10 days, they sample at random intervals, providing much finer temporal resolution in aggregate55.
The atmospheric mixing ratio of N2O (\({{\rm{\chi }}}\)N2Oatm) was obtained from NOAA flask dataset measurements at the Palmer Station (64.6°S, 64°E) and South Pole (90°S, 0°E) monitoring stations56. Assuming that the atmosphere is zonally well-mixed, the Southern Ocean monthly mean was estimated by weighting each station by the cosine of its latitude (meaning that \({{\rm{\chi }}}\)N2Oatm was primarily driven by the Palmer Station). Monthly mean \({{\rm{\chi }}}\)N2Oatm were linearly interpolated in time to the date and time of each float profile.
Barometric pressure and wind speed were obtained from surface-level ERA557 and NCEP38 reanalysis data, matching each float profile to the nearest 6-hourly ERA5 and NCEP values in space and time (Figs.1; S4b, S4c). Fractional sea ice cover was obtained from ERA5, because NCEP sea ice cover is only 0 or 138. NCEP winds tended to be higher than ERA5 winds (Fig. S24a), while the barometric pressures from the two datasets were more similar (Fig. S24b).
Derived variables
Absolute salinity (SA), potential temperature (PT), conservative temperature (θ), density (ρ), potential density anomaly (σθ), oxygen saturation, and spiciness were calculated with the Thermodynamic Equation of Seawater 2010 (TEOS-10) Python implementation58. Apparent oxygen utilization (AOU) was calculated from the dissolved oxygen concentration ([O2]) and solubility of oxygen at the in situ temperature and salinity59. The solubility of N2O in seawater was calculated based on ref. 60 using an updated version of a gas exchange toolbox61. The partial pressure of N2O, the target variable for the machine learning models, was calculated from:
where \(p{N}_{2}{O}_{{sw}}\) represents the seawater partial pressure of N2O in natm, [N2O] is the concentration of N2O in nmol kg−1, \(\rho\) is the in-situ density in kg/m3, and \({k}_{H}\) is the Henry’s Law constant in mol m−3 atm−1. Additional derived variables included the day of the year, transformed as cos(2πDOY/365.25) to eliminate the discontinuity at the end of each calendar year, and longitude, transformed into two different predictors following refs. 62,63 as cos(π(LONGITUDE – 110°E)/180) (“LON1”) and cos(π(LONGITUDE – 20°E) /180) (“LON2”) to ensure that regions of maximum explanatory power occurred over the oceans.
Random forest model tuning and training
We trained random forest regressor models to predict \(p{N}_{2}{O}_{{sw}}\) from variables measured by BGC-Argo floats. Prior to random forest model training, a “test” dataset was generated by randomly withholding 20% of profiles from the full GO-SHIP dataset (Fig. S13). A random 20% of profiles, as opposed to 20% of data points, were withheld to ensure that inherent correlations among the data points from a single profile did not artificially boost the apparent effectiveness of the machine learning algorithm. The test dataset was completely withheld from the model selection, hyperparameter tuning, and training process (vide infra), to be used to evaluate each random forest model’s performance on unseen data after all training and validation steps.
The “validation” dataset was generated by withholding 20% of the data points from the remaining 80% of profiles. The validation dataset was used during the model selection and hyperparameter tuning process to fine-tune the model and prevent overfitting, helping to choose the best model parameters. This left 64% of the training dataset for model training during feature selection and hyperparameter tuning.
Six random forest regressor models, based on six candidate sets of predictor variables (Fig. S25), were tested as potential contributors to the final model ensemble. Instead of testing all possible combinations of predictor features, predictor features were chosen that were likely to provide information about water masses and N2O production mechanisms, and whose range in the training dataset covered the full dynamic range of values in the BGC-Argo dataset. Additionally, a model containing a larger number of predictor features was included in the model selection process as a point of comparison. To assess the uncertainty in each candidate model’s performance, a K-fold cross-validation was performed by training each model five times, withholding five distinct sets of data (folds) of validation data such that each point appeared in the validation dataset exactly once. Models were trained using the RandomForestRegressor from sci-kit-learn64, with 600 decision trees following ref. 62 and a minimum of one sample per leaf to maximize model performance. For each of the five folds, model R2, root mean square error (RMSE), and mean absolute error were obtained by applying the model to the validation dataset.
To quantify reconstruction bias45, candidate models were then applied to the BGC-Argo dataset to predict \(p{N}_{2}{O}_{{sw}}\), and the resulting median Southern Ocean surface \(p{N}_{2}{O}_{{sw}}\) was compared to the median Southern Ocean surface \(p{N}_{2}{O}_{{sw}}\) in the GO-SHIP dataset. Reconstruction bias was calculated as the difference between these two values:
where \({{median}(p{N}_{2}O)}_{{BGC}}\) is the median Southern Ocean surface \(p{N}_{2}{O}_{{sw}}\) predicted from BGC-Argo float data in natm, and \({{median}(p{N}_{2}O)}_{{GO}-{SHIP}}\) is the median Southern Ocean surface pN2O in the GO-SHIP dataset. This procedure was repeated five times per candidate model, for each of the five folds, to obtain an uncertainty in that model’s bias.
The K-fold cross-validation revealed that adding more predictor features lowered RMSE but increased reconstruction bias (Fig. S25), suggesting that the models with a greater number of features were overfitting to spurious correlations within the training data, diluting the influence of core oceanographic features, and/or decreasing the effective sample size per feature65,66. Thus, to minimize overfitting and bias, we chose the simplest four models—including only absolute salinity, conservative temperature, either [O2] or AOU, and [NO3−]—to include in the final model ensemble. Because each Random Forest model consists of 600 decision trees, our approach is essentially an ensemble of ensembles.
The hyperparameters for each of the four chosen models were tuned by manually searching ranges of minimum samples per leaf, the number of decision trees, and minimum samples per split to find the combination of hyperparameters for each model that minimized that model’s RMSE in the Southern Ocean. The optimum values for minimum samples per leaf and minimum samples per split were one and two, respectively, so these were used for final model training. The number of estimators had minimal impact on model performance, so we used 600 estimators for final model training to minimize overfitting.
Using these hyperparameters, each of the four chosen models was re-trained on the combined training and validation dataset (e.g., 80% of stations that were not withheld as test data). Model R2, RMSE, and MAE were then evaluated on the test data (e.g., the 20% of stations that were initially withheld). [O2] and AOU were the most important predictor variables; the exclusion of [NO3−] made little difference in the Southern Ocean, indicating that future work could exclude this variable to access a greater number of float profiles (Fig. S15). The models tended to underpredict extremely high values of \(p{N}_{2}{O}_{{sw}}\) (Fig. S15b, e, h, k), but this problem does not persist in the Southern Ocean data, which span a much smaller range of \(p{N}_{2}{O}_{{sw}}\) (Fig. S15c, f, i, l). To expand this analysis to global scales, more high-quality training data are needed from oxygen-poor upwelling regions where these high values of pN2O are found.
Application of random forest models to BGC-Argo data and associated uncertainties
Each of the four final trained models was applied to the surface BGC-Argo dataset in the Southern Ocean to predict \(p{N}_{2}{O}_{{sw}}\) from the temperature, salinity, dissolved oxygen, and nitrate measured by the BGC-Argo floats. This produced four datasets of predicted \(p{N}_{2}{O}_{{sw}}\) based on the four random forest models.
To generate a true prediction interval around model predictions for unseen points, it is necessary to consider uncertainty due to training dataset noise (aleatoric uncertainty) in addition to model uncertainty due to limited training data and/or model architecture (epistemic uncertainty). Based on tests with injected artificial noise (Supplementary Text S3), we approximate uncertainty due to training dataset noise with Mean Absolute Error, such that
where \({\Delta }_{{residual}}\) is the residual variance of the training data, or the uncertainty due to the inherent “noise” or randomness of the data themselves, and \({{MAE}}_{i}\) is the mean absolute error of model i for the test dataset.
Because a Random Forest consists of an ensemble of decision trees, each of which excludes a certain number of training data points, it is designed to handle this sort of uncertainty. Techniques such as Quantile Regression Forests67 take advantage of the Random Forest model architecture by calculating not just the mean output of all the decision trees but also different quantiles. We take a simple version of this approach, taking the standard deviation of all of the decision tree predictions as the uncertainty around a given model’s output:
where \({\sigma }_{{trees} , i}\) is the standard deviation of the 600 decision tree predictions for a given point by model i:
where \({p{N}_{2}O}_{j , i}\) is the “leaf,” or prediction by tree j in model i. \({\overline{p{N}_{2}O}}_{i}\) is the mean predicted pN2O by model i for a given point, equal to the mean of all 600 decision trees:
We found that initializing the Random Forest model with a different set of predictor features produces different final results. Thus, in our analysis, we use four different Random Forest models initialized with four different sets of predictor variables. Because each Random Forest model consists of 600 decision trees, our approach is essentially an ensemble of ensembles. The disagreement between the mean predictions by the four models tends to exceed the disagreement between the decision trees for an individual model. So, as a final source of uncertainty, we include the uncertainty of the model ensemble:
where \({\sigma }_{{ensemble}}^{2}\) is the disagreement between the four Random Forest models in the ensemble, and \({\overline{p{N}_{2}O}}_{{ensemble}}\) is the mean prediction of the four Random Forest models in the ensemble:
Together, the combined uncertainty in the predicted pN2O at a given point is calculated from:
Where \({\Delta }_{{residual}}^{2}\) is the uncertainty due to noisiness in the training data, \({\Delta }_{{trees}}^{2}\) is the uncertainty due to each model’s architecture, and \({\sigma }_{{ensemble}}^{2}\) is the squared standard deviation of predictions by the four Random Forest models in the ensemble.
Air-sea flux
The air-sea flux of N2O was calculated according to two wind speed-based parameterizations of air-sea gas exchange (Supplementary Text S1). The first of these parameterizations68 has a quadratic dependence on wind speed that includes bubble-mediated exchange implicitly in the nonlinear component of the wind speed dependence. In the second parameterization69, diffusive and bubble-mediated gas exchange are modeled explicitly with linear and cubic dependences on wind speed, respectively.
Uncertainty in air-sea flux
Uncertainties in individual air-sea fluxes were propagated from 1) uncertainties in wind speed; 2) uncertainties in \(\chi {N}_{2}{O}_{{atm}}\) and \(p{N}_{2}{O}_{{sw}}\):
where \({\triangle F}_{i}^{2}\) is the uncertainty in air-sea N2O flux associated with float profile i. To quantify uncertainties due to wind speed, fluxes were calculated with combinations of two gas exchange parameterizations (Supplementary Text S1) and two wind products (ERA557 and NCEP70). The uncertainty due to wind speed was estimated as the standard deviation of fluxes calculated from these four methods:
where \({F}_{i , W14 , {ERA}5}\) denotes the flux calculated from ref. 68 and ERA5 winds57 for individual float profile i.
The uncertainty \({\Delta F}_{{N}_{2}O , i}\) due to uncertainties in \(\chi {N}_{2}{O}_{{atm}}\) and predicted \(p{N}_{2}{O}_{{sw}}\) was calculated using a Monte Carlo simulation, accounting for covariances between different gas exchange parameterizations and wind speed products. For each profile i, 1000 realizations of \(p{N}_{2}{O}_{{sw}}\) were sampled from a normal distribution with µ = \(\overline{p{N}_{2}O}\) and σ = \({\Delta }_{p{N}_{2}O}^{2}\) and 1000 realizations of \(\chi {N}_{2}{O}_{{atm}}\) were sampled from a normal distribution with µ = the monthly mean atmospheric mixing ratio of N2O and a standard deviation derived from the errors published in the NOAA flask dataset35. Then, from these 1000 values, four flux estimates were calculated with different wind product-parameterization pairs. The combined uncertainty incorporates both the Monte Carlo uncertainties within each method (parameterization-wind product pair) and the covariance between methods:
where \({Cov}({F}_{j , i} , {F}_{k , i})\) is the covariance between methods j and k for profile i. The summation is over all parameterization-wind product pairs. \({Cov}({F}_{j , i} , {F}_{k , i})\) is calculated from the 1000 Monte Carlo fluxes:
where \({F}_{j , i , m}\) is the flux for profile i, method j, Monte Carlo iteration iter, and \(\overline{{F}_{j , i}}\) is the mean flux for profile i, method j. It was assumed that the uncertainties in sea ice cover were negligible compared to the uncertainties in the other parameters.
The uncertainty in the net annual flux from the Southern Ocean \(({\Delta F}_{t})\) was calculated as the sum in quadrature of 1) the random uncertainties in individual fluxes; 2) the standard deviation of the net annual flux calculated from four different parameterization-wind product pairs; and 3) the uncertainty due to sparse sampling, calculated from bootstrapping:
where \(\Delta {F}_{{t , N}_{2}O}^{2}\) is the cumulative sum of the individual uncertainties \({\Delta F}_{{N}_{2}O , i}\) from the Monte Carlo analysis:
Where \({A}_{{zone}}\) is the area of the zone, and \({D}_{{month}}\) is the number of days in the month, in which a profile occurs.
The term \(\sigma {F}_{t , {winds}}\) is the standard deviation of the cumulative fluxes calculated from each parameterization-wind product pair:
where
where \({F}_{i , W14 , {ERA}5}\) denotes the flux calculated from ref. 68 and ERA5 winds57 for individual float profile i.
To evaluate the uncertainty associated with interpolating over locations where no float profiles exist, we performed a bootstrap analysis. Repeated this 1000 times, we randomly selected a subset of the floats, re-integrated the fluxes from those floats in space and time to account for fewer floats per zone and month, and summed the resulting integrated fluxes. We then obtained an uncertainty from the standard deviation of the 1000 bootstrap sums. As expected, this uncertainty increased with decreasing bootstrap sample size (Fig. S26); we chose a bootstrap sample size of 162 floats (50% of the dataset), which yielded \({\Delta F}_{t , {bootstrap}}\,\)= 0.06 Tg N/year.
In our estimate of the overall Southern Ocean N2O flux, \(\sigma {F}_{t , {winds}}\) or the subjectivity in the choice of gas exchange parameterization and wind product, are the leading sources of uncertainty, with much smaller contributions from \(\Delta {F}_{t , {N}_{2}O}^{2}\), and \({\Delta F}_{t , {bootstrap}}\) (Fig. 2).
Data availability
The processed GO-SHIP N2O data, BGC-Argo float data with interpolated ERA5 and NCEP-NCAR Reanalysis 1 data, N2O fluxes, and figure data used in this study are available via Zenodo under https://doi.org/10.5281/zenodo.17904981 [https://doi.org/10.5281/zenodo.17904981]71. The original GO-SHIP data were obtained from CLIVAR and Carbon Hydrographic Data Office (CCHDO) [https://doi.org/10.6075/J0CCHNNN]72. The original BGC-Argo float data were obtained from a snapshot of the BGC-Argo dataset from October 9th, 2024 [https://doi.org/10.17882/42182]53. The original NCEP-NCAR Reanalysis 1 data was provided by the NOAA PSL, Boulder, Colorado, USA, from their website at [https://psl.noaa.gov]70. The original ERA5 hourly reanalysis datasets were downloaded from the Copernicus Climate Change Service (C3S) Climate Data Store at [cds.climate.copernicus]57. Figures containing maps were created using Cartopy with coastline data from Natural Earth (naturalearthdata.com, CC0 license).
Code availability
The gasex-python library for air-sea gas exchange calculations is available via Zenodo under https://doi.org/10.5281/zenodo.15132888 [https://doi.org/10.5281/zenodo.15132888]61. Python scripts for Random Forest model training, pN2O prediction from BGC-Argo float data, and air-sea N2O flux calculations are publicly available and archived at Zenodo under https://doi.org/10.5281/zenodo.17905126 [https://doi.org/10.5281/zenodo.17905126]73.
References
Smith, C. et al. The Earth’s Energy Budget, Climate Feedbacks, and Climate Sensitivity Supplementary Material. in Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change (eds Masson-Delmotte, V. et al.) (Cambridge University Press, Cambridge, United Kingdom, 2021).
Ravishankara, A. R., Daniel, J. S. & Portmann, R. W. Nitrous oxide (N2O): the dominant ozone-depleting substance emitted in the 21st century. Science 326, 123–125 (2009).
Wuebbles, D. J. Nitrous oxide: no laughing matter. Science 326, 56–57 (2009).
Nevison, C. D., Weiss, R. F. & Erickson III, D. J. Global Oceanic emissions of nitrous oxide. J. Geophys. Res. Oceans 100, 15809–15820 (1995).
Nevison, C. D. et al. Southern Ocean ventilation inferred from seasonal cycles of atmospheric N2O and O2/N2 at Cape Grim, Tasmania. Tellus B Chem. Phys. Meteorol. 57, 218–229 (2005).
Yang, S. et al. Global reconstruction reduces the uncertainty of oceanic nitrous oxide emissions and reveals a vigorous seasonal cycle. Proc. Natl. Acad. Sci. 117, 11954–11960 (2020).
Law, C. S. & Ling, R. D. Nitrous oxide flux and response to increased iron availability in the Antarctic Circumpolar Current. Deep Sea Res. Part II Top. Stud. Oceanogr. 48, 2509–2527 (2001).
Jin, X. & Gruber, N. Offsetting the radiative benefit of ocean iron fertilization by enhancing N2O emissions. Geophys. Res. Lett. https://doi.org/10.1029/2003GL018458 (2003).
Suntharalingam, P. & Sarmiento, J. L. Factors governing the oceanic nitrous oxide distribution: Simulations with an ocean general circulation model. Glob. Biogeochem. Cycles 14, 429–454 (2000).
Prend, C. J. et al. Sub-seasonal forcing drives year-to-year variations of southern ocean primary productivity. Glob. Biogeochem. Cycles 36, e2022GB007329 (2022).
Monteiro, P. M. S. et al. Intraseasonal variability linked to sampling alias in air-sea CO2 fluxes in the Southern Ocean. Geophys. Res. Lett. 42, 8507–8514 (2015).
Sutton, A. J., Williams, N. L. & Tilbrook, B. Constraining Southern Ocean CO2 flux uncertainty using uncrewed surface vehicle observations. Geophys. Res. Lett. 48, e2020GL091748 (2021).
Wei, L. & Qin, T. Characteristics of cyclone climatology and variability in the Southern Ocean. Acta Oceanol. Sin. 35, 59–67 (2016).
Lin, X., Zhai, X., Wang, Z. & Munday, D. R. Mean, variability, and trend of Southern Ocean wind stress: role of wind fluctuations. https://doi.org/10.1175/JCLI-D-17-0481.1 (2018).
Carranza, M. M. et al. When mixed layers are not mixed. storm-driven mixing and bio-optical vertical gradients in mixed layers of the Southern Ocean. J. Geophys. Res. Oceans 123, 7264–7289 (2018).
Nicholson, S.-A. et al. Storms drive outgassing of CO2 in the subpolar Southern Ocean. Nat. Commun. 13, 158 (2022).
Carranza, M. M. et al. Extratropical storms induce carbon outgassing over the Southern Ocean. Npj Clim. Atmospheric Sci. 7, 1–16 (2024).
Turner, K. E., Krasting, J. P., Russell, J. L. & Stouffer, R. J. Uncovering the seasonality of storm-driven southern ocean heat and carbon uptake. https://doi.org/10.1175/JCLI-D-24-0642.1 (2025).
Gray, A. R. et al. Autonomous biogeochemical floats detect significant carbon dioxide outgassing in the high-latitude Southern Ocean. Geophys. Res. Lett. 45, 9049–9057 (2018).
Sherwen, T. et al. A machine-learning-based global sea-surface iodide distribution. Earth Syst. Sci. Data 11, 1239–1262 (2019).
Weber, T., Wiseman, N. A. & Kock, A. Global ocean methane emissions dominated by shallow coastal waters. Nat. Commun. 10, 4584 (2019).
Landschützer, P. et al. A neural network-based estimate of the seasonal to inter-annual variability of the Atlantic Ocean carbon sink. Biogeosciences 10, 7793–7815 (2013).
Breiman, L. Random Forests. Mach. Learn. 45, 5–32 (2001).
Turner, J., Marshall, G. J. & Lachlan-Cope, T. A. Analysis of synoptic-scale low pressure systems within the Antarctic Peninsula sector of the circumpolar trough. Int. J. Climatol. 18, 253–280 (1998).
Edmon, H. J., Hoskins, B. J. & McIntyre, M. E. Eliassen-Palm cross sections for the troposphere. https://journals.ametsoc.org/view/journals/atsc/37/12/1520-0469_1980_037_2600_epcsft_2_0_co_2.xml (1980).
Held, I. M. & Hoskins, B. J. Large-scale eddies and the general circulation of the troposphere. in Advances in Geophysics vol. 28 3–31 (Elsevier, 1985).
Moon, W., Lee, S. P., Vanderborght, E., Manucharyan, G. & Dijkstra, H. An idealized model for the spatial structure of the eddy-driven Ferrel cell in mid-latitudes. Preprint at https://doi.org/10.5194/egusphere-2025-1004 (2025).
Chang, E. K. M. Projected significant increase in the number of extreme extratropical cyclones in the southern hemisphere. J. Clim. 30, 4915–4935 (2017).
Pellichero, V., Sallée, J.-B., Schmidtko, S., Roquet, F. & Charrassin, J.-B. The ocean mixed layer under Southern Ocean sea-ice: seasonal cycle and forcing. J. Geophys. Res. Oceans 122, 1608–1633 (2017).
Reagan, J. R. et al. World Ocean Atlas 2023. NOAA National Centers for Environmental Information (2024).
Zhong, G. et al. The Southern Ocean carbon sink has been overestimated in the past three decades. Commun. Earth Environ. 5, 1–12 (2024).
Bushinsky, S. M. et al. Offset between profiling float and shipboard oxygen observations at depth imparts bias on float ph and derived pCO2. Glob. Biogeochem. Cycles 39, e2024GB008185 (2025).
Johnson, G. C., Lyman, J. M. & Purkey, S. G. Informing deep argo array design using argo and full-depth hydrographic section data. https://doi.org/10.1175/JTECH-D-15-0139.1 (2015).
Johnson, G. C., Mahmud, A. K. M. S., Macdonald, A. M. & Twining, B. S. Antarctic bottom water warming, freshening, and contraction in the eastern Bellingshausen basin. Geophys. Res. Lett. 51, e2024GL109937 (2024).
Butterworth, B. J. & Miller, S. D. Air-sea exchange of carbon dioxide in the Southern Ocean and Antarctic marginal ice zone. Geophys. Res. Lett. 43, 7223–7230 (2016).
Dong, Y. et al. Direct observational evidence of strong CO2 uptake in the Southern Ocean. Sci. Adv. 10, eadn5781 (2024).
Landwehr, S. et al. Using eddy covariance to measure the dependence of air–sea CO2 exchange rate on friction velocity. Atmospheric Chem. Phys. 18, 4297–4315 (2018).
Yang, M. et al. Natural variability in air–sea gas transfer efficiency of CO2. Sci. Rep. 11, 13584 (2021).
Reichl, B. G. & Deike, L. Contribution of Sea-state dependent bubbles to Air-Sea carbon dioxide fluxes. Geophys. Res. Lett. 47, e2020GL087267 (2020).
Yang, M. et al. Global Synthesis of Air-Sea CO2 Transfer Velocity Estimates From Ship-Based Eddy Covariance Measurements. Front. Mar. Sci. https://doi.org/10.3389/fmars.2022.826421 (2022).
Else, B. G. T. et al. Wintertime CO2 fluxes in an Arctic polynya using eddy covariance: Evidence for enhanced air-sea gas transfer during ice formation. J. Geophys. Res. Oceans https://doi.org/10.1029/2010JC006760 (2011).
Prytherch, J. et al. Central Arctic Ocean surface–atmosphere exchange of CO2 and CH4 constrained by direct measurements. Biogeosciences 21, 671–688 (2024).
Butterworth, B. J. & Else, B. G. T. Dried, closed-path eddy covariance method for measuring carbon dioxide flux over sea ice. Atmospheric Meas. Tech. 11, 6075–6090 (2018).
Resplandy, L. et al. A synthesis of Global Coastal Ocean greenhouse gas fluxes. Glob. Biogeochem. Cycles 38, e2023GB007803 (2024).
Heimdal, T. H., McKinley, G. A., Sutton, A. J., Fay, A. R. & Gloege, L. Assessing improvements in global ocean pCO2 machine learning reconstructions with Southern Ocean autonomous sampling. Biogeosciences 21, 2159–2176 (2024).
Fyfe, J. C., Saenko, O. A., Zickfeld, K., Eby, M. & Weaver, A. J. The role of poleward-intensifying winds on southern ocean warming. J. Clim. 20, 5391–5400 (2007).
Deng, K. et al. Changes of Southern Hemisphere westerlies in the future warming climate. Atmospheric Res 270, 106040 (2022).
Bracegirdle, T. J. et al. Twenty first century changes in Antarctic and Southern Ocean surface climate in CMIP6. Atmospheric Sci. Lett. 21, e984 (2020).
Neme, J., England, M. H. & McC. Hogg, A. Projected changes of surface winds over the Antarctic continental margin. Geophys. Res. Lett. 49, e2022GL098820 (2022).
Holmes, C. R., Bracegirdle, T. J. & Holland, P. R. Antarctic Sea ice projections constrained by historical ice cover and future global temperature change. Geophys. Res. Lett. 49, e2021GL097413 (2022).
Shu, Q. et al. Assessment of sea ice extent in CMIP6 with comparison to observations and CMIP5. Geophys. Res. Lett. 47, e2020GL087965 (2020).
Bittig, H. C. et al. A BGC-Argo Guide: planning, deployment, data handling and usage. Front. Mar. Sci. https://doi.org/10.3389/fmars.2019.00502 (2019).
Argo. Argo float data and metadata from Global Data Assembly Centre (Argo GDAC) - Snapshot of BGC Sprof data files as of October 09th, 2024. SEANOE (2024).
Prend, C. J. et al. Indo-Pacific sector dominates southern ocean carbon outgassing. Glob. Biogeochem. Cycles 36, e2021GB007226 (2022).
Johnson, K. S. & Bif, M. B. Constraint on net primary productivity of the global ocean by Argo oxygen measurements. Nat. Geosci. 14, 769–774 (2021).
Dutton, G. S. et al. Combined atmospheric nitrous oxide dry air mole fractions from the NOAA GML halocarbons sampling network, 1977-2024. National Oceanic and Atmospheric Administration (NOAA) Earth System Research Laboratory (ESRL) https://doi.org/10.15138/GMZ7-2Q16 (2024).
Hersbach, H. et al. ERA5 hourly data on single levels from 1940 to the present. Copernicus Climate Change Service (C3S) Climate Data Store (CDS) https://doi.org/10.24381/cds.adbb2d47 (2023).
Fernandes, F. et al. TEOS-10/GSW-python. thermodynamic equation of seawater - 2010 (TEOS-10) (2023).
Garcia, H. E. & Gordon, L. I. Oxygen solubility in seawater: better fitting equations. Limnol. Oceanogr. 37, 1307–1312 (1992).
Weiss, R. F. & Price, B. A. Nitrous oxide solubility in water and seawater. Mar. Chem. 8, 347–359 (1980).
Manning, C., Nicholson, D. (Roo) & Kelly, C. L. First release of gasex-python v0.0.1. https://doi.org/10.5281/zenodo.15132889 (2025).
Sharp, J. D. et al. GOBAI-O2: temporally and spatially resolved fields of ocean interior dissolved oxygen over nearly 2 decades. Earth Syst. Sci. Data 15, 4481–4518 (2023).
Bittig, H. C. et al. An Alternative to Static Climatologies: Robust Estimation of Open Ocean CO2 Variables and Nutrient Concentrations From T, S, and O2 Data Using Bayesian Neural Networks. Front. Mar. Sci. https://doi.org/10.3389/fmars.2018.00328 (2018).
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Domingos, P. A few useful things to know about machine learning. Commun. ACM 55, 78–87 (2012).
James, G., Witten, D., Hastie, T., Tibshirani, R. & Taylor, J. An Introduction to Statistical Learning: With Applications in Python. https://doi.org/10.1007/978-3-031-38747-0 (Springer International Publishing, Cham, 2023).
Meinshausen, N. Quantile regression forests. J. Mach. Learn. Res. 7, 983–999 (2006).
Wanninkhof, R. Relationship between wind speed and gas exchange over the ocean revisited. Limnol. Oceanogr. Methods 12, 351–362 (2014).
Liang, J.-H. et al. Parameterizing bubble-mediated air-sea gas exchange and its effect on ocean ventilation. Glob. Biogeochem. Cycles 27, 894–905 (2013).
Kalnay, A. E. et al. The NCEP/NCAR 40-year reanalysis project. Bull. Am. Meteorol. Soc. 77, 437–471 (1996).
Kelly, C. Data and analysis products for: low-pressure storms drive nitrous oxide emissions in the Southern Ocean. Zenodo https://doi.org/10.5281/zenodo.17904982 (2025).
CCHDO Hydrographic Data Office. CCHDO hydrographic data archive, version 2024-11-01. UC San Diego library digital collections. https://doi.org/10.6075/J0CCHNNN (2024).
Kelly, C. L. ckelly314/ml-argo-n2o: v1.0.0 - Initial Release: ML-based N₂O flux estimation in the Southern Ocean. Zenodo https://doi.org/10.5281/zenodo.17905127 (2025).
Acknowledgments
We gratefully acknowledge Channing Prend for many conversations about Southern Ocean physical oceanography and BGC-Argo data. Training data were collected and made publicly available by the U.S. Global Ship-based Hydrographic Investigations Program (U.S. GO-SHIP; https://usgoship.ucsd.edu/) and the programs that contribute to it. BGC-Argo data were assembled or collected and made available by the Global Ocean Biogeochemistry Array (GO-BGC) Project funded by the National Science Foundation (NSF grant #OCE-1946578). Development of the BGC-Argo parquet dataset used in this work was supported by NSF grant #OAC-2311383 to D.P.N. C.L.K. was supported by a Doherty Postdoctoral Scholarship funded by the Woods Hole Oceanographic Institution and a U.S. GO-SHIP Postdoctoral Fellowship funded by the National Science Foundation (NSF grant # OCE-2023545). B.X.C. was supported by NSF grant #OCE-2048518. D.P.N.’s contribution to this work was sponsored by the National Science Foundation’s Global Ocean Biogeochemistry Array (GO-BGC) Project under NSF grant #OCE-1946578 with operational support from NSF grant #OCE-2110258. A.M.M. acknowledges partial support from NSF grants #OCE-2023545 and #OCE-1923387. A.F.E. was supported by the 2022 Cooperative Institute for Climate, Ocean, and Ecosystem Studies (CICOES) Summer Internship Program.
Author information
Authors and Affiliations
Contributions
C.L. Kelly, B.X. Chang, and D.P. Nicholson conceived the study. B.X. Chang collected much of the machine learning training data. C.L. Kelly and B.X. Chang processed, analyzed, and curated the training data. C.L. Kelly, A.F. Emmanuelli, and E.R. Park developed the machine learning models. D.P. Nicholson provided critical insights on BGC-Argo data and air-sea gas exchange. D.P. Nicholson and A.M. Macdonald supervised the work. C.L. Kelly wrote the manuscript with input from all authors.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Damian Leonardo Arévalo-Martínez, Yuanxu Dong and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Kelly, C.L., Chang, B.X., Emmanuelli, A.F. et al. Low-pressure storms drive nitrous oxide emissions in the Southern Ocean. Nat Commun 17, 2037 (2026). https://doi.org/10.1038/s41467-026-68744-2
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41467-026-68744-2







