Newly reconstructed Arctic surface air temperatures for 1979–2021 with deep learning method

Ma, Ziqi; Huang, Jianbin; Zhang, Xiangdong; Luo, Yong; Ding, Minghu; Wen, Jun; Jin, Weixin; Qiao, Chen; Yin, Yifu

doi:10.1038/s41597-023-02059-5

Download PDF

Data Descriptor
Open access
Published: 15 March 2023

Newly reconstructed Arctic surface air temperatures for 1979–2021 with deep learning method

Ziqi Ma¹,
Jianbin Huang^2,3,
Xiangdong Zhang⁴,
Yong Luo^5,6,7,
Minghu Ding⁸,
Jun Wen¹,
Weixin Jin⁹,
Chen Qiao⁵ &
…
Yifu Yin⁵

Scientific Data volume 10, Article number: 140 (2023) Cite this article

4191 Accesses
4 Citations
1 Altmetric
Metrics details

Subjects

Abstract

A precise Arctic surface air temperature (SAT) dataset, that is regularly updated, has more complete spatial and temporal coverage, and is based on instrumental observations, is critically important for timely monitoring and improving understanding of the rapid change in the Arctic climate. In this study, a new monthly gridded Arctic SAT dataset dated back to 1979 was reconstructed with a deep learning method by combining surface air temperatures from multiple data sources. The source data include the observations from land station of GHCN (Global Historical Climatology Network), ICOADS (International Comprehensive Ocean-Atmosphere Data Set) over the oceans, drifting ice station of Russian NP (North Pole), and buoys of IABP (International Arctic Buoy Programme). The last two are crucial for improving the representation of the in-situ observed temperatures within the Arctic. The newly reconstructed dataset includes monthly Arctic SAT beginning in 1979 and daily Arctic SAT beginning in 2011. This dataset would represent a new improvement in developing observational temperature datasets and can be used for a variety of applications.

Deep learning-based reconstruction of monthly Antarctic surface air temperatures from 1979 to 2023

Article Open access 23 May 2025

Exceptional warming over the Barents area

Article Open access 15 June 2022

Cooler Arctic surface temperatures simulated by climate models are closer to satellite-based data than the ERA5 reanalysis

Article Open access 01 March 2024

Background & Summary

In recent decades, substantial changes have occurred in the Arctic^{1,2,3,4,5,6,7,8}. However, due to the harsh environment, there is currently no complete observation network in the Arctic yet. Prior studies^9,10,11 indicated that incomplete coverage of observations in the Arctic may lead to a cold bias in the estimation of the recent Arctic warming, and even underestimate the rate of the recent global warming. Although satellites can provide full coverage observation over the Arctic, they only measure lower tropospheric temperature rather than surface air temperature (SAT)^9,12,13. Moreover, reanalysis datasets, which are widely utilized in climate science research, also provide full coverage data over the Arctic. However, the reanalysis datasets are not the actual observations^14,15.

Improving the coverage of Arctic observations has been one of the important focuses of polar research. Martin and Munoz¹⁶ developed a 6-hour gridded Arctic SATs covering the Arctic ocean and coastal area for 1979–1993 using an optimal interpolation method (OI) based on instrumental observations. The OI analysis was further improved by Rigor et al.¹⁷, and the coverage of the constructed Arctic SAT was expanded to cover the whole Arctic. However, these Arctic SAT datasets have not been updated to represent the dramatic changes that have occurred over the past nearly two decades. Additionally, efforts were also made to increase the observational coverage in the polar regions for the global surface temperature^{9,18,19,20,21,22,23,24,25}. A variety of interpolation methods have been used to fill the Arctic’s data gaps with observations from nearby mid-latitudes, however, the resultant Arctic warming may still be underestimated due to a lack of reasonable application of instrumental observations in the Arctic¹⁰. The global surface temperature dataset NOAA GlobalTemp-Interim²⁵ has used the monthly buoy measurements from IABP¹⁷ (International Arctic Buoy Programme; https://iabp.apl.uw.edu/) to enhance the coverage of observations in the Arctic. However, a large amount of high-temporal resolution temperature records would be lost in the generation of monthly buoy observations due to buoy movement and interruptions of buoy observations.

Deep learning methods have been used in Arctic climate studies, such as sea ice forecasting^26,27,28,29. Recent studies have demonstrated that deep learnings are valuable methods for patching missing data, with partial convolution method outperforming other image inpainting technologies^{18,30,31,32,33}. Compared to conventional interpolation methods such as kriging and principal component analysis-based infilling, deep learning approaches with partial convolution can produce geographically more realistic temperatures¹⁸. In this study, we aim to develop a high-quality, regularly updated gridded Arctic SATs since 1979 by employing the deep learning with partial convolution. This new Arctic SAT data will aid in the comprehension of the Arctic climate state, Arctic climate change monitoring, model validation and interaction of Arctic and global climate.

Methods

Multi-source observations

This study aims to reconstruct the surface air temperature (SAT) in the Arctic using available instrumental observations as much as possible. Here, SAT was reconstructed based on multi-source daily observations, including SAT at 2 m (SAT at 2 m will be specified in this study, all others represent the surface air temperature) from GHCN-d³⁴ (Global Historical Climatology Network-daily) and the Russian NP (North Pole) drifting ice stations³⁵, and SAT from the surface marine observations of ICOADS Release 3.0.2³⁶ (International Comprehensive Ocean-Atmosphere Data Set, hereafter ICOADS) and the IABP (International Arctic Buoy Programme) buoy observations. In the reconstruction, air temperatures from ICOADS surface marine observations and IABP buoy observations were not corrected to 2 m, but were used directly. The above observational datasets have already undergone quality assurance reviews^34,35,36, except for IABP buoy observations. In this work, only IABP buoy observations were subjected to quality control and correction. Moreover, the Arctic is defined as the geographic area north of 60°N, whereas all above-mentioned observations from the Northern Hemisphere were utilized for reconstruction. Due to the application of an equal-area grid in the reconstruction, only the north of 30°N can be fully covered by the reconstructed SATs (more details seen in ‘Base data for reconstruction’ & ‘deep learning model and training’). The expansion of the reconstruction area to lower latitudes enables the inclusion of more climatic interactions in the reconstruction¹⁰.

GHCN-d³⁴ is developed by NOAA (National Oceanic and Atmospheric Administration), collecting near real-time updated measurements of SAT, total daily precipitation and snowfall, etc. from more than 80,000 meteorological land stations in 180 countries and territories worldwide. In 1990, 2010, 2015 and 2020, there were approximately 4000, 5000, 6400, and 6100 terrestrial stations, respectively. During these years, there were 370, 480, 560, and 540 terrestrial stations north of 60°N, respectively.

ICOADS³⁶ began in 1662, and is also developed by NOAA by combining observations from multiple sources, such as ships, moored and drifting buoys, coastal stations, and other ocean platforms, etc. ICOADS offers the most extensive surface marine meteorological observations inclusive of gridded SATs. Until the twenty-first century, these observations were sparsely distributed and primarily limited to ice-free regions.

The NP stations³⁵ recorded multiple meteorological variables, including SAT and surface temperature (ST), which were obtained from Arctic and Antarctic Research Institute (AARI). NP observations were conducted from 1937 to 1991, interrupted by the collapse of the Soviet Union, and restarted since 2003. Approximately 1–3 NP stations annually report SATs and STs over sea ice in the Arctic, however, STs were not available during summer months (May-September). These NP stations are the manned observing stations and the records are regarded as the most accurate instrumental measurements over sea ice in the Arctic.

IABP buoy observations began from 1979, which included meteorological variables such as ST, SAT, and surface pressure¹⁷. Currently, raw records of buoy’s SAT are available from at least 24 buoys annually since 2011, whereas ST records are available from at least 17 buoys annually since 1979. In this study, buoy STs from 1979 to 2010 were first converted to SAT and then used alongside other observations for the reconstruction. Figure 1 describes the schematic overview of the Arctic SAT’s reconstruction. The locations of observations north of 60° N used in the reconstruction are shown in Fig. 2.

Quality control and correction on buoy observations

Typically, buoy records are available at about fifteen minutes interval (https://iabp.apl.uw.edu). We successively converted these buoy temperature measurements into hourly, 3-hour and 6-hour data using an arithmetic mean. In these steps, a preliminary check of observations was performed. If the temperature exceeds the range of three standard deviations of all records for the same period, the data is discarded as suspicious. Further, the daily observation was produced by averaging the four successive six-hour observations in a day if all four are available; otherwise, the daily observation was set to a missing value. Then, the aforementioned daily observations underwent a screening again to eliminate erroneous observations such as flyers and “flat lined” records and records with obviously unreasonable trajectories.

Daily buoy observations were subjected to quality control and correction using NP data, which are similar to Rigor et al.¹⁷, in the following three steps: 1) Constrain the daily buoy observations with standard deviations of the daily NP observations in the corresponding month. Buoy observations were discarded when σ_a < 0.25 σ_NP or σ_a > 4 σ_NP (σ_a and σ_NP respectively represent the standard deviations of daily buoy and NP observations for a certain month); 2) Remove the outliers. If buoy daily measurements surpass μ_NP ± 2 σ_NP (μ_NP is the monthly mean of NP observations), they are eliminated. 3) Make corrections on summertime (June-July-August) daily buoy observations. The daily buoy observations were filtered using a 1-week moving average. The mean of the filtered buoy observations was replaced by the average of summer NP observations. If the adjusted buoy’s temperature exceeds the NP maximum, they were then changed to the NP maximum. Due to the fact that a considerable volume of heat will be needed to melt sea ice, the buoy summer temperature will be close to the melting point, necessitating this correction^16,17.

Conversion of Buoy ST to SAT

Due to unavailability of SAT records prior to 2011 for IABP buoys, there is a potential way by using ST to extend the reconstruction back to 1979. Previous studies^16,37,38,39 on the link between temperatures at different altitudes in the Arctic indicated that ST and air temperature over sea ice have a close relationship throughout the year, especially during sea ice melt seasons. As a result, it may be possible to obtain the SAT by exploiting its relationship with ST based on NP data. During the period of May-August, however, ST for NP is unavailable. So, we developed linear regression models for STs and SATs in various temperature intervals based on NP data (Table 1). Using buoy ST and the aforementioned regression models, the SAT (corresponding to ST less than 0°C) prior to 2011 can be generated. This newly produced SATs were then subjected to the same quality control as the original buoy SATs. In addition, when the buoy ST is greater than or equal to 0°C, we approximated the buoy SAT with the buoy ST after removing potentially invalid data. The ST greater than or equal to 0°C is regarded as potentially invalid observation and set as missing value, if the SAT inferred from ST immediately before and after the ST is a missing value. Then, the new SAT including in May-August underwent the same quality control and corrections as the original buoy SAT. Finally, the SATs prior to 2011 are produced (hereafter SAT-n).

Table 1 Linear regression relationship between ST and SAT at various ST intervals based on NP data.

Full size table

The conversion of daily ST to daily SAT may introduce errors into the reconstruction of the Arctic SAT. Notably, our purpose is to acquire a set of monthly Arctic SATs for 1979–2021 in this study. Therefore, it is vital to assess the reliability of the daily SAT-n (inferred from the ST) used for the reconstruction of monthly Arctic SAT. We designed three experiments for reconstructing Arctic SAT with the buoy observations for 2011–2020 by using the deep learning model (DLM, further details seen in ‘Deep learning model and training’), when both SAT and ST were available. In the first experiment (expt1), IABP buoy SAT was included in the reconstruction. In the second experiment (expt2), IABP buoy SAT-n (inferred from the ST) was included in the reconstruction, while in the third experiment (expt3), IABP buoy observations (both SAT and SAT-n) were excluded from the reconstruction. The difference between expt1 and expt2 in the reconstructed Arctic SAT can be used to validate the reliability of SAT-n in the reconstruction, whilst the difference between expt1 and expt3 will demonstrate the added value of IABP buoy observations in the reconstruction.

As shown in Fig. 3, there is a subtle difference between expt1 and expt2 regarding the annual average Arctic SAT. And, slightly higher discrepancies (less than 0.1°C) are seen in sea ice melt seasons, which may be due to the direct approximation of the SAT with ST in these seasons. Moreover, these disparities lack discernible linear trend. In addition, the added value (expt1-expt3) resulting from the inclusion of IABP buoy observations in reconstruction is evident and significantly larger than the deviation resulting from substitution of the SAT with SAT-n in the reconstruction. The inclusion of IABP buoy observations results in stronger warming (positive anomaly with a maximum of 0.78°C in expt1-expt3) for the annual average Arctic SAT from the mid-2014 to the early-2019.

The differences in the spatial warming trends over the Arctic were also examined for July and January between expt1and expt2, expt1 and expt3, respectively. During 2011–2020, the differences between expt1 and expt2 (Fig. 4b) are considerably smaller than the Arctic warming trends (Fig. 4a) in January. The differences (expt1-expt2) are less than 0.12°C/10a (Fig. 4b) and much smaller than the deviations of expt3 from expt1 (Fig. 4c), which indicate the added value from the inclusion of buoy observations in the reconstruction. It is obvious in Fig. 4c that the largest difference is more than 4°C/10a over the central Arctic Ocean, the Laptev Sea and the East Siberian Sea. In addition, a similar conclusion was also obtained for July. The deviations of expt2 from expt1 (Fig. 4e) are much smaller than the reconstructed SAT trends with inclusion of the IABP SAT (Fig. 4d). They are also significantly smaller than the deviations of expt3 from expt1 (Fig. 4f), especially over the region from the northern Alaska northward to 80°N. These results demonstrate that the buoy ST, after quality control and correction, can be used for the reconstruction of Arctic SAT in the absence of buoy SAT.

Base data for reconstruction

The locations of buoy, drifting ice station and ship observations move over time. To reconstruct the Arctic SAT, it is necessary to combine all observations using the same geographical grid as a base data. Due to the geographical distortion of the lat-lon grid in the polar region, an equal-area grid (Equal-Area Scalable Earth Grid (EASE-Grid 2.0⁴⁰), hereafter EASE) was used to provide uniform spatial representation for all observations. Similar to prior studies^16,17, we employed an EASE grid with a cell size of 100 km × 100 km (a total of 32,400 grids with 180 rows and 180 columns). From 2011, there are on average 18 buoy observations on EASE grids every day. In contrast, there are only 3–4 monthly buoy observations (monthly average requiring at least 15 daily observations in one month for the given EASE grid cell). It is clear that more buoy observational information will be retained if the reconstruction is based on daily rather than monthly buoy observations.

After all four source daily observations (quality-controlled SATs from GHCN-d, ICOADS, NP, IABP) were put into the EASE grid cells, the gridded observations were then normalized (to accelerate the convergence speed of model training) as base data using Eq. (1), where X* indicates the temperature after normalization and X represents the original data before normalization. a, b and t denote the row and column numbers and the time of the equal-area grid, respectively. The μ and σ represent the mean and standard deviation of the ERA5 SAT (2 m) over 1979–2020, respectively. As examples, the EASE gridded observations of base data on March 12 and July 20, 2015 were shown in Fig. 6a,d and the observation data gap is obviously seen, although all instrumental observations were integrated into the base data.

$$X{\left(a,b,t\right)}^{\ast }=\frac{X\left(a,b,t\right)-\mu \left(a,b\right)}{\sigma \left(a,b\right)}$$

(1)

Training, validation and testing data for DLM

Different training datasets for DLMs may induce disparities in the reconstruction of temperatures over polar areas. Prior studies examined reanalysis datasets, such as MERRA2 (Modern-Era Retrospective analysis for Research and Applications, Version 2)⁴¹, JRA-55 (Japanese 55-year Reanalysis)⁴², ERA-Interim (ECMWF Re-Analysis-Interim ERA-I)⁴³, ERA5 (ECMWF Reanalysis v5)⁴⁴, ASRv2 (Arctic System Reanalysis, Version 2)⁴⁵, etc., in reproducing SAT over the Arctic^{46,47,48,49,50,51,52}, and indicated that ERA5 and ERA-I are in better agreement with the observed Arctic temperature variations. Wang et al.⁵² further indicated that SATs from ERA5 are closer to the Arctic observations when SAT above −25°C, while below −25°C SATs from ERA-I are closer to the observations. So, SATs (2 m) from ERA5 and ERA-I were adopted for DLM training in this study.

Both ERA-I and ERA5 were developed by ECMWF (European Centre for Medium-Range Weather Forecasts) as global reanalyses with a spatial resolution of around 80 km and 31 km, respectively. ERA-I is the third generation of reanalysis accessible from January 1979 to August 2019, which has already been superseded by ERA5. ERA5 is continuously updated and already extended back to 1950⁵³. For DLM training and testing, a total of 14,610 daily SAT data from 1979 to 2018 and 15,341 daily SAT samples from 1979 to 2020 were adopted from ERA-I and ERA-5, respectively.

The reanalysis SATs (2 m) from 1979 were divided into a training set of 1979–2005 for model training (19,724 daily samples), a validation set of 2006–2012 for adjusting the model hyperparameters (5,114 daily samples), and a testing set since 2013 for testing model generalisation performance (5,113 daily samples). The binary masks were derived from the base data. During 1979–2020, there was a total of 15,341 binary masks. The 0 indicates the absence of observations in the binary mask, while 1 indicates the presence of observations.

Deep learning model and training

This study utilized DLM to fill the instrumental observation gaps in the Arctic. In place of conventional convolutional layers, a UNet-like architecture with partial convolutional layers⁵⁴ was utilized here (Fig. 5 and Table 2). Missing values in the base data are marked with a binary mask that changes with each convolution process. During this procedure, the missing values will be constructed iteratively. As shown in Fig. 5, the model encoding phase is able to extract temperature information across a broad spatial range, allowing the reconstruction to take into account not only nearby but also distant observations. The right side of the diagram illustrates the decoding process. The skip links will establish a connection between the two temperature fields and the two binary masks, transferring the original information to the decoding step and supplying more details for the reconstruction of the missing area.

Table 2 Details of network architecture.

Full size table

The DLM was trained with daily SATs from the reanalysis datasets ERA5 and ERA-I. At first, the reanalysis SATs were put into the same EASE grids as those in base data. The SATs were then normalized based on the monthly mean and monthly standard deviation of ERA5 SATs over 1979–2020. The reconstructions were performed on EASE grids containing 127 rows and 127 columns. The EASE gridded reanalysis SATs were masked with the binary mask before being input into the DLM to reconstruct the corresponding reanalysis SATs. To increase the generic capability, the DLM was trained with the reanalysis SATs by randomly selecting the aforementioned binary masks.

In this study, MSE (Mean squared error) was used as the loss function, which is calculated for SATs across the entire Arctic region between the DLM’s output and the corresponding SATs from the reanalysis. The DLM was trained with 6,000 iterations by applying a batch size of 50. Every 100 iterations, the trained DLM with updated parameters was utilized to reconstruct the Arctic SATs in the training and validation sets, respectively. Then, their MSEs were also calculated. The DLM’s training was not completed until the MSE in the training set no longer declines or continues to decline but it begins to grow in the validation set.

Testing of the trained DLM

The ERA5 and ERA-I SATs after 2012 were used as testing set to test the trained DLM by reconstructing Arctic SAT over 2013–2018. Here, the reconstructed Arctic SATs of two days as examples were shown in Fig. 6. The SATs of reanalysis datasets at observational grids that were input to the DLM were demonstrated in Fig. 6a,d. From the observational grids, it is clear that land station observations are mainly located in North America and Europe, with a few coastline observations in Greenland. Buoy observations are concentrated in the western Arctic and a few in the Barents-Kara seas. It is obvious from Fig. 6b,c and Fig. 6e,f that the reconstructed SATs are highly consistent with the reanalysis data, respectively, with spatial correlation coefficients of 0.997 and 0.993. In addition, the reconstructed daily Arctic SATs correlate with that of ERA5 and ERA-I with same temporal field correlation coefficient of 0.997 during the period of 2013–2018. These correlation coefficients are statistically significant. The above test results indicate that the DLM trained with ERA5 and ERA-I is able to reproduce the SATs over the Arctic well based on the limited observations.

Data Records

The dataset is available at Figshare⁵⁵. As a result of this work, monthly Arctic SAT anomalies relative to 1981–2010 are provided for 1979–2021. The daily Arctic SATs are also presented for 2011–2021. Moreover, the reconstructed SATs north of 30°N are stored in NetCDF format files with 1°x1° latitude-longitude grids, each of which is defined in three dimensions (time, latitude, and longitude). The files ‘Arctic SAT ano 1 × 1 1979–2021 monthly v1.nc’ and ‘Arctic SAT ano 1 × 1 2011–2021 daily v1.nc’ contain the monthly Arctic SAT anomalies for 1979–2021 and the daily Arctic SATs for 2011–2021, respectively. In these files, “SAT” represents the reconstructed SAT anomalies/SATs, “lat” represents the latitude, and “lon” represents the longitude. Moreover, these datasets can be continuously and consistently updated using the given procedures. Notably, the Arctic SAT reconstruction in this study is based on the SAT over the ocean, land, and sea ice.

Technical Validation

Validation of the reconstructed Arctic SAT

Six terrestrial and three marine station observations were randomly chosen to verify the reconstruction. The terrestrial station observations come from GHCN-d and the marine station observations are from NP drifting ice station data. These station observations were excluded from DLM training and reconstruction. The reconstructed SAT, SAT from ERA5 and ERA-I were respectively interpolated to the observational stations over land and the tracks of NP drifting ice stations. Correlation and RMSE (root mean squared error) were adopted to evaluate their relationships with land and marine observations (Table 3).

Table 3 Comparison of the reconstructed, ERA-5 and ERA-Interim (ERA-I) Arctic SAT with observations from six terrestrial and three drifting ice stations.

Full size table

Daily observations of six land stations cover 8 years from 2011–2018. The correlation coefficients of the reconstructed SAT with Land-1~Land-6 station observations are 0.996, 0.981, 0.962, 0.985, 0.995 and 0.973, respectively, which are comparable with those from the atmospheric reanalysis datasets (Table 3). All of these correlations were statistically significant. The RMSEs of the reconstructed SAT are at least comparable with those from one of reanalysis datasets. In general, the reconstruction results over land are comparable with those from the reanalysis datasets, and both are close to the observations. This may be due to the assimilation of terrestrial observations in reanalysis datasets.

The three drifting station observations cover the most of the period of 2003–2006 (Table 3), which were also used to evaluate the reconstruction over the Arctic Ocean. As shown in Table 3, the reconstructed SAT is closer to the marine observations than the reanalysis datasets, with higher correlation coefficients and lower RMSE. Much improvement of the reconstructed SAT relative to the reanalysis datasets may be due to the absorption of buoy observations over the Arctic Ocean in the reconstruction, while the reanalysis datasets lack the assimilation of these observations. It is worth noting that (1) this comparison period is before 2011, when buoy’s SAT-n (inferred from buoy’s ST) over the Arctic Ocean is used in the reconstruction; (2) the six terrestrial and three marine station observations were excluded together in the above experiment. Therefore, it can be concluded that our work can reasonably reconstruct the Arctic SATs since 1979.

In addition, we also investigated the differences in warming trends over 1979–2021 between the reconstructed Arctic SATs and those from ERA5⁴⁴, NASA GISTEMP v4¹⁹ and Berkeley Earth²¹ in March (Fig. 7) and September (Fig. 8), the maximum and minimum sea ice extent months in the Arctic, respectively. These four data sets consistently demonstrate that March Arctic SATs are warming most significantly over the Arctic Ocean and along the Eurasian coastline (Fig. 7). Nonetheless, the reconstructed SATs indicate a much stronger warming in the Arctic (0.728 ± 0.028°C/10a) relative to ERA5 (0.553 ± 0.025°C/10a), NASA GISTEMP v4 (0.617 ± 0.025°C/10a) and Berkeley Earth (0.632 ± 0.026°C/10a), particularly in the area extending from the central ocean to the East Siberian Sea as well as the coastal region of the northern Alaska, and over Greenland. In September, the Arctic temperature warming trends in the reconstruction (0.55 ± 0.013°C/10a) are weaker than the Berkeley Earth (0.571 ± 0.013°C/10a) and ERA5 (0.571 ± 0.012°C/10a), particularly in the region from the Kara Sea eastward to the Beaufort Sea, but stronger than the GISTEMP (0.463 ± 0.012°C/10a) (Fig. 8). In addition, among the four data sets, the reconstructed SATs for Greenland indicate the strongest warming. All trends above-mentioned were calculated for the area north of 60°N. In general, a warmer Arctic Ocean and Greenland have been reconstructed after additional absorption of observations on sea ice in the Arctic (Figs. 7, 8).

Code availability

The code used in this study can be found at https://doi.org/10.6084/m9.figshare.21940490.v1. This code may be updated over time⁵⁶.

References

Box, J. et al. Key indicators of Arctic climate change: 1971–2017. Environ. Res. Lett. 14, 045010 (2019).
Article ADS CAS Google Scholar
Chen, H. W., Zhang, F. & Alley, R. B. The Robustness of Midlatitude Weather Pattern Changes due to Arctic Sea Ice Loss. Journal of Climate 29, 7831–7849 (2016).
Article ADS Google Scholar
Cohen, J. et al. Recent Arctic amplification and extreme mid-latitude weather. Nature Geoscience 7, 627–637 (2014).
Article ADS CAS Google Scholar
van den Broeke, M. R. et al. On the recent contribution of the Greenland ice sheet to sea level change. The Cryosphere 10, 1933–1946, https://doi.org/10.5194/tc-10-1933-2016 (2016).
Article ADS Google Scholar
Cohen, J. et al. Divergent consensuses on Arctic amplification influence on midlatitude severe winter weather. Nature Climate Change 10, 20–29 (2020).
Article ADS Google Scholar
Thoman, R. L. et al. The Arctic. Bulletin of the American Meteorological Society 103, S257–S306 (2022).
Article ADS Google Scholar
Shepherd, A. et al. Mass balance of the Greenland Ice Sheet from 1992 to 2018. Nature 579, 233–239 (2020).
Article Google Scholar
Druckenmiller, M. L., Thoman, R. L. & Moon, T. A. Arctic Report Card 2022: Executive Summary https://doi.org/10.25923/yjx6-r184 (2022).
Article Google Scholar
Cowtan, K. & Way, R. G. Coverage bias in the HadCRUT4 temperature series and its impact on recent temperature trends. Quarterly Journal of the Royal Meteorological Society 140, 1935–1944 (2014).
Article ADS Google Scholar
Huang, J. et al. Recently amplified arctic warming has contributed to a continual global warming trend. Nature Climate Change 7, 875–879 (2017).
Article ADS Google Scholar
Saffioti, C., Fischer, E. M. & Knutti, R. Contributions of atmospheric circulation variability and data coverage bias to the warming hiatus. Geophysical Research Letters 42, 2385–2391 (2015).
Article ADS Google Scholar
Mears, C. A., Schabel, M. C. & Wentz, F. J. A Reanalysis of the MSU Channel 2 Tropospheric Temperature Record. Journal of Climate 16, 3650–3664 (2003).
Article ADS Google Scholar
Nielsen-Englyst, P. et al. Deriving Arctic 2 m air temperatures over snow and ice from satellite surface temperature measurements. The Cryosphere 15, 3035–3057 (2021).
Article ADS Google Scholar
Bromwich, D. H., Fogt, R. L., Hodges, K. I. & Walsh, J. E. A tropospheric assessment of the ERA-40, NCEP, and JRA-25 global reanalyses in the polar regions. Journal of Geophysical Research: Atmospheres 112 (2007).
Bromwich, D. H., Wilson, A. B., Bai, L.-S., Moore, G. W. K. & Bauer, P. A comparison of the regional Arctic System Reanalysis and the global ERA-Interim Reanalysis for the Arctic. Quarterly Journal of the Royal Meteorological Society 142, 644–658 (2016).
Article ADS Google Scholar
Martin, S. & Munoz, E. A. Properties of the Arctic 2-Meter Air Temperature Field for 1979 to the Present Derived from a New Gridded Dataset. Journal of Climate 10, 1428–1440 (1997).
Article ADS Google Scholar
Rigor, I. G., Colony, R. L. & Martin, S. Variations in Surface Air Temperature Observations in the Arctic, 1979–97. Journal of Climate 13, 896–914 (2000).
Article ADS Google Scholar
Kadow, C., Hall, D. M. & Ulbrich, U. Artificial intelligence reconstructs missing climate information. Nature Geoscience 13, 408–413 (2020).
Article ADS CAS Google Scholar
Lenssen, N. J. L. et al. Improvements in the GISTEMP Uncertainty Model. Journal of Geophysical Research: Atmospheres 124, 6307–6326 (2019).
Article ADS Google Scholar
Morice, C. P. et al. An Updated Assessment of Near-Surface Temperature Change From 1850: The HadCRUT5 Data Set. Journal of Geophysical Research: Atmospheres 126, e2019JD032361 (2021).
ADS Google Scholar
Rohde, R. A. & Hausfather, Z. The Berkeley Earth Land/Ocean Temperature Record. Earth Syst. Sci. Data 12, 3469–3479 (2020).
Article ADS Google Scholar
Sun, W. et al. The Assessment of Global Surface Temperature Change from 1850s: The C-LSAT2.0 Ensemble and the CMST-Interim Datasets. Advances in Atmospheric Sciences 38, 875–888 (2021).
Article ADS Google Scholar
Sun, W. et al. Description of the China global Merged Surface Temperature version 2.0. Earth Syst. Sci. Data 14, 1677–1693 (2022).
Article ADS Google Scholar
Vaccaro, A. et al. Climate Field Completion via Markov Random Fields: Application to the HadCRUT4.6 Temperature Dataset. Journal of Climate 34, 4169–4188 (2021).
Article ADS Google Scholar
Vose, R. S. et al. Implementing Full Spatial Coverage in NOAA’s Global Temperature Analysis. Geophysical Research Letters 48, e2020GL090873 (2021).
Article ADS Google Scholar
Andersson, T. R. et al. Seasonal Arctic sea ice forecasting with probabilistic deep learning. Nature Communications 12, 5124 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Choi, M., De Silva, L. W. & Yamaguchi, H. Artificial Neural Network for the Short-Term Prediction of Arctic Sea Ice Concentration. Remote Sensing 11 (2019).
Wang, L., Scott, K. A. & Clausi, D. A. Sea Ice Concentration Estimation during Freeze-Up from SAR Imagery Using a Convolutional Neural Network. Remote Sensing 9 (2017).
Chi, J. & Kim, H.-c. Prediction of Arctic Sea Ice Concentration Using a Fully Data Driven Deep Neural Network. Remote Sensing 9 (2017).
Arslan, N. & Sekertekin, A. Application of Long Short-Term Memory neural network model for the reconstruction of MODIS Land Surface Temperature images. Journal of Atmospheric and Solar-Terrestrial Physics 194, 105100 (2019).
Article Google Scholar
Dong, J. et al. Inpainting of Remote Sensing SST Images With Deep Convolutional Generative Adversarial Network. IEEE Geoscience and Remote Sensing Letters 16, 173–177 (2019).
Article ADS Google Scholar
Şahin, M., Yıldız, B. Y., Şenkal, O. & Peştemalcı, V. Modelling and Remote Sensing of Land Surface Temperature in Turkey. Journal of the Indian Society of Remote Sensing 40, 399–409 (2012).
Article Google Scholar
Zhang, Q., Yuan, Q., Zeng, C., Li, X. & Wei, Y. Missing Data Reconstruction in Remote Sensing Image With a Unified Spatial–Temporal–Spectral Deep Convolutional Neural Network. IEEE Transactions on Geoscience and Remote Sensing 56, 4274–4288 (2018).
Article ADS Google Scholar
Menne, M. J., Durre, I., Vose, R. S., Gleason, B. E. & Houston, T. G. An Overview of the Global Historical Climatology Network-Daily Database. Journal of Atmospheric and Oceanic Technology 29, 897–910 (2012).
Article ADS Google Scholar
Kahl, J. D. W. et al. Radiosonde Observations from the Former Soviet “North Pole” Series of Drifting Ice Stations, 1954–90. Bulletin of the American Meteorological Society 80, 2019–2026 (1999).
Article ADS Google Scholar
Freeman, E. et al. ICOADS Release 3.0: a major update to the historical marine climate record. International Journal of Climatology 37, 2211–2232 (2017).
Article ADS Google Scholar
Jordan, R. E., Andreas, E. L. & Makshtas, A. P. Heat budget of snow-covered sea ice at North Pole 4. Journal of Geophysical Research: Oceans 104, 7785–7806 (1999).
Article Google Scholar
Overland, J. E. & Guest, P. S. The Arctic snow and air temperature budget over sea ice during winter. Journal of Geophysical Research: Oceans 96, 4651–4662 (1991).
Article Google Scholar
Persson, P. O. G., Fairall, C. W., Andreas, E. L., Guest, P. S. & Perovich, D. K. Measurements near the Atmospheric Surface Flux Group tower at SHEBA: Near-surface conditions and surface energy budget. Journal of Geophysical Research: Oceans 107, SHE 21-21-SHE 21–35 (2002).
Brodzik, M. J., Billingsley, B., Haran, T., Raup, B. & Savoie, M. H. EASE-Grid 2.0: Incremental but Significant Improvements for Earth-Gridded Data Sets. ISPRS International Journal of Geo-Information 1 (2012).
Gelaro, R. et al. The Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2). Journal of Climate 30, 5419–5454 (2017).
Article ADS Google Scholar
Kobayashi, S. et al. The JRA-55 Reanalysis: General Specifications and Basic Characteristics. Journal of the Meteorological Society of Japan. Ser. II 93, 5–48 (2015).
Article ADS Google Scholar
Berrisford, P. et al. The ERA-Interim archive Version 2.0. (ECMWF, Shinfield Park, Reading, 2011).
Hersbach, H. et al. The ERA5 global reanalysis. Quarterly Journal of the Royal Meteorological Society 146, 1999–2049 (2020).
Article ADS Google Scholar
Bromwich, D. H. et al. The Arctic System Reanalysis, Version 2. Bulletin of the American Meteorological Society 99, 805–828 (2018).
Article ADS Google Scholar
Chaudhuri, A. H., Ponte, R. M. & Nguyen, A. T. A Comparison of Atmospheric Reanalysis Products for the Arctic Ocean and Implications for Uncertainties in Air–Sea Fluxes. Journal of Climate 27, 5411–5421 (2014).
Article ADS Google Scholar
Demchev, D. M. et al. Verification of ERA-Interim and ERA5 Reanalyses Data on Surface Air Temperature in the Arctic. Russian Meteorology and Hydrology 45, 771–777 (2020).
Article Google Scholar
Graham, R. M. et al. Evaluation of Six Atmospheric Reanalyses over Arctic Sea Ice from Winter to Early Summer. Journal of Climate 32, 4121–4143 (2019).
Article ADS Google Scholar
Graham, R. M., Hudson, S. R. & Maturilli, M. Improved Performance of ERA5 in Arctic Gateway Relative to Four Global Atmospheric Reanalyses. Geophysical Research Letters 46, 6138–6147 (2019).
Article ADS Google Scholar
Jakobson, E. et al. Validation of atmospheric reanalyses over the central Arctic Ocean. Geophysical Research Letters 39 (2012).
Lindsay, R., Wensnahan, M., Schweiger, A. & Zhang, J. Evaluation of Seven Different Atmospheric Reanalysis Products in the Arctic. Journal of Climate 27, 2588–2606 (2014).
Article ADS Google Scholar
Wang, C., Graham, R. M., Wang, K., Gerland, S. & Granskog, M. A. Comparison of ERA5 and ERA-Interim near-surface air temperature, snowfall and precipitation over Arctic sea ice: effects on sea ice thermodynamics and evolution. The Cryosphere 13, 1661–1679 (2019).
Article ADS Google Scholar
Bell, B. et al. The ERA5 global reanalysis: Preliminary extension to 1950. Quarterly Journal of the Royal Meteorological Society 147, 4186–4227 (2021).
Article ADS Google Scholar
Liu, G. et al. In Computer Vision—ECCV 2018 Lecture Notes in Computer Science, Vol. 11215 (eds Ferrari, V. et al.) 19–35 (2018)
Ziqi, M. et al. Newly reconstructed Arctic surface air temperatures for 1979-2021 with deep learning method. figshare. https://doi.org/10.6084/m9.figshare.21345177.v6 (2022).
Ziqi, M. et al. Newly reconstructed Arctic surface air temperatures for 1979-2021 with deep learning method-code. figshare. https://doi.org/10.6084/m9.figshare.21940490.v1 (2023).

Download references

Acknowledgements

This work is supported by the National Key R & D Program of China (Grant No.: 2022YFE0106700), the Norwegian Research Council (Grant No.: 328957) and NOAA CPO NA19NES4320002 and DOE DE-SC0020640. We thank National Oceanographic and Atmospheric Administration (NOAA) and the National Centers for Environmental Information (NCEI) for providing daily land station observations and ocean temperature observations. We also thank observers for providing NP observations on sea ice; the National Snow and Ice Data Center (NSIDC) for providing the equal-area scalable earth (EASE) grid information; the International Arctic Buoy Programme (IABP) for collecting buoy temperature observations. We appreciate the assistance of Ignatius Rigor (IABP), Wendy Ermold (IABP), and D. M. Demchev (AARI) with Arctic observations. Special thanks also to those IABP buoy data providers.

Author information

Authors and Affiliations

Key Laboratory of Plateau Atmosphere and Environment, Chengdu University of Information Technology, Chengdu, Sichuan Province, 610225, China
Ziqi Ma & Jun Wen
College of Resources and Environment, University of Chinese Academy of Sciences, Beijing, 100190, China
Jianbin Huang
Beijing Yanshan Earth Critical Zone National Research Station, University of Chinese Academy of Sciences, Beijing, 101408, China
Jianbin Huang
NOAA CISESS, North Carolina State University, Asheville, NC, 28801, USA
Xiangdong Zhang
Ministry of Education Key Laboratory for Earth System Modeling, Department of Earth System Science, Tsinghua University, Beijing, 100084, China
Yong Luo, Chen Qiao & Yifu Yin
Joint Center for Global Change Studies, Beijing, 100875, China
Yong Luo
State Key Laboratory of Cryosphere Science, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou, Gansu, 730000, China
Yong Luo
Institute of Tibetan Plateau and Polar Meteorology, Chinese Academy of Meteorological Sciences, Beijing, 100081, China
Minghu Ding
Software Technology Center Asia, Microsoft Corporation, Beijing, 100080, China
Weixin Jin

Authors

Ziqi Ma
View author publications
Search author on:PubMed Google Scholar
Jianbin Huang
View author publications
Search author on:PubMed Google Scholar
Xiangdong Zhang
View author publications
Search author on:PubMed Google Scholar
Yong Luo
View author publications
Search author on:PubMed Google Scholar
Minghu Ding
View author publications
Search author on:PubMed Google Scholar
Jun Wen
View author publications
Search author on:PubMed Google Scholar
Weixin Jin
View author publications
Search author on:PubMed Google Scholar
Chen Qiao
View author publications
Search author on:PubMed Google Scholar
Yifu Yin
View author publications
Search author on:PubMed Google Scholar

Contributions

J.H., Y.L., X.Z. designed the research and analysed research results. J.H. and Z.M. collected data, verified the approach, and conducted computations and analysed the results. Z.M. and J.H. drafted, and J.H., X.Z., & M.D. revised the manuscript. All other authors contributed to analysis of research results and improvement of the manuscript.

Corresponding author

Correspondence to Jianbin Huang.

Ethics declarations

Competing interests

The authors declare no competing interest.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ma, Z., Huang, J., Zhang, X. et al. Newly reconstructed Arctic surface air temperatures for 1979–2021 with deep learning method. Sci Data 10, 140 (2023). https://doi.org/10.1038/s41597-023-02059-5

Download citation

Received: 19 December 2022
Accepted: 08 March 2023
Published: 15 March 2023
DOI: https://doi.org/10.1038/s41597-023-02059-5

This article is cited by

Deep learning-based reconstruction of monthly Antarctic surface air temperatures from 1979 to 2023
- Ziqi Ma
- Jianbin Huang
- Minghu Ding
Scientific Data (2025)
Daily station-level records of air temperature, snow depth, and ground temperature in the Northern Hemisphere
- Vinh Ngoc Tran
- Wenbo Zhou
- Valeriy Y. Ivanov
Scientific Data (2024)