Evaluation of spatial-temporal variation performance of ERA5 precipitation data in China

Jiao, Donglai; Xu, Nannan; Yang, Fan; Xu, Ke

doi:10.1038/s41598-021-97432-y

Download PDF

Article
Open access
Published: 09 September 2021

Evaluation of spatial-temporal variation performance of ERA5 precipitation data in China

Donglai Jiao¹,
Nannan Xu²,
Fan Yang¹ &
…
Ke Xu¹

Scientific Reports volume 11, Article number: 17956 (2021) Cite this article

17k Accesses
243 Citations
Metrics details

Subjects

Abstract

ERA5 is the latest fifth-generation reanalysis global atmosphere dataset from the European Centre for Medium-Range Weather Forecasts, replacing ERA-Interim as the next generation of representative satellite-observational data on the global scale. ERA5 data have been evaluated and applied in different regions, but the performances are inconsistent. Meanwhile, there are few precise evaluations of ERA5 precipitation data over long time series have been performed in Chinese mainland. This study evaluates the temporal-spatial performance of ERA5 precipitation data from 1979 to 2018 based on gridded-ground meteorological station observational data across China. The results showed that ERA5 data could capture the annual and seasonal patterns of observed precipitation in China well, with correlation coefficient values ranging from 0.796 to 0.945, but ERA5 slightly overestimated precipitation in the summer. Nonetheless, the results also showed that the accuracy of the precipitation products was strongly correlated with topographic distribution and climatic divisions. The performance of ERA5 shows spatial inherently across China that the highest correlation coefficient values locate in eastern, Northwestern and North China and the lowest biases locate in Southeast China. This study provides a reliable data assessment of the ERA5 data and precipitation trend analyses in China. The results provide accuracy references for the further use of precipitation satellite data for hydrological calculations and climate numerical simulations.

Evaluation of IMERG and ERA5 precipitation products over the Mongolian Plateau

Article Open access 16 December 2022

Extreme precipitation patterns in the Asia–Pacific region and its correlation with El Niño-Southern Oscillation (ENSO)

Article Open access 08 July 2023

Global spatio-temporal ERA5 precipitation downscaling to km and sub-hourly scale using generative AI

Article Open access 15 June 2025

Introduction

As one of the key types of measurements of the Earth's climate system and hydrological cycle, accurate precipitation measurements are vital to predicting weather, observing ecological changes, predicting droughts and floods, etc.^1,2,3. Traditionally, precipitation data have been measured based on ground-based meteorological instruments and rain-gauge observations at fixed points. However, the spatial resolution of precipitation data obtained from ground-based meteorological stations is poor and susceptible to regional climatic influences, and the accuracy of the spatial distribution information of precipitation data is also insufficient⁴. With the development of remote sensing and computer technology, observing precipitation by satellite is becoming more advanced. Problems with uneven distributions of meteorological sites and radar signal interference can be avoided using satellites to observe precipitation⁵. Over the past decades, climate reanalysis data have been widely used in many fields, such as precipitation forecasting⁶, temperature prediction⁷, soil moisture monitoring⁸, ocean monitoring⁹, and environmental protection¹⁰, all of which use the laws of physics to combine model data with observations from around the world into a complete global dataset. These data with high spatial-temporal resolution can effectively compensate for the lack of direct ground-based precipitation observations.

The fifth generation of global climate reanalysis data, ERA5, published by the European Centre for Medium-Range Weather Forecasts (ECMWF), combines vast amounts of historical observations into global estimates using advanced modeling and data assimilation systems. Although there are several groups produce global atmospheric reanalysis and the most recent products are the MERRA-2 reanalysis¹¹, JRA-55¹² and CFSR (version 2)¹³, ERA5 have been shown to be the best or amongst the best performing reanalysis products by many studies^14,15,16. ERA5 has much higher spatial and temporal resolution than ERA-Interim, providing quality precipitation data over space and time with a much improved troposphere and representation of tropical cyclones¹⁷. Compared to ERA-Interim, ERA5 also provides an enhanced number of output parameters. The move from ERA-Interim to ERA5 represents an increase in overall quality and the level of detail, which has superseded the ERA-Interim reanalysis data¹⁸. For example, Beck et al. (2019) found that ERA5 precipitation data provide significant improvement over ERA-Interim precipitation data at daily time steps against radar and precipitation gauge observations across the conterminous United States¹⁹. Hersbach et al. (2020) found that in comparison to ERA-Interim, the new reanalysis provides better precipitation data at a much higher spatial resolution on the global scale²⁰. The ERA5 dataset is a valuable resource for climate change and scientists in the field of hydrological modeling and beyond. Additionally, some studies have documented the increased accuracy of ERA-5 over that of the ERA-Interim in matching observations for several variables, regions and periods. For example, Wang et al. (2019) estimated ERA5 and ERA-Interim precipitation data over the Arctic sea ice²¹. Zhang et al. (2019) analyzed the atmospheric precipitable water vapor of ERA5 over China²². Albergel et al. (2018) found that a land surface model forced by ERA5 could improve the simulation of evaporation, soil moisture, and river discharge over the continental United States²³. Additionally, Nogueira (2020) evaluated ERA5 and ERA-Interim precipitation data using GPCP (Global Precipitation Climatology Project) as a reference at a global scale and found that it overestimated deep convection and moisture flux convergence over tropical oceans and land, leading to excessive rainfall²⁴. Many studies of the precipitation performance of ERA-Interim have focused more on the spatial performance of reanalysis data at seasonal and annual scales^25,26, which helps to enhance our understanding of data performance across all regions of China and over time.

ERA5 precipitation data has a huge improvement over ERA-Interim. Although ERA5 precipitation data have been widely evaluated, the performances are inconsistent in different regions^21,22,23,24. Few researchers have evaluated the comprehensive performance of the data over a long time series in Chinese mainland¹⁶. Meanwhile, China has complex topography and pronounced climatic heterogeneity, which can affect precipitation amounts significantly. Therefore, the superiority of ERA5 precipitation data over China should be comprehensively analyzed. This study estimates the performance of ERA5 precipitation data from 1979 to 2018 based on gridded observational data (CN05.1) to compensate for the missing accuracy assessment of ERA5 precipitation data in Chinese mainland. The purposes of this study are to (1) evaluate the performance of ERA5 precipitation data against the observed data in China and (2) analyze the annual and seasonal trends and spatial patterns of ERA5 precipitation reanalysis data in China. The results are of great importance for the further rational use of reanalysis data and information on climate change and the hydrological cycle in China. The subsequent sections address the following: Sect. 2 introduces the dataset used in this study and the study area and methods. Section 3 focuses on the precision assessment results. Section 4 discusses factors that may influence the results. Finally, Sect. 5 concludes the paper.

Materials and methods

Study area

China locates in eastern Asia and has a complex topography, variable climate, and diverse terrestrial ecosystems. The precipitation shows significant spatial heterogeneities, decreasing from the southeastern coast to the northwestern interior. The southern region is affected by a tropical and subtropical monsoon climate, which is generally hot and rainy in summer and mild and wet in winter. The northern region is mostly affected by temperate monsoons and a temperate continental climate, with hot and rainy summers and cold and dry winters^27,28. For analyzing the spatial heterogeneity characteristics of precipitation and the performance of ERA5, this study divides Chinese mainland into seven climatic zones according to previous studies of Wang et al. and Yang et al.^29,30. The seven climatic zones are northeastern China (NE), northern China (N), southeastern China (SE), eastern northwestern China (ENW), southwestern China (SW), western northwestern China (WNW) and the Tibetan Plateau (Tibet) (Fig. 1).

Datasets

The European Centre for Medium-Range Weather Forecasts (ECMWF), an international organization supported by 34 countries, is the world's leading international weather forecasting research and operations organization. ERA5 is the latest fifth-generation reanalysis of the ECMWF, which is expected to include detailed meteorological reanalysis data from 1950 onwards by 2020, completely replacing the previous ERA-Interim. ERA5 is a large upgrade over ERA-Interim with higher spatial-temporal resolution, obtaining for the hourly estimates of atmospheric variables at a horizontal resolution of 31 km, with a total of 137 mode layers of 0.01 hPa (approximately 80 km from the surface). ERA5 uses more historical observations, especially satellite data, in advanced data assimilation and modeling systems to estimate more accurate atmospheric conditions³¹. ERA5 uses a pooled reanalysis product consisting of 10 members with a temporal resolution of 3 hours and a spatial resolution of 31 km to assess atmospheric uncertainty. This new feature is based on the data assimilation aggregation (EDA) system developed by ECMWF and can explain errors in observation and prediction models, giving users more confidence when analyzing atmospheric parameters at different times and places. This study uses daily ERA5 precipitation data on regular latitude-longitude grids at 0.25° × 0.25° from 1979 to 2018.

The gridded observational precipitation data (CN05.1) developed by Wu et al. (2013)³² is used to evaluate the performance of ERA5 data. CN05.1 is produced based on observations from more than 2400 stations in China, using the thin-plate spline interpolation method with longitude and latitude as the thin-plate spline function and elevation as a covariate to interpolate the site data³³. The data quality control uses a homogeneity analysis of the time series data and eliminates data with large deviations from historical records or surrounding sites. These observed data are validated in earlier studies^34,35. The spatial distribution of elevation in Chinese mainland is based on NASA Shuttle Radar Topographic Mission (SRTM) Digital Elevation Model (DEM) data (Fig. 1).

Statistical methods

This study estimates the temporal-spatial performance of ERA5 precipitation data using the validation statistical indices of the relative bias (Bias), correlation coefficient (CC), root-mean square error (RMSE), and standard deviation (SD) in seasonal and annual scale. In this study, Bias indicates the size of the deviation between the precipitation data from ERA5 and the observed precipitation data. The relative Bias can be calculated by Eq. (1):

$${\text{Bias}} = \frac{1}{{n*\overline{{X_{OBS} }} }}\mathop \sum \limits_{i = 1}^{n} \left( {X_{{ERA5_{i} }} - X_{{OBS_{i} }} } \right)*100\%$$

(1)

where the range of deviation is −∞~+∞ (the closer the deviation is to 0, the more accurate the data is) and Bias values >0 indicate overestimation, and <0 indicate underestimation.

The correlation coefficient (CC) reflects the degree of linear correlation between the ERA5 precipitation data and observed precipitation data (Eq. (2)).

$$CC = \frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {X_{{ERA5_{i} }} - \overline{X}_{ERA5} } \right)\left( {X_{{OBS_{i} }} - \overline{X}_{OBS} } \right)}}{{\sqrt {\mathop \sum \nolimits_{i = 1}^{n} \left( {X_{{ERA5_{i} }} - \overline{X}_{ERA5} } \right)^{2} } \sqrt {\mathop \sum \nolimits_{i = 1}^{n} \left( {X_{{OBS_{i} }} - \overline{X}_{OBS} } \right)^{2} } }}$$

(2)

where the range of CC is −1 to 1, completely correlated is 1 and completely uncorrelated is −1.

The RMSE reflects the overall level of error between the ERA5 precipitation data and the observed precipitation data, which can be interpreted as stable.

$$RMSE = \sqrt {\frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left( {X_{{ERA5_{i} }} - X_{{OBS_{i} }} } \right)^{2} }$$

(3)

where the range of RMSE is 0 to ∞, the smaller the value the smaller the overall deviation. For the above equations, $X_{{ERA5_{i} }}$ represents the precipitation data of ERA5, ${\text{X}}_{{{\text{OBS}}_{{\text{i}}} }}$ represents the observed precipitation, and $X_{ERA5}$ and $X_{OBS}$ represents average of precipitation for ERA5 and observation, respectively.

The SD can visually show and compare the “closeness” between the ERA5 and observation precipitation data³⁶.

$$SD = \sqrt {\frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left( {X_{i} - \overline{X}} \right)^{2} }$$

(4)

where n is the number of samples, i is the i^th grid, X_i is the precipitation estimated from ERA5 and observation, respectively. $\overline{X}$ is the average precipitation of the samples.

The line trend (or slope) of the precipitation at the spatial scale is solved by the least-square method³⁷.

$$Slope = \frac{{n\sum {X_{i} Y_{i} - } \sum {X_{i} Y_{i} } }}{{n\sum {X_{i}^{2} - \left( {\sum {X_{i} } } \right)}^{2} }}$$

(5)

where X_i is the ith year, Y_i represents the precipitation in year i, and n is the total number of years.

The Mann–Kendall test can examine the trend of the series by making the following assumptions about the time series:

1.
H0 hypothesis. It is assumed that the data in the series are independent identically distributed random samples, i.e., there is no significant trend.
2.
H1 hypothesis. It is assumed that there is an upward or downward monotonic trend in the series. Under the H0 hypothesis, the test statistic S is defined as:

$$S = \mathop \sum \limits_{i = 1}^{n - 1} \mathop \sum \limits_{j = i + 1}^{n} sgn\left( {x_{j} - x_{i} } \right)$$

(6)

where sgn is the symbolic function and $sgn = \left\{ {\begin{array}{*{20}l} {1,} \hfill & {\theta > 0} \hfill \\ {0,} \hfill & {\theta = 0} \hfill \\ { - 1,} \hfill & {\theta < 0} \hfill \\ \end{array} } \right.$. In Eq. (6), when n ≥ 10, the statistic S approximately obeys a normal distribution. S is normalized to obtain Z, and the significance test is performed using the statistical test value Z with the following formula:

$$Z = \left\{ {\begin{array}{*{20}l} {\left( {S - 1} \right)/\sqrt {var\left( S \right)} ,} \hfill & {S > 0} \hfill \\ {0,} \hfill & {S = 0} \hfill \\ {\left( {S + 1} \right)/\sqrt {var\left( S \right)} ,} \hfill & {S < 0} \hfill \\ \end{array} } \right.$$

(7)

$$var\left( S \right) = \left( {n\left( {n - 1} \right)\left( {2n + 5} \right) - \mathop \sum \limits_{i = 1}^{m} t_{i} \left( {t_{i} - 1} \right)\left( {2t_{i} + 5} \right)} \right)/18$$

(8)

where n is the number of data in the sequence; m is the number of knots (recurring data groups) in the sequence; $t_{i}$ is the width of the knot (the number of repeated data in the ith set of repeated data groups).

Using the bilateral trend test, the $H_{0}$ hypothesis is accepted when $\left| Z \right| \le Z_{1 - \alpha /2}$ at a given significant level α, i.e., the trend is not significant; otherwise, the $H_{1}$ hypothesis is accepted, i.e., $Z > Z_{1 - \alpha /2}$ indicates a significant upward trend of the series, and $Z < - Z_{1 - \alpha /2}$ indicates a significant downward trend of the series.

The spatial distribution of precipitation is clearly affected by the topography. Therefore, this study calculates the CC and Bias of the EAR5 and the observed precipitation data based on four elevation categories: plain areas below 1000 m, medium and high altitude areas from 1000 to 2000 m, higher altitude areas from 2000 to 3500 m, and ultra-high altitude areas above 3500 m.

In addition, the empirical orthogonal function (EOF) is also used to check the consistency of both datasets. In this method, the sampling error method are used to select the leading eigenvectors of EOF analysis and corresponding principal components. Meanwhile, for easier comparison, both ERA5 and observed data are forced to have the same leading modes, which are decided by the minimum number of leading modes between ERA5 and observed data.

Results and discussion

Results

The annual mean precipitation of ERA5 shows similar spatial distribution with that of the observation from 1979 to 2018 (Fig. 2). Figure 2 shows that the annual mean precipitation pattern gradually decreases from the southeast (> 2000 mm) to the northwest (< 200 mm) of China. This is mainly due to Chinese mainland is located in the typical Asian monsoon region where the monsoonal circulation significantly affects the spatial distribution of precipitation. However, the annual mean precipitation data of EAR5 varies from 13 to 2988 mm, which has a wider range than that of the observation. Meanwhile ERA5 overestimates the annual precipitation in the southeastern part of the Tibet Plateau compared to the observation. It is maybe caused by the sparse weather stations in northwest China and the limited observation cannot capture the precipitation pattern with enough details.

Figure 3 presents the seasonal and annual Bias, RMSE, CC, and density scatter plot for the ERA5 and observed precipitation data in Chinese mainland. The 40-year annual average precipitation from the ERA5 and observations data is highly correlated with the corresponding CC, RMSE and Bias values at 0.872, 288.23mm, and 22%, respectively. As can be seen in Fig. 3, the precipitation data of ERA5 are slightly higher than the ground observations, both on the annual and seasonal scale, but at the same time, the CCs all above 0.7. This indicates that there is a slight overestimation of the satellite precipitation data, but overall, the accuracy is high. For seasonal total precipitation, good performance is observed for spring precipitation, with a coefficient of determination CC of 0.858, a Bias of 16%, and an RMSE of 100mm. Winter precipitation has the lowest RMSE, with scare precipitation in this season. In contrast, the ERA5 data has a higher RMSE value (more than 150mm) and lower Bias (19%) in the summer, which indicates that the ERA5 data has a lower deviation from the observation data but a relatively higher error than that in other seasons. In general, the seasonal-scale precipitation data of the ERA5 has a good linear relationship with the observation, with slight overestimation in different seasons but with very small errors and high overall accuracy.

To better understand the differences between the zones, Table 1 lists the quantiles (5%, 50% and 95%) of statistical indicators (CC, RMSE, and Bias) at a grid scale in the seven study zones for the seasonal ERA5 precipitation data against to the observation. It shows that almost all divisions have higher CC and lower RMSE in the spring than in the summer, except for the SE region. In addition, the Bias in the summer in all seven zones is lower compared to that in the other three seasons. This indicates that ERA5 can adequately determine extreme precipitation. The highest CC occurs in the ENW region, but the lowest RMSE and Bias are in the N and NE regions, respectively. This maybe relates to their lower elevation and temperate continental climate, and the fact that weather stations are also more abundant in these areas, providing more accurate station data. The Tibetan Plateau region has the largest deviation regardless of the season, which means that ERA5 reanalysis data are weak for monitoring precipitation in the Tibetan region, and special attention should be paid to this region when applying the data. Generally, ERA5 precipitation data has reliability and high precision in ENW, N, and NE zones for all seasons, while higher Bias and RMSE are located in Tibet (Table 1). At the same time, Tibet has the lowest CC in the summer and winter.

Table 1 5%, 50%, and 95% quantiles of the CC (), Bias (%), RMSE (mm/month) for annual and seasonal precipitation data between ERA5 and observation in seven zones of China.

Full size table

To further estimate the performance of the ERA5 data in terms of the temporal and spatial patterns of the observed precipitation data in Chinese mainland, Fig. 4 presents the spatial maps of the correlation coefficient (Fig. 4(1)–(5)) and Bias (Fig. 4(6)–(10)) between the ERA5 and observations data. It is evident from the annual scale that the correlation coefficients are higher in SE, N, and NE China than in the other regions, and the value of CC is generally above 0.75, suggesting high reliability of the ERA5 precipitation data in these regions. The ERA5 data has a significantly higher CC values in eastern China than in western China. At the same time, the higher spring CC in all regions contributes more to the annual correlation coefficients, except in southeastern China. However, the CC values in most parts of the Tibetan Plateau and northwestern and southwestern China are below 0.6 in all seasons, indicating that the ERA5 data are in better agreement with the observational data at lower elevations in eastern China (CC>0.9) than at higher altitudes in the western region (CC<0.6).

The ERA5 data generally has a smaller Bias in southeastern China, mostly below ±20% (Fig. 4(5)–(10)). In contrast, the Bias in Tibet is mostly positive and generally higher than 100%. This indicates that the ERA5 precipitation data seems to greatly overestimate precipitation in this region. At the seasonal scale, the greatest seasonal differences in precipitation are observed between southwestern China and the Tibetan Plateau region (from 258 ± 20 mm in the summer to 40.2 ± 8.5 mm in the winter), followed by the southeastern region, with a 210.4 ± 98.5 mm difference in total annual precipitation; all other regions have differences of less than 50 mm, with summer and spring contributing more than the other seasons to the interannual differences (Fig. 4(12)–(13)). It indicates the ERA5 precipitation data can capture the spatial pattern of the observed seasonal precipitation; however, the ERA5 overestimates the seasonal precipitation in Tibet and SW regions, especially in spring and winter (Fig. 4(7) and (10)).

Figure 5 presents the performance of ERA5 precipitation data on different terrain (below 1000 m, medium and high altitude areas from 1000 m to 2000 m, higher altitude areas from 2000 m to 3500 m, and ultra-high altitude areas above 3500 m). It shows that the CC values decrease with increasing altitude, with the highest correlation in the plains and the lowest correlation at high altitudes in the Tibetan region. It indicates that the ERA5 precipitation data provides poor estimates of precipitation at high altitudes. Meanwhile, the Bias values increase with altitude and are all positively biased, indicating that the ERA5 precipitation data are overestimated at most pixels.

Figure 6 presents the change trend in annual and seasonal precipitation between the ERA5 and observations data. EAR5 can generally catch the seasonal changing trend pattern of the observation. However, the spatial patterns of changing trends show more inconsistencies between ERA5 and the observation in annual and seasonal scale. The ERA5 annual precipitation shows a clear downward trend in SE (−8.6 mm/year), N (−3.0 mm/year), and NE (−2.98 mm/year) of China, but observed precipitation shows an upward trend in these regions (3.95, 1.03, and 0.3 mm/year in the SE, N, and NE, respectively).

Figure 7 presents the spatial averaged SD and its changing trends for the ERA5 and observation data at the annual and seasonal scales. The SDs of the observation and ERA5 data show consistent trends at both the annual and seasonal scales, with the largest SDs in the summer and the smallest in the winter, indicating that precipitation has temporal variability. It should be noted that the ERA5 precipitation data typically displays a higher SD value than the observed precipitation data, varying from 88.3 mm in the winter to 309.5 mm in the summer. For 1979–2018, the trends of the SD values for the ERA5 and observation data at the annual scale are −12.25 mm and −8.64 mm per decade, respectively, indicating a decrease in the temporal variability in annual precipitation. Similar decreasing trends are observed in spring, autumn, and winter. The SD of summer precipitation shows a slight increasing trend in both the ERA5 and observation by 2.03 mm and 2.38 mm per decade, respectively.

As also can be seen in Fig. 7 that the annual and seasonal precipitation trends are basically similar during the period 1979–2018, both showing a decreasing trend, with only a slight upward trend in winter, rising by 0.4 mm per decade. The contributions of spring and autumn to the declining trend in annual precipitation were slightly larger than those of summer and winter, with changes of more than 3 mm per decade in spring and autumn and less than 2 mm per decade in summer and winter. The change in winter precipitation was the weakest, with 0.4 mm per decade for the observation data and 0.9 mm per decade for the ERA5 data. It is noteworthy that relative to the decreasing trend of 4.3 mm per decade in spring precipitation data from the observed data, the spring precipitation data from ERA5 decrease by 7.5 mm per decade and contribute the most to the decreasing trend in annual precipitation.

The top three leading EOF modes for the ERA5 and observed annual precipitation are presented in Fig. 8. The percentage of total variance explained by the top three main EOF modes exceeds 54% in the ERA5 data and 48% for the observed data. The first and second modes differ somewhat in the proportion of the total variance of the EOF modes corresponding to ERA5 and observed annual precipitation, while the third mode is similar (32.05% vs28.56% for EOF1. 12.4% vs 15.57% for EOF2. 7.99% vs. 7.17% for EOF3). Although EOF1 and EOF2 patterns for ERA5 annual precipitation are identical to those of the observation, they show significant differences in some areas of EOF3. The increasing trend of annual precipitation in almost all areas could be captures by EOF1 (positive values) of both ERA5 and observation, but ERA5 shows 10% more precipitation than observation on the southeast corner of the Tibetan Plateau.. For EOF2, the opposite region is also concentrated in the southeast corner region of the Tibetan Plateau, where observation shows negative values and ERA5 positive values. For EOF3, the annual precipitation of ERA5 decreases in the 25°N-30°N region with negative value, while the annual precipitation of observation increases with positive value. It indicates that It indicates that the spatial patters between the ERA5 and the observed precipitation are generally similar explained by the first and second EOF modes.

Discussion

Satellite reanalysis of precipitation data are widely used in hydrology, meteorology and climatology and their applications. However, it is difficult to quantitatively estimate the accuracy of satellite reanalysis of precipitation data due to their complex variability at spatial and temporal scales. This study estimates the seasonal and annual temporal and spatial performance of the ERA5 precipitation data against observational data in China from 1979 to 2018. The ERA5 precipitation data can capture the temporal-spatial pattern of the observed precipitation in China, which can be used for hydrological and climatic studies in China. There is good agreement between the ERA5 precipitation data and observational data at lower elevations (below 1000 m) in eastern China, but not at higher elevations in western China. ERA5 precipitation data overestimates the spring and summer precipitation. The overestimate also occurs in the southeastern part of the Tibet Plateau for annual scale. The overestimation may be related to inverse algorithms used for satellite products and inaccurate estimates of solid precipitation¹⁶. Unlike previous studies on specific regions or seasons^38,39, this study provides a deeper understanding of the continuous spatiotemporal performance of ERA5 precipitation data from different climate zones and temporal scales in Chinese mainland. We find that the annual and seasonal ERA5 precipitation performance varies spatially across the western, central, and eastern regions. These evaluations help to deepen our understanding of the uncertainty in the ERA5 precipitation data in different regions and seasons in China.

As a new generation of the ECMWF reanalysis data, ERA5 performs better than ERA-Interim in representing the precipitation variability with higher spatial and temporal resolution than ERA-Interim, providing quality precipitation data over space and time^{18,19,20,21,24}. This study finds that the ERA5 precipitation at lower elevations is estimated more accurately and the trends in precipitation at annual and seasonal scales are more consistent with observations, probably because ERA5 has much higher spatial and temporal resolution than ERA-Interim, providing higher quality precipitation data in both space and time²⁵. To obtain more meaningful climate variables, reanalysis systems often take into account ground pressure, 2 m air temperature, 2 m relative humidity and 10 m wind speed, all of which are observations considered within the reanalysis system that contribute to improved data quality⁴⁰. However, we must consider that although the data assimilation approach can improve data accuracy by adding physically meaningful information from the predictive model, it is still subject to uncertainty⁴⁰. For example, numerical simulations, assimilation schemes and errors in observation systems may affect the ability of ERA5 data to capture the actual climate. Therefore, some studies have shown that it is difficult to completely replace observational data information with reanalysis system information to reflect the true state of the atmosphere⁴¹. For example, Wang et al.²⁵ showed in their study that ERA-Interim reanalysis data are not suitable for long-term climate trend calculations. In this paper, we also note that seasonal and interannual changes in the ERA5 data are not entirely consistent with changes in the observational data.

Usually, the stations are located in plains or mountain valleys, which mean that interpolations on the surrounding alpine grid points need to be revised in terms of topography. The dataset used in this study, CN05.1, was implemented using ANSPLIN software, and the resulting revised coefficient is a uniform value across the application area, which varies somewhat depending on the number of sites used. This dataset still has the uncertainties which can also influence the validation result, although this dataset has been quality controlled and estimated by other studies^32,33,34,35. In the future, consideration could be given to interpolating the values separately in different regions after appropriate partitioning by climate characteristics. In addition, the use of reanalysis data to drive high-resolution regional climate models could be attempted and spatially and temporally varying terrain revision parameters could be analyzed in the simulation results and used to interpolate the observations. Therefore, we should also consider the uncertainty in the CN05.1 dataset when evaluating accuracy.

This study finds that the ERA5 precipitation data are in better agreement with the observational data at altitudes below 1000 m, with deviations close to 1% and a general CC more than 0.6. However, significant differences are observed in mountainous areas with an average elevation above 4000 m, especially on the Tibetan Plateau. This indicates that elevation-induced Bias may be the cause of uncertainty in the accuracy of the data, while the scarcity of meteorological stations at high altitudes may also lead to Bias in the generated gridded data. Therefore, when applied to Tibetan Plateau, the altitude correction and gridded data analysis should be taken into account^39,42,43,44. Zhao et al. analyzed the relationship between terrain correction and reanalysis of surface temperature errors between National Centers for Environmental Prediction–National Center for Atmospheric Research (NCEP–NCAR) and ERA-40 using meteorological station data and found that the deviation is usually proportional to the increase in local elevation and terrain complexity⁴⁵. Gao and Hao analyzed the difference between ERA-Interim elevation and observed station elevation and pointed out that elevation differences can affect the accuracy of the reanalysis data, especially in areas with higher elevations⁴⁶.

Conclusions

This study synthesizes and evaluates the performance of the ERA5 annual and seasonal precipitation data at different temporal scales and locations from 1979 to 2018 based on gridded observational data in China. The results show that the ERA5 precipitation data can capture the temporal-spatial patterns of the observed precipitation in China, with a generally high accuracy but slight overestimation of regional precipitation over Chinese mainland, especially in the summer. However, the trend variation of ERA5 precipitation data is dramatically different from the observed data in spatial distribution, both on annual and seasonal scales. The accuracy of the precipitation products is strongly correlated with the topographic distribution and climatic divisions. Furthermore, this study suggests that extra care should be taken to consider precipitation uncertainty at high altitudes when applying the ERA5 precipitation reanalysis data. These results help us to further understand the error sources, the rational application of the reanalysis products and the potential improvements for the next generation of products.

Data availability

ERA5 hourly data on single levels from 1979 to present are available from the ECMWF, please visit: https://cds.climate.copernicus.eu. The digital elevation data can be downloaded from the shuttle radar topography mission (SRTM) digital elevation model (https://eospso.gsfc.nasa.gov/missions/shuttle-radar-topography-mission).

References

Simpson, J., Kummerow, C., Tao, W. K. & Adler, R. F. The tropical rainfall measuring mission (trmm) sensor package. Meteorol. Atmos. Phys. 60(1–3), 19–36 (1996).
Article ADS Google Scholar
Colucci, R. R. & Guglielmin, M. Precipitation–temperature changes and evolution of a small glacier in the southeastern European Alps during the last 90 years. Int. J. Climatol. 35, 2783–2797. https://doi.org/10.1002/joc.4172 (2015).
Article Google Scholar
Jiang, R., Gan, T. Y., Xie, J., Wang, N. & Kuo, C. C. Historical and potential changes of precipitation and temperature of Alberta subjected to climate change impact: 1900–2100. Theor. Appl. Climatol. https://doi.org/10.1007/s00704-015-1664-y (2015).
Article Google Scholar
Li, R. W., Zeng, D. B., Yan, S. Validation of six satellite-derived rainfall estimates over China. Meteorol. Mon. 2015.
Qing, D. D., Song, F., Ning, S. W., Cheng, G. Error analysis of latest GPM remote sensing dataset for Huaihe and Haihe Basins. J. China Hydrol. 2018.
Kim, I. W., Oh, J., Woo, S. & Kripalani, R. H. Evaluation of precipitation extremes over the Asian domain: observation and modelling studies. Clim. Dyn. 52, 1317–1342 (2019).
Article Google Scholar
Huang, X., Soden, B. J., Jackson, D. L. Interannual co-variability of tropical temperature and humidity: a comparison of model, reanalysis data and satellite observation. Geophys. Res. Lett. 2015, 321, doi:https://doi.org/10.1029/2005GL023375.
Naz, B. S., Kollet, S., Franssen, H. J. H., Montzka, C., Kurtz, W. A 3 km spatially and temporally consistent european daily soil moisture reanalysis from 2000 to 2015. Entific Data 2020. doi:https://doi.org/10.1038/s41597-020-0450-6.
Jena, B., Kumar, A., Ravichandran, M., Kern, S., Dias, J. M. Mechanism of sea-ice expansion in the Indian Ocean sector of Antarctica: insights from satellite observation and model reanalysis. PLos One 2018, 13(10).
Qi, S., Yun-Hao, C., Jing, L. I. Inversion of PM2.5 concentration in Beijing based on satellite remote sensing and meteorological reanalysis data. Geogr. Geo-inf. Sci. 2018.
Gelaro, R., McCarty, W., Suárez, M. J. & Todling, R. The modern-era retrospective analysis for research and applications, version 2 (MERRA-2). J. Clim. 30(14), 5419–5454 (2017).
Article ADS Google Scholar
Kobayashi, S., Ota, Y., Harada, Y., Ebita, A., Moriya, M., Onoda, H., Onogi, K. The JRA-55 reanalysis: general specifications and basic characteristics. J. Meteorol. Soc. Jpn. Ser. II, 2015, 93(1), 5–48. https://doi.org/10.2151/jmsj.2015-001
Saha, S. et al. The NCEP climate forecast system version 2. J. Clim. 27(6), 2185–2208 (2014).
Article ADS Google Scholar
Tarek, M., Brissette, F. P. & Arsenault, R. Evaluation of the ERA5 reanalysis as a potential reference dataset for hydrological modelling over North America. Hydrol Earth Syst Sci 2020(24), 2527–2544. https://doi.org/10.5194/hess-24-2527-2020 (2020).
Article ADS Google Scholar
Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J. The ERA5 global reanalysis. Q J R Meteorol. Soc. 2020,146(730) https://doi.org/10.1002/qj.3803.
Jiang, Q., Li, W., Fan, Z., He, X., Sun W., Chen, S., Wen, J., Gao, J., Wang, J. Evaluation of the ERA5 reanalysis precipitation dataset over Chinese Mainland. J. Hydrol. 2020, 125660.
He, Q., Zhang, K. & Wu, S. Precipitable water vapor converted from GNSS-ZTD and ERA5 datasets for the monitoring of tropical cyclones. IEEE Access 99, 1–1 (2020).
Article Google Scholar
Uppala, S. M. D., D. P. et al. Towards a climate data assimilation system: status update of ERA-Interim. Meteorol. Sect. ECMWF Newsl. 115, 12–18. https://doi.org/10.21957/byinox4wot (2008).
Beck, H. E., Pan, M., Roy, T., Weedon, G. P. & Pappenberger, F. Daily evaluation of 26 precipitation datasets using Stage-IV gauge-radar data for the CONUS. Hydrol. Earth Syst. Sci. 23, 207–224. https://doi.org/10.5194/hess-23-207-2019 (2019).
Article ADS Google Scholar
Hersbach, H. et al. The ERA5 global reanalysis. Q. J. R. Meteorol. Soc. 146, 1999–2049. https://doi.org/10.1002/qj.3803 (2020).
Article ADS Google Scholar
Wang, C.; Graham, R.M.; Wang, K.; Gerland, S.; Granskog, M.A. Comparison of ERA5 and ERA-Interim near surface air temperature and precipitation over Arctic sea ice: effects on sea ice thermodynamics and evolution. Cryosphere Discussions 2018:1–28.
Zhang, Y., Cai, C., Chen, B. & Dai, W. Consistency evaluation of precipitable water vapor derived from ERA5, ERA-interim, GNSS, and radiosondes over China. Radio Sci. 54, 561–571. https://doi.org/10.1029/2018rs006789 (2019).
Article ADS Google Scholar
Albergel, C., Dutra, E., Munier, S., Calvet, J. C. & Balsamo, G. Era-5 and era-interim driven isba land surface model simulations: which one performs better? Hydrol. Earth Syst. Sci. https://doi.org/10.5194/hess-22-3515-2018 (2018).
Nogueira, M. Inter-comparison of ERA-5, ERA-Interim and GPCP rainfall over the last 40 years: process-based analysis of systematic and random differences. J. Hydrol. 2020, 583, 124632.
Wang, Z. et al. Evaluation of spatial and temporal performances of ERA-interim precipitation and temperature in Mainland China. J. Clim. 31, 4347–4365. https://doi.org/10.1175/jcli-d-17-0212.1 (2018).
Article ADS Google Scholar
Cai, D., Fraedrich, K., Sielmann, F., Guan, Y., Guo, S., Zhang, L., Zhu, X. Climate and vegetation: an era-interim and gimms ndvi analysis. J. Clim., 27(13), 5111–5118.
Yang, X., Zhang, M., He, X., Ren, L., Pan, M., & Yu, X., et al. Contrasting influences of human activities on hydrological drought regimes over China bas ed on high‐resolution simulations. Water Resour. Res. 2020, 56, doi:https://doi.org/10.1029/2019wr025843.
Yang, X., Zhang, L., Wang, Y., Singh, V. P., Xu, C., Ren, L., Yuan, F., Jiang, S.: Spatial and temporal characterization of drought events in China Using the Severity-Area-Duration Method. Water 2020, 12, 230.
Yang, X. et al. Bias correction of historical and future simulations of precipitation and temperature for China from CMIP5 models. J. Hydrometeorol. 19(3), 609–623. https://doi.org/10.1175/jhm-d-17-0180.1 (2018).
Article ADS Google Scholar
Wang, A., Lettenmaier, D. P. & Sheffield, J. Soil moisture drought in China, 1950–2006. J. Clim. 24, 3257–3271. https://doi.org/10.1175/2011JCLI3733.1 (2011).
Article ADS Google Scholar
Meng, X., Guo, J., Han, Y., Observatory, S.M. Preliminarily assessment of ERA5 reanalysis data. J. Mar. Meteorol. 2018.
Wu, j.; Gao, X.J. A gridded daily observation dataset over China region and comparison with the other datasets (in Chinese). Chin. J. Geophys. Chinese Edition 2013, 56, 1102–1111.
Hutchinson; F, M. ANUSPLIN Version 4.0 user guide. Centre for Resources and Environmental Studies. Canberra: Australian National University 1999.
Yuan, W. et al. Validation of China-wide interpolated daily climate variables from 1960 to 2011. Theor. Appl. Climatol. 119, 689–700 (2015).
Article ADS Google Scholar
Wu, J., Gao, X., Giorgi, F., Chen, D. Changes of effective temperature and cold/hot days in late decades over China based on a high resolution gridded observation dataset. Int. J. Climatol. 2017, 37.
Taylor; Karl, E. Summarizing multiple aspects of model performance in a single diagram. J. Geophys. Res. Atmospheres 2001, 106, 7183–7192.
Wang, S., Mo, X., Liu, Z., Baig, M. H. & Chi, W. Understanding long-term (1982–2013) patterns and trends in winter wheat spring green-up date over the North China Plain. Int. J. Appl. Earth Obs. Geoinf. 57, 235–244. https://doi.org/10.1016/j.jag.2017.01.008 (2017).
Article ADS Google Scholar
Chen, G., Iwasaki, T., Qin, H. & Sha, W. Evaluation of the warm-season diurnal variability over East Asia in recent reanalyses JRA-55, ERA-interim, NCEP CFSR, and NASA MERRA. J. Clim. 27, 5517–5537 (2014).
Article ADS Google Scholar
You, Q., Min, J., Zhang, W., Pepin, N. & Kang, S. Comparison of multiple datasets with gridded precipitation observations over the Tibetan Plateau. Clim. Dyn. 45, 791–806 (2015).
Article Google Scholar
Dee, D. P. et al. The era-interim reanalysis: configuration and performance of the data assimilation system. Q. J. R. Meteorol. Soc. 137, 553–597. https://doi.org/10.1002/qj.828 (2011).
Article ADS Google Scholar
Bengtsson, L. Can climate trends be calculated from reanalysis data? J. Geophys. Res. 2004, 109, doi:https://doi.org/10.1029/2004jd004536.
Feng, L., Zhou, T. Water vapor transport for summer precipitation over the Tibetan Plateau: multidata set analysis. J. Geophys. Res. Atmospheres 2012, 117.
Gevorgyan, A. Verification of daily precipitation amount forecasts in Armenia by ERA-Interim model. Int. J. Climatol. https://doi.org/10.1002/joc.3621 (2012).
Article Google Scholar
Hu, Z. Y. et al. Applicability study of CFSR, ERA-Interim and MERRA precipitation estimates in Central Asia. Arid Land Geography 36, 700–708 (2013).
Google Scholar
Zhao, T., Guo, W. & Fu, C. Calibrating and evaluating reanalysis surface temperature error by topographic correction. J. Clim. 21, 1440–1446 (2008).
Article ADS Google Scholar
Lu, G. & Lu, H. Verification of ERA-interim reanalysis data over China. J Subtrop. Resour Environ. 2, 75–81 (2014).
Google Scholar

Download references

Acknowledgements

The authors would like to thank the European Centre for Medium‐Range Weather Forecasts (ECMWF) for providing ERA5 reanalysis data, Thanks also to Nanjing University of Posts and Telecommunications. The authors would like to thank Linyong Wei for suggestions on data analyses.

Author information

Authors and Affiliations

School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing, 210023, Jiangsu Province, China
Donglai Jiao, Fan Yang & Ke Xu
Department of Communication and Information Technology, Nanjing University of Posts and Telecommunications, Nanjing, 210023, Jiangsu Province, China
Nannan Xu

Authors

Donglai Jiao
View author publications
Search author on:PubMed Google Scholar
Nannan Xu
View author publications
Search author on:PubMed Google Scholar
Fan Yang
View author publications
Search author on:PubMed Google Scholar
Ke Xu
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualization, D.L.J.and N.N.X.; methodology and data curation, K.X.; validation and formal analysis, F.Y.; writing—original draft preparation, N.N.X.. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Donglai Jiao.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jiao, D., Xu, N., Yang, F. et al. Evaluation of spatial-temporal variation performance of ERA5 precipitation data in China. Sci Rep 11, 17956 (2021). https://doi.org/10.1038/s41598-021-97432-y

Download citation

Received: 11 March 2021
Accepted: 24 August 2021
Published: 09 September 2021
DOI: https://doi.org/10.1038/s41598-021-97432-y

This article is cited by

Increasing probability of extreme rainfall preconditioned by humid heatwaves in global coastal megacities
- Poulomi Ganguli
- Bruno Merz
npj Climate and Atmospheric Science (2025)
Dynamics-constrained rainfall projection reveals substantial increase in population exposure to unprecedented floods in the North China Plain
- Long Yang
- Jinghan Zhang
- Fuqiang Tian
Communications Earth & Environment (2025)
A dataset for monitoring agricultural drought in Europe
- Guido Fioravanti
- Andrea Toreti
- Willem Maetens
Scientific Data (2025)
Leveraging ERA5-Land Reanalysis Precipitation Data for Urban Flood Vulnerability and Water Security Assessments: A Global Perspective
- Mohamed A. Aboelnour
- Alan F. Hamlet
- Feng-Wei Hung
Earth Systems and Environment (2025)
Variabilities in onset and withdrawal characteristics of Indian summer monsoon after 1998 climate shift: dynamical role of somali jet
- Vineet Sharma
- Amarjeet
- Arun Chakraborty
Theoretical and Applied Climatology (2025)