Background & Summary

Winter wheat plays an indispensable role in global food security, and its stable and high-yield characteristics make it a crucial source of grain reserves for many countries1,2. Approximately 75% of the world’s wheat production comes from winter wheat, a proportion that underscores its central role in the global food supply3,4. Against the backdrop of a continuously growing population, the stable supply of winter wheat is crucial to meeting global food demand. Winter wheat, with its growth cycle spanning four seasons, accumulates more nutrients such as proteins and lipids, making it a vital source of energy and nutrition. Its grains are rich in carbohydrates, cellulose, and minerals, supporting the staple food needs of 35%–40% of the global population2,5,6.

As the world’s largest producer of winter wheat, China cultivates this crop mainly south of the Great Wall, spanning regions including North China and the Huang-Huai-Hai area. Winter wheat production represents over 56% of China’s total wheat output6. In 2023, China’s wheat cultivation area reached approximately 23.6272 million hectares, with winter wheat accounting for the vast majority. Such large-scale cultivation has provided stable income sources for numerous farming households and driven the development of related agricultural industries, including seeds, fertilizers, pesticides, and agricultural machinery. This has significantly contributed to the prosperity of the rural economy7,8.

Climate change has had a significant impact on the growth and yield of winter wheat. Global warming has led to changes in the growing conditions of winter wheat, thereby affecting its yield and quality9,10. Rising temperatures have shortened the growth cycle of winter wheat, potentially resulting in insufficient accumulated thermal units during critical growth stages, thereby impairing normal development11,12. For instance, in some regions, the accelerated jointing and booting stages of winter wheat expose the crop to heat stress during late growth phases, negatively impacting grain filling and kernel plumpness13,14. In recent years, extreme weather events such as droughts, floods, heatwaves, and cold spells have become increasingly frequent, posing serious threats to the growth and yield of winter wheat15,16. For example, the late July 2023 rainstorm in Beijing triggered severe waterlogging in some fields, severely compromising crop growth of winter wheat17.

Accurate monitoring and regulation of winter wheat phenological stages are critical for ensuring high and stable yields, optimizing resource utilization, and adapting to climate change, representing a core research focus18,19. Phenological stages serve as the “biological clock” governing winter wheat’s responses to environmental variations18,20. Under the background of global warming, the phenological stages of winter wheat have generally advanced. For example, in North China over the past 30 years, the heading stage of winter wheat has occurred 5–8 days earlier on average, making it more vulnerable to late frost damage and high-temperature stress during the grain-filling period21,22. Investigating the dynamic interactions between phenological stages and temperature, precipitation, and photoperiod can provide a scientific basis for crop breeding and optimization of cultivation zoning, thus ensuring spatiotemporal matching between crops and climate resources. The physiological processes at different phenological stages directly govern yield components. For example, the number of effective tillers during the tillering stage influences panicle count, photosynthetic assimilation efficiency from jointing to heading stages determines grains per panicle, while the duration and intensity of the grain-filling stage correlate with thousand-grain weight23,24.

Moreover, critical phenological stages exhibit significant temporal correlations with meteorological disaster windows25,26. For instance, exposure to −2 °C temperatures post-jointing stage can induce panicle development impairment27, while prolonged rainy spells during flowering stage may trigger Fusarium head blight28. Dynamic monitoring of phenological stages, when integrated with meteorological forecasts, enables the establishment of region- and stage-specific disaster early-warning systems. The phenological shifts in winter wheat have been extensively utilized in assessing long-term impacts of climate change on agricultural systems29. European studies indicate that each 1-day advancement in heading stage due to warming increases yield fluctuation risks by 3%–5%30, while phenological shifts in winter wheat across the North American Great Plains have led to mismatches with pollinator activity periods31. Such research provides empirical support for cross-regional adaptive policy formulation, including sowing date adjustments and insurance mechanism design. Winter wheat phenology research serves as a pivotal hub connecting genetic potential, environmental responses, and agronomic practices, with its outcomes not only advancing crop science theory but also directly contributing to food security and sustainable agriculture.

The Huang-Huai-Hai Plain, China’s largest concentrated winter wheat production region, contributes over 60% of the nation’s total winter wheat output, serving as the core “breadbasket” for national food security32,33,34. This study employs kernel density estimation and linear trend analysis on winter wheat phenological data from 62 national ago-meteorological observation stations across the Huang-Huai-Hai Plain during 1981–2021. Using Origin Lab, 1120 illustrative figures were generated to characterize the interannual variations in winter wheat developmental stages and their durations, aiming to provide theoretical foundations and scientific support for climate change adaptation research and cultivation management in winter wheat production.

Methods

Agrometeorological observation stations and data

Our dataset includes the records collected from 1981 to 2021 of 62 agrometeorological observation stations in the HHHP, including Hebei, Henan, Shandong, Shanxi, Jiangsu and Anhui provinces, and Beijing and Tianjin Municipalities. These stations were maintained by the Chinese Meteorological Administration to investigate local crop production and agricultural meteorological disasters, as well as provide suggestions for local farmers to manage agricultural meteorological disasters. There were 2 stations in Beijing Municipalities, 2 stations in Tianjin Municipalities, 12 stations in Hebei province, 6 stations in Shanxi province, 8 stations in Jiangsu province, 8 stations in Anhui province, 12 stations in Shandong province, 12 stations in Shandong province, and 12 stations in Henan province (Fig. 1). All stations are located in a winter wheat primary production region, and represent the typical cropping system in the HHHP. They are geographically and climatologically different, and have good records of crop data during1981–2021.These stations provide a unique chance to investigate the long series phenology variation trend characteristics.

Fig. 1
Fig. 1The alternative text for this image may have been generated using AI.
Full size image

Spatial distribution of 62 agrometeorological observation stations included in the dataset.

The agrometeorological parameters, including average temperature (Tave), maximum temperature (Tmax), minimum temperature (Tmin), daily temperature range (Tdr), relative humidity (Rh) and precipitation (Pr), of these agrometeorological observation stations in recent 40 years were obtained from the official channel of the China Meteorological Administration, analyzing the historical change characteristics of these meteorological parameters (Fig. 2). In the past 40 years, the agrometeorological parameters of the observation stations in the HHHP had changed significantly, the Tave, Tmax, Tmin, and Pr had shown a significant upward trend, while the Dtr and Rh had shown a downward trend.

Fig. 2
Fig. 2The alternative text for this image may have been generated using AI.
Full size image

Characteristics of agrometeorological parameters of 62 agrometeorological observation stations.

Dataset construction process

The dataset construction process includes the following key steps, as introduced in Fig. 3:

  1. (1)

    Data collection: Historical data and electronic data were collected from official archives and official website of Chinese Meteorological Administration. Winter wheat observed data from 1981 to 2021 for these stations included phenology, yields, cultivar and agronomic management practices. The detailed phenological records, including the dates of wheat sowing(SE), emergence(EM), tillering(TI), jointing(JO), booting(BO), heading(HE), flower (FL), milking(MI) and maturity(MA).

  2. (2)

    Data process: Firstly, the collected data are categorized by year, site, crop, and phenology. Then, quality control methods—such as historical paper-based data review, expert judgment based on professional experience, estimation using adjacent years’ data, and threshold checks—are applied to fill in missing values and correct anomalies. Finally, the data undergo processing steps including day of year (DOY) conversion, calculation of growth period length (GLs), and statistical analysis (e.g., mean and standard deviation computation).

    We took January 1 of each year as 1, and converted the date of the universal period of each phenology period to the day of year (DOY). According to the growth and development characteristics of each stage, the whole growth period length (GLs) was divided into 5 stages, namely, emergence to tiller (EM-TI), tiller to jointing (TI-JO), jointing to flowering (JO-FL), flowering to maturity (FL-MA) and whole growth period (SE-MA, WGP).

  3. (3)

    Data analysis and scientific plotting: The data processing, the calculation and analysis of the DOY were all completed in Excel 2023, and the kernel density estimation, univariate linear regression estimation and its significance test and plotting are all completed in Origin Lab origin 2024.

Fig. 3
Fig. 3The alternative text for this image may have been generated using AI.
Full size image

Flowchart construction process the dataset winter wheat phenology dataset in HHHP.

Kernel density estimation

We used the kernel density estimation (KDE) method to estimate the winter wheat phenological records of these stations. The average value and standard deviation of 40 years were statistically analyzed. We used the results of kernel density estimation and statistical to produce scientific illustrations as the core of this dataset. KDE is one of the nonparametric test methods used to estimate the unknown density function in probability theory. It can estimate the population from samples. KDE does not need to use the prior knowledge of data distribution, does not attach any assumptions to the data distribution, and only studies the characteristics of data distribution from the data sample itself, which has been highly valued in the field of statistical theory and application. Compared with other density estimation methods such as histogram, the probability density obtained by KDE has better continuity and does not depend on the selected interval length. The basic expression of KDE function is Fig. 4:

$$f(x)=\frac{1}{nh}\mathop{\sum }\limits_{i=1}^{n}K\left(\frac{x-{x}_{i}}{h}\right)$$
(1)
$$h=1.059\cdot \delta \cdot \frac{1}{\sqrt[5]{n}}$$
(2)

Where, x1, x2, x3,…, xn represents the total number of DOY samples of n winter wheat at a certain phenology; K is the kernel function; H is bandwidth (automatically selected via Scott’s rule by Origin Lab 2024); δ is standard deviation; f(x) represents the probability density value of the DOY x estimated by KDE. The greater the value, the greater the probability that the DOY x is the DOY of a certain phenology.

Fig. 4
Fig. 4The alternative text for this image may have been generated using AI.
Full size image

Example dataset: Forty-year (1981–2021) variation characteristics of winter wheat phenology and its lengths at Huanghua Agrometeorological Observation Station, Hebei Province.

Univariate linear regression

The univariate linear regression method was used to analyze the variation trend of the DOY and its lengths. The univariate linear regression was estimated by the least square method. The model is as follows:

$$Y=kx+b$$
(3)

Where, Y represents the DOY or the GLs, and x is the year; b represents the intercept; represents the linear trend term, and k × 10 represents the 10-year trend of DOY or GLs. Negative k indicates that the phenology date is advanced or the length of GLs is shortened, and positive k indicates that the phenology date is delayed or the GLs is extended. Two tailed t-test analysis was used to determine the statistical significance of the change trend. The significance level for all regression analyses was set at α = 0.05. The regression analysis revealed p-values < 0.01 for key variables.

Data Records

The Variation of winter wheat phenology dataset in Huang Huai Hai Plain of China from 1981 to 2021 is available at ScienceDB35. The dataset is provided in JPG format estimated and plotted by Origin Lab. All the diagrams can be downloaded directly for free. The dataset profile is shown in Table 1.

Table 1 Dataset profile.

An example dataset is shown in Fig. 4. Figure 4 showed that the kernel density plots indicated the maximum probability density for SE, EM, TI, JO, BO, HE, FL, MI, and MA occurred on day 276, day 283, day 305, day 108, day 118, day 127, day 130, day 146, and day 157, respectively. The GLs EM–TI, TI–JO, JO–FL, FL–MA, and SE–MA exhibited maximum probability density at 22 days, 158 days, 24 days, 33 days, and 245 days, respectively. The linear regression results revealed the following trends: SE, EM and TI show significant delayed with rates of 3.7 days per decade (R2 = 0.414, P < 0.01), 3.5 d decade−1 (R2 = 0.344, P < 0.01), and 4.7 d decade−1 (R2 = 0.273, P < 0.01), respectively. JO, HE, and FL exhibit significant advancement with rates of −2.3 d decade−1 (R2 = 0.222, P < 0.01), −1.2 d decade−1 (R2 = 0.099, P < 0.01), and −1.2 d decade−1 (R2 = 0.111, P < 0.01), respectively. BO, MI, and MA showed no statistically significant changes. EM–TI and SE–MA shorten at rates of −7.1 d decade−1 (R2 = 0.367, P < 0.01) and −3.9 d decade−1 (R2 = 0.401, P < 0.01), respectively. JO–FL and FL–MA showed no significant trends.

Dataset Values

This dataset analyzed the four-decade temporal variations in winter wheat phenology across 62 agrometeorological stations in diverse climatic zones of the HHHP.

In scientific research, it revealed the spatiotemporal evolution patterns of winter wheat phenology under climate change, providing an empirical foundation for deciphering the driving mechanisms of temperature, precipitation, and other climatic factors on phenological shifts. It identifies latitudinal disparities and topographic influences on phenology, uncovered small-scale climate adaptation strategies, and supported global change ecology studies.

The integration of kernel density estimation and linear trend analysis in this dataset enabled the validation of crop model parameters, addressing the limitations of traditional mean-based analyses and improving model prediction accuracy, thereby enhancing simulations of winter wheat phenology under future climate scenarios.

In agricultural practice, this dataset guided agricultural timing by clarifying trends in key developmental stages (e.g., sowing, jointing, and flowering), helping farmers optimize planting schedules to mitigate meteorological disaster risks such as low-temperature rainfall or drought. By analyzing changes in GLs, it facilitated the selection of early-maturing or stress-resistant cultivars adapted to localized climate variability and informs regional winter wheat planting zonation. Furthermore, it aids in refining field management practices—for instance, adjusting fertilization strategies in response to shortened vegetative growth periods or intensifying pest monitoring during prolonged reproductive phases—to enhance yield and quality.

In summary, this high-precision, long-term dataset established a foundational framework for studying crop adaptability under climate change, integrating theoretical depth with practical applicability. Its multidimensional value not only enhances current agricultural productivity but also provided critical data support for addressing future climatic uncertainties in agricultural systems.

Technical Validation

The work largely benefited from agrometeorological observation stations in major grain production areas in China constructed and operated the China Meteorological Administration. Agrometeorological observation business of China began in the 1950s. In 2009, 653 national agrometeorological observation stations were identified, and in 2023, 642 national agrometeorological observation stations were adjusted, distributed in important grain production areas of provinces or municipalities across the country.

Many years of national standardized manual observation have accumulated a large number of valuable manual observation data. More than 1200 agrometeorological observers of the China Meteorological Administration are professional trained and well-trained scientific and technological personnel. Most of them have been engaged in agrometeorological observation for more than 10 years. The observation and recording process is scientific and rigorous. The observers regularly carry out intermediate quality inspection, cyclic evaluation of data quality, detection and correction of errors, and the data quality is guaranteed. The observation data are collected in strict accordance with the national unified “Specification for Agrometeorological Observation”36. Basis for determining winter wheat phenology are shown in Table 2. The observation section is selected in the field with an area of more than 0.10 hm2. According to the shape of the field, it is divided into four equal small areas as four repeated observation points, with conspicuous signboards inserted. 25 plants (stems) are taken from each of the four small areas for each observation to be observed synchronously, once every two days, and patrol observation is carried out at the end of each month. According to the percentage of the number of plants (stems) entering the phenology in the total number of plants (stems) observed, each phenology is divided into the initial stage of phenology (greater than or equal to 10%) and the universal stage of phenology (greater than or equal to 50%). The phenology date used in this study is the date of reaching the universal phenology period.

Table 2 Basis for determining winter wheat phenology.

Observational uncertainties in this dataset primarily may stem from:(1) Human judgment variability: subjective discrepancies in identifying crop growth stages (e.g., leaf emergence, tillering, or spike differentiation) across different observers. (2) Environmental influences: adverse weather conditions (e.g., rain or humidity) occasionally prevented natural opening of glumes during flowering, requiring manual intervention for accurate phenological assessment. (3) Instrumental limitations: Field measurements (e.g., internode length, kernel size) may exhibit minor errors due to manual tools (e.g., rulers) or sensor precision. (4) Historical data gaps: Missing records in archived paper-based datasets necessitated interpolation or expert validation.

To mitigate these uncertainties and enhance data reliability, we implemented a multi-layered quality control measure: (1) Training & Calibration: Observers were rigorously trained to follow Agrometeorological Observation to minimize subjective biases. Tools for measuring length were regularly calibrated. (2) Cross-Verification: Missing values were filled using adjacent years’ data or validated against historical records. (3) Expert Review: Ambiguous cases were resolved through consensus with agronomy experts. (4) Expert Checks: It mainly includes completeness checks, duplicate checks, cross-year consistency checks, station ID checks, temporal consistency checks, range checks, and internal consistency checks, among others.