Introduction

In recent years, research on land use and agricultural sustainability has increased worldwide1,2,3,4,5,6. Land is a crucial resource for human production and livelihood, and its proper utilization is essential for achieving food security, environmental protection, and economic development7,8,9. In the agricultural sector, the rational use and management of cropland are of significant importance in enhancing crop yields, reducing land degradation, and maintaining ecological balance10,11,12.

As an important agricultural country in Europe, Ukraine has vast cropland and rich agricultural resources13,14,15. However, with the continuous and escalating conflict between Russia and Ukraine, sudden and large-scale armed conflicts caused damage to Ukraine’s cropland and impacted crop production16,17,18,19, resulting in the emergence of large areas and different types of abandoned cropland20,21. Therefore, the study of abandoned cropland in Ukraine has become particularly important to understand the status quo, problems and restoration needs of cropland before and after the conflict22,23.

Over the past few decades, remote sensing technology has played a crucial role in land use and agricultural monitoring24,25,26,. Regarding abandoned cropland information extraction, methods can generally be classified into three categories: direct classification based on land features27,28,29,3031, comparative analysis based on two or multiple periods of land use types32,32,34, and change detection based on temporal remote sensing indices35,35,36,38. Among them, compared with the other two types of methods, the detection method based on time-series remote sensing index change can obtain more detailed information about abandoned cropland (such as the start time, duration, scope, intensity and spatial structure of abandoned cropland), and also has higher accuracy and advantages that the other two types of methods do not have34. Such methods usually use the time series change of Normalized Difference Vegetation Index (NDVI) to simulate and detect cropland crop growth, so as to distinguish active cropland from abandoned cropland39.

There are numerous methods to acquire temporal NDVI data. Data with high spatial resolution can be derived from Landsat or Sentinel satellite observations, but they often suffer from low temporal resolution and additional influences from cloud cover and atmospheric interference, limiting their application in large-area abandoned cropland detection34. In comparison, MODIS satellite sensor’s NDVI products, known for their high temporal resolution and superior quality control, have become a standard index for studying cropland abandonment and vegetation growth across large regions40,40,42.

Abandoned cropland is typically defined as land that has not been tilled for at least 2–5 years. However, considering the unique circumstances in Ukraine due to the ongoing conflict, this study defines abandoned cropland as land that has not been normally cultivated for at least one year. In detecting abandoned cropland under this definition, a long-term and continuous management patterns sequence of Ukrainian cropland is also generated43. The long-term sequence of cropland management patterns can serve as an indicator of land management intensity, allowing for further differentiation of abandoned cropland. Due to damaged agricultural infrastructure and refugee outflows, the post-conflict cropland management patterns in Ukraine can be classified as normal cultivation, unused, and unattended cropland. Here, “unused” refers to cropland with no sowing activities, while “unattended” indicates cropland that has been sown but lacks or has no management44,45. Both categories fall under the umbrella of abandoned cropland, complicating the application of commonly used land cover classification or change detection models due to the complex operational conditions46.

In this context, we develop a Dual-period Change Detection (DCD) method for the Ukrainian region. This method can not only detect unused cropland that occurred during the pre-conflict period (2015–2021) but also effectively identify cropland in unused or unattended states during the post-conflict period (2022–2023). Through the implementation and validation of this method, valuable information and scientific evidence will be provided for agricultural management, decision-making, and agricultural recovery in Ukraine and similar regions.

Materials and methods

Overview of the study area

Ukraine is located in Eastern Europe, with a total area of 603,700 square kilometers. It is an important agricultural country in Europe. The landscape of Ukraine is mainly composed of plains and hills, with plains accounting for about 95% of the land area. More than 60% of the land is used for cultivation, and Ukraine possesses two-fifths of the world’s black soil area, which is rich in organic matter and nutrients13,14,15. This fertile soil is highly suitable for crop growth, earning Ukraine the title of "Europe’s breadbasket." In addition to sharing a border with Russia, Ukraine is also adjacent to countries such as Poland, Belarus, Moldova, Romania, Hungary, and Slovakia. The geographical location and land use classification of Ukraine are shown in Fig. 1(Data sources and coordinate projection methods are described in Sect. “Data” and Sect. “Data preprocessing” respectively. Land classification results are derived from appropriate combinations of IGBP’s Global Vegetation classification Scheme).

Fig. 1
figure 1

Geographical Location and Land Use Classification of Ukraine.

In Ukraine, the main crops grown include barley, buckwheat, corn, millet, oats, rye, and wheat. The agricultural production follows two planting seasons: spring planting and autumn planting47. Spring planting starts from March and continues until May, focusing on crops such as corn, barley, oats, buckwheat, and millet, which are harvested in September of the same year. Autumn planting takes place in August each year, mainly for winter wheat and rye, which are harvested in July of the following year. Agriculture plays a significant role in Ukraine’s GDP, and it also has a crucial impact on global food supply and agricultural import and export48,49.

Data

The data used in this study mainly consists of five parts. It includes NDVI time series data, land use type data, medium and high resolution remote sensing image data, and vector data of Ukrainian administrative divisions.

NDVI time series data

The temporal NDVI data used in this study primarily comes from the MODIS Vegetation Index product MOD13Q1, which can be obtained from LAADS DAAC (https://ladsweb.modaps.eosdis.nasa.gov/). It includes 250 m resolution NDVI data and its quality control data, synthesized every 16 days using the MVC method. In this study, standard image products with strip numbers h19v03, h19v04, h20v03, h20v04 and h21v04 were used to establish multi-temporal remote sensing images from NDVI data of 23 time phases every year from 2015 to 2023.

Land use type data

The land use type data used in this study is the MODIS Level-3 Land Cover Type product MCD12Q1 released in 2020. This data is generated based on a decision tree algorithm and has an overall accuracy of approximately 75%50. It covers the entire globe and has five different land use classification schemes. The spatial resolution is 500 m, and it can also be obtained from LAADS DAAC.

Medium-resolution remote sensing image data

The medium resolution remote sensing image data in this study came from Landsat 8–9 satellite, which has a spatial resolution of 30 m, and can obtain continuous observation images at a certain time interval on a global scale. The data can be freely viewed or downloaded from the United States Geological Survey (USGS) EarthExplorer (https://earthexplorer.usgs.gov/), LandsatLook (https://landsatlook.usgs.gov/), and GloVis (https://glovis.usgs.gov/).

High-resolution remote sensing image data

The high-resolution remote sensing image data used in this study is Esri World Image, which can be freely viewed on EarthExplorer. This data includes multiple satellite sensors and data providers, including commercial satellite data providers such as DigitalGlobe, GeoIQ, and public data providers such as NASA. Overall, the data can be classified into high, medium, and low levels based on spatial resolution. High-resolution data typically has a resolution between 0.3 m and 2 m, medium-resolution data has a resolution between 2 and 30 m, and low-resolution data has a resolution above 30 m. It should be noted that the specific data resolution and availability may vary by region, as the image data in different regions may come from different satellites and data providers.

Administrative boundary vector data of Ukraine

The administrative boundary vector data includes country, state (municipality), and district (county) level administrative boundary vector data. It can be freely obtained from the Global Administrative Areas database (GADM) freely available public map data set (https://gadm.org/), version 3.6, with the WGS84 coordinate reference system.

Methodology

The DCD method flow is shown in Fig. 2 and includes the following steps:

Fig. 2
figure 2

Method flow of DCD.

1) Data preprocessing; 2) Crop curve reconstruction; 3) Region of interest identification; 4) Unused cropland detection; 5) Self-supervised classification; 6) Accuracy validation.

Data preprocessing

After collecting the MODIS NDVI and land use type data, data preprocessing is performed to ensure data quality and consistency. This includes band extraction, mosaic, reprojection, stitching, resampling, clipping, re-encoding, and masking operations.

First, the IGBP scheme classification result bands from the MCD12Q1 data product and the NDVI and quality control (SummaryQA) bands from the MOD13Q1 data product are extracted using the MODIS Reprojection Tool (MRT). Then, the bands are stitched, projected, mosaicked, and resampled based on their spatial positions. The spatial reference system used for stitching is the WGS84 datum, and the projection is performed using the Albers equal area projection (projection parameters are shown in Table 1). The resampling interval is set to 250 m.

Table 1 Parameter table of Albers projection.

Finally, the data is clipped and re-encoded based on the national boundary vector information from the administrative boundary data of Ukraine. This ensures the consistency of the bands in the Ukraine region. The spatial information of cropland is contained in the IGBP scheme classification result, and it can be highlighted separately through masking processing.

Crop curve reconstruction

To better characterize crop growth features using NDVI data, it is necessary to reconstruct high-quality NDVI curves, referred to as crop curves, that better align with crop growth logic. In this study, an open-source R package called “phenofit”51 is used for crop curve reconstruction, specifically utilizing its curve fitting functionality. Modifications were made to the package to suit the research requirements, including weight initialization, preprocessing of vegetation index inputs, curve fitting, and weight updating.

The reconstruction of the NDVI time series should rely primarily on information from high-quality points but should also extract limited information from edge points and poor-quality points52,53. In this study, weight initialization is based on the quality control (QC) information of the remote sensing images (the SummaryQA band in MOD13Q1). The NDVI observations are categorized into three classes: good, moderate, and poor, with corresponding weights of maximum (\({w}_{max}\)), middle (\({w}_{mid}\)), and minimum (\({w}_{min}\)). Actual NDVI values are usually greater than contaminated remote sensing observations54, indicating that contaminated observations can still provide valuable information, and \({w}_{min}\) should be greater than zero. If \({w}_{min}\)=0.0, the “poor” data points will be completely ignored. Therefore, in this study, \({w}_{max}\), \({w}_{mid}\), and \({w}_{min}\)are set to 1.0, 0.5, and 0.2, respectively, for weight initialization55,56.

The curve fitting approach is similar to weighted linear regression57, aiming to minimize the cost function J, defined as the weighted sum of squared errors, where weights play a regulating role and directly impact the curve fitting results. J is described as follows:

$$J = \sum \left( {y_{i} - \hat{y}_{i} } \right)^{2} w_{i}$$
(1)

In the equations, the subscript i represents the i-th element. y represents the original NDVI time series, ŷ represents the fitted results, and w represents the weights during the curve fitting process.

Due to the uncertainty of the input sequence values, preprocessing of the NDVI time series data is necessary. If the absolute difference between \({y}_{i}\) and \({y}_{mov,i}\) is greater than twice the standard deviation of the entire time series, the point is marked as a "spike." Here, \({y}_{i}\) is the i-th observed value of the input NDVI, and \({y}_{mov,i}\) is the corresponding moving average of three points. Spike values containing missing values are discarded since phenotypic fitting can handle unevenly spaced sequences.

Furthermore, the upper and lower bounds of the NDVI time series (referred to as ylu) are determined based on high-quality observations. They are primarily used to constrain contaminated NDVI values within a reasonable range, especially during non-drought growing seasons with significant snowfall. Points exceeding the ylu range are considered suspicious, and their values and weights are reassigned to \({y}_{min}\) (the lower limit of ylu) and \({w}_{min}\), respectively.

Commonly used curve fitting functions include weighted Savitzky-Golay (wSG)58, weighted Harmonic Analysis of Time Series (wHANTS)59, and weighted Whittaker (wWHIT)55. Typically, wSG is optimal for vegetation experiencing abrupt changes (e.g., deforestation or forest fires)60, wHANTS is suitable for vegetation with explicit and stable growing seasons59,61, and wWHIT is applicable to vegetation with dynamic changes during the growing season (e.g., planting in abundant or drought years)62. Compared to wSG and wHANTS, wWHIT is more robust against outliers55. It can capture vegetation seasonal signals even when more than 30% of the input NDVI time series is contaminated. Given the vast territory and diverse crop types in Ukraine, this study selects wWHIT as the default option for curve fitting.

After each iteration of curve fitting, a weight updating function is applied to adjust the weights of each point in the time series. Points located above or below the fitted curve are underestimated or overestimated. In the presence of contamination in the NDVI time series, the values are typically lower than normal63. Relatively higher values (points underestimated during the curve fitting process) are more likely to have better quality compared to lower values (points overestimated). Therefore, the weight updating function tends to increase the weights of underestimated points and decrease the weights of overestimated points to minimize the contamination impact and approach the upper envelope of the NDVI time series. Alternatively, it reduces the weights of outliers to minimize their interference. Outliers in phenotypic fitting are defined using the same strategy as the bisquare function in robust regression55.

Weight updating through the weight updating function enhances the robustness of phenotypic fitting in capturing NDVI vegetation seasonality signals. Advanced weight updating functions include the modified bisquare weight updating function (wBisquare)55, the weight updating function of TIMESAT (wTSM)64, and the weight updating function of Chen et al. (wChen)58. Among them, wBisquare has stable performance and is the default choice in this study.

Region of interest identification

Typically, unused cropland exhibit lower NDVI values during the growing season, while normal cultivation (with crop growth) display higher NDVI values35,42. Therefore, the identification and localization of regions of interest are fundamental for detecting land changes.

By analyzing the seasonal characteristics of crop plantings in historical periods in Ukraine, it is known that the crop planting calendar in Ukraine varies significantly based on geographical and crop type differences (USDA). Directly defining the growing season within fixed periods can lead to inaccurate detection results42. Moreover, the regions of interest that need to be considered when detecting cropland management patterns before and after conflicts also vary. Therefore, in this study, a region of interest identification module was utilized to first identify and label the regions of interest in crop growth curves, and then divide them into the growth and harvesting portions, as shown in Fig. 3.

Fig. 3
figure 3

Schematic representation of the region of interest.

First, based on the common seasonal characteristics of various crop plantings in Ukraine (USDA), the preliminary range of crop maturity season was defined as spanning from Julian day 129 (May 9th or May 10th in a leap year) to Julian day 240 (August 28th or August 29th in a leap year). Then, within this initial range of the crop maturity season, the maximum value was searched, and its corresponding date was considered as the crop maturity date. Finally, based on the crop maturity date, as well as the duration of the growth period and the harvesting period, the crop’s growing season and harvesting season were determined.

Due to variations in the duration of the growth and harvesting periods for different crops, this study shortened the interested region of the growing season to 64 days based on the seasonal characteristics of crops in the study area (where barley, with a growth period of two months, has the shortest growth period). Additionally, the interested region of the harvesting season was extended to 64 days (as the harvesting period for crops in the study area generally does not exceed two months).

Unused cropland detection

During the unused cropland detection phase, the algorithm only needs to focus on the crop growing season65. Typically, statistical indicators such as the mean and standard deviation of NDVI values during the growing season for different years are used. Then, a threshold segmentation is applied to the crop growth curve to detect unused years. However, through experimental investigation, it has been found that the standard deviation indicator can exhibit exploding (too large) or disappearing (too small) values during the actual detection process. Therefore, in this study, a relative ratio (percentage) indicator is used to determine confidence intervals and detect unused cropland.

First, let’s assume that the crop index C is correlated with the i (year) and the j (ordinal number of the observation point), as shown in Eq. (2):

$$C(i,j) = \frac{NDVI(i,j) - Mean\_NDVI(OtherYears,j)}{{Mean\_NDVI(OtherYears,j)}}$$
(2)

Here, \(NDVI(i,j)\) represents the NDVI value of the j-th observation point during the growing season of the i-th year, and \(Mean\_NDVI(OtherYears,j)\) represents the average NDVI value observed at the j-th observation point during the growing season of all other years from 2015 to 2021.

Then, the following rule is used to determine whether the land is unused:

$$\begin{gathered} IF\left( {\text{C(i, j) < N}} \right) \hfill \\ \{ \hfill \\ Unused = TRUE \hfill \\ {\text{\} }} \hfill \\ {\text{Where, j = 1, 2, }} \ldots {, 64} \hfill \\ \end{gathered}$$
(3)

Here, N is a constant. This study assumes that if the NDVI values for 64 consecutive days during the growing season, when compared to other years, consistently remain at a lower level, then the corresponding year (i) is a unused year.

It should be noted that the crop index C is greatly influenced by the average NDVI values of other years. However, the average NDVI values of other years may not be entirely reliable in certain cases, such as when there are unused years among the other years, which can cause a decrease in the average values. Therefore, this study introduces a threshold valve before determining unused status, as shown in Eq. (4), to adjust the threshold value N based on different situations.

$$\begin{gathered} IF\left( {Std\_NDVI(OtherYears,j)> 0.1} \right) \hfill \\ \{ N = - 15\% {\text{\} }} \hfill \\ ELSE \hfill \\ \{ N = - 25\% \} \hfill \\ \end{gathered}$$
(4)

Here, \(Std\_NDVI(OtherYears,j)\) represents the standard deviation of the six NDVI values for the j-th observation point during the growing season of other years from 2015 to 2021.

This study assumes that when the standard deviation of the NDVI values for other years is greater than 0.1, there is a high likelihood of unused years (with a decrease in the average values). Therefore, the threshold value is appropriately increased, and the accurate values of the threshold are determined through rough training with limited samples, which are −15% and −25%, respectively.

Self-supervised classification

Based on the unused cropland detection results, the crop curves corresponding to unused years are first removed from the pixels, allowing for statistical analysis of the crop curves for normal cultivation years. Since there are two possible representations of abandoned land post-conflict, namely horizontal and vertical patterns, neither a single dimension nor two dimensions can achieve binary segmentation for all three cases. This is illustrated in Fig. 4(A) and 4(B).

Fig. 4
figure 4

Unused cropland (A) and Unattended cropland (B) crop curve characterization.

To address this, a statistical quantity called the Crop Integral (S) is further constructed as a statistical indicator for binary segmentation. Its expression is given by Eq. (5):

$$S = \int_{first}^{finally} {NDVI{\text{d(t)}}}$$
(5)

Here, the lower bound of the integral (first) and the upper bound (finally) represent the first day of the growing season and the last day of the harvesting season in the region of interest (as shown in Fig. 4), and NDVI is the NDVI value corresponding to a specific time (t).

However, polynomial fitting requires significant computational power51. Therefore, in this study, a geometric approximation method is used to simplify the process of solving S, as shown in Eq. (6), thereby improving the detection efficiency.

$$S \approx \sum\limits_{i = 1}^{n} {NDVI_{i} }$$
(6)

Here, n represents the total number of periods between the growing season and the harvesting season in the remote sensing image, and \(NDVI_{i}\) represents the normalized vegetation index value for a specific period.

From the historical Crop Integral values (\({S}_{1},{S}_{2},\cdots ,{S}_{n}\)) of cropland, the mean statistical indicator (\(\overline{S }\)) is obtained. Then, based on the binary segmentation percentages (n, m), the confidence interval [a, b] for the sample values is further determined as the two thresholds for binary segmentation. The confidence interval is determined by (\(\overline{S }\)) and n, as shown in Eq. (7).

$$\begin{gathered} a = (1 - n)\overline{S} \hfill \\ b = (1 + m)\overline{S} \hfill \\ \end{gathered}$$
(7)

Here, n and m are empirical values for the binary segmentation, which need to be determined through comparative experimental analysis and evaluation66.

After binary segmentation, the cropland corresponding to a particular year is divided into three categories: high Crop Integral values, representing unattended cropland; normal values, representing regular cultivation; and low values, representing unused cropland. This is illustrated in Fig. 5.

Fig. 5
figure 5

Classification of cropland based on binary segmentation.

Accuracy validation

Due to the difficulties in conducting ground surveys of cropland in Ukraine67, the accuracy of the proposed method in this study is evaluated using a visual validation approach with high-resolution reference image and medium-resolution Landsat 8–9 images68. The accuracy assessment process mainly includes random sampling, visual interpretation, and metric calculation68,67,70.

Firstly, random sampling is performed on the results of abandoned cropland extraction based on year and category to ensure that the sample points are representative of different years and categories66. The number of sample points for each year and category is set to 50, resulting in a total of 1000 sample points. Then, based on the sample point information, the temporal and spatial positions of these sample points are marked on the remote sensing images. Visual interpretation is conducted on the multi-temporal images corresponding to each sample point. The change characteristics of cropland on the images are observed, and the management patterns corresponding to each sample point’s year is determined based on crop phenology characteristics. The results of visual interpretation are recorded. Finally, the visual interpretation results are compared with the algorithm extraction results, and the comparison results for each category of sample points are statistically analyzed to calculate accuracy metrics.

In this study, a confusion matrix is used to evaluate various metrics, including Producer Accuracy (PA), User Accuracy (UA), Overall Accuracy (OA), and Kappa coefficient. The confusion matrix is a two-dimensional table used to compare the classification results between visual interpretation and extraction results, as shown in Table 2.

Table 2 Confusion matrix.

Here, Overall Accuracy (OA) represents the proportion of correctly classified samples to the total number of samples. The formula for OA is shown in Eq. (8).

$$OA = \frac{a + e + i}{{a + b + c + d + e + f + g + h + i}}$$
(8)

Overall Accuracy measures the classification accuracy of the model across all categories and is a commonly used evaluation metric. However, when the sample sizes of different categories are imbalanced, the accuracy may not fully reflect the classification results. Therefore, other evaluation metrics need to be used to comprehensively assess the classification results.

The Kappa coefficient is a measure of consistency used to evaluate classification results. For classification problems, consistency refers to whether the model’s predicted results align with the actual classification results. OA directly reflects the proportion of correct classifications, is easy to calculate, but when there is class imbalance during classification, some categories may have low values even when OA is high. The Kappa coefficient can reflect this issue, as it decreases significantly when there are poor results for a particular class. The formula for Kappa is shown in Eq. (9).

$$\begin{gathered} Kappa = \frac{{p_{O} - p_{e} }}{{1 - p_{e} }} \\ p_{O} = OA \\ p_{e} = \frac{(a + d + g) \times (a + b + c) + (b + e + h) \times (d + e + f) + (c + f + i) \times (g + h + i)}{{(ab + c + d + e + f + g + h + i)^{2} }} \\ \end{gathered}$$
(9)

In addition to evaluating the accuracy and consistency of the overall classification results, it is also necessary to evaluate the classification results for individual categories. Producer Accuracy (PA) and User Accuracy (UA) are commonly used metrics to evaluate the classification results of a specific category from two perspectives.

Producer Accuracy (PA), also known as mapping accuracy, refers to the ratio of the total number of pixels correctly assigned to a specific class (diagonal value) to the total number of pixels assigned to that class by the classifier (column sum). Taking unused cropland as an example, the formula for PA is shown in Eq. (10).

$$PA(Unused) = \frac{e}{b + e + h}$$
(10)

The corresponding metric is User Accuracy (UA), which represents the ratio of the number of pixels correctly classified into a specific class by the classifier (diagonal value) to the total reference number of pixels for that class (row sum). Using unused cropland as an example, the formula for UA is shown in Eq. (11).

$$UA(Unused) = \frac{e}{d + e + f}$$
(11)

In summary, by calculating metrics such as Producer Accuracy (PA), User Accuracy (UA), Overall Accuracy (OA), and Kappa coefficient, the performance of the model in abandoned land detection can be comprehensively evaluated.

Results

Pre-conflict period

Spatial and temporal distribution of unused cropland

Based on the cropland management patterns sequence of Ukraine’s cropland from 2015 to 2021 output by the unused cropland detection module of the DCD method, the map visualization of unused cropland distribution is shown in Fig. 6(A) - 6(G).

Fig. 6
figure 6

Spatial distribution map of unused cropland in Ukraine from 2015(A) to 2021(G).

Overall, the trend of unused cropland area in Ukraine is relatively unstable, with significant changes between different years. Among them, the largest unused cropland area was in 2018, approximately 1.046 million hectares, while the smallest was in 2021, around 0.222 million hectares.

In terms of spatial distribution, unused cropland in Ukraine from 2015 to 2021 is predominantly located in the southern region, particularly near the Crimea region. To further explore the spatial distribution characteristics of unused cropland, this study extracted the cropland pixels that remained unused for two or more consecutive years during this period. The results are shown in Fig. 7.

Fig. 7
figure 7

Time and spatial distribution of continuous unused cropland in Ukraine from 2015 to 2021.

By analyzing the unused cropland pixels, it was found that there were approximately 138,000 hectares of cropland in Ukraine from 2015 to 2021 that exhibited continuous unused behavior for two or more years. Among them, over 95% of the continuous unused cropland pixels remained unused for two consecutive years, while the proportion of pixels with continuous unused for multiple years (three years or more) was relatively small. The periods of continuous unused were concentrated mainly in 2017–2018, 2018–2019, and 2019–2020, with the majority of continuous unused cropland pixels located in the southern region of Ukraine. There was a certain clustering effect of continuous unused cropland pixels depending on the time period.

Accuracy of unused cropland detection

The accuracy assessment of the unused cropland map in Ukraine from 2015 to 2021 yielded various accuracy metrics, as shown in Table 3.

Table 3 Accuracy results of unused cropland detection in Ukraine from 2015 to 2021.

According to the accuracy validation results, the Producer Accuracy (PA) for identifying normal cultivation cropland ranged from 0.94 to 0.98, while the User Accuracy (UA) ranged from 0.76 to 0.91. This high level of accuracy in detecting normally cultivated cropland plots provides a solid foundation for the detection of unused cropland.

Regarding unused cropland detection, the Producer Accuracy (PA) values ranged from 0.70 to 0.93, while the User Accuracy (UA) values ranged from 0.95 to 0.98. This indicates that the algorithm also demonstrates high accuracy in detecting unused cropland.

Overall, from 2015 to 2021, the Overall Accuracy (OA) values ranged from 0.83 to 0.93, and the Kappa coefficient (Kappa) values ranged from 0.66 to 0.86. This means that the algorithm not only exhibits high accuracy in the overall unused cropland detection task but also demonstrates high consistency in unused cropland detection.

In summary, the unused cropland detection module of the DCD method proposed in this study generally exhibits high accuracy and consistency. It can effectively identify normally cultivated cropland and unused cropland, laying the foundation for the self-supervised classification module of this method.

Post-conflict period

Binary segmentation thresholds selection and accuracy

To explore the optimal binary segmentation thresholds (n and m), this study conducted comparative experiments and analysis on the value of n. Three sets of tests were performed with n values set at 15%, 20%, and 25% respectively. The results of the comparative experiments are recorded in Table 4.

Table 4 Comparative experiment results for unused cropland detection in Ukraine in 2022–2023.

For normally cultivated cropland and unused cropland, each set of tests calculated the metrics for 2022, 2023, and the overall two-year results separately. According to the calculated metrics, it can be seen that the experiment with n = 20% generally achieved higher Producer Accuracy (PA) and User Accuracy (UA) in most cases.

In 2022, the experiment with n = 20% exhibited relatively high PA and UA for both normally cultivated cropland and unused cropland. The PA and UA for normally cultivated cropland were 0.98 and 0.83, respectively, while for unused cropland, they were 0.8 and 0.98, respectively. In 2023, the experiment with n = 20% also achieved high PA and UA, with both normally cultivated cropland and unused cropland reaching PA and UA values of 0.96. Therefore, the overall two-year results also favored the n = 20% experiment, with PA and UA for normally cultivated cropland at 0.97 and 0.89, respectively, and for unused cropland at 0.88 and 0.97, respectively.

Furthermore, the overall accuracy and Kappa coefficient of the n = 20% experiment were higher compared to the other comparative groups. In 2022, the overall accuracy and Kappa coefficient of the n = 20% experiment were 0.89 and 0.78, respectively. In 2023, they were 0.96 and 0.92, respectively.

Therefore, using n = 20% achieved better results in terms of accuracy and consistency compared to n = 15% and n = 25%. These experimental results indicate that selecting n = 20% as the threshold in the self-supervised classification of unused cropland in Ukraine in 2022–2023 can yield better results.

As for the value of m, in distinguishing whether cropland is under normal cultivation, due to the inability to effectively validate some cases of unattended practices (e.g., delayed harvested crops and delayed plowing) through visual assessment methods, the determination of the threshold had to rely on the verifiable portion (evidence of unharvested crops and lack of plowing). This study conducted four sets of tests with m values set at 15%, 20%, 25%, and 30% respectively. Random sampling was performed from the results of unattended cropland detection in each group, and the number and percentage of samples with no harvest or plowing were counted. The results of the comparative experiments are recorded in Table 5.

Table 5 Control test of unattended test results in Ukraine in 2022–2023.

Based on the experimental results, when m was set to 20%, the percentage of samples with no harvest or plowing in 2022 was 34%, in 2023 it was 38%, and the overall two-year percentage was 36%, which was the highest among all the experimental groups. This indicates that setting m as 20% in this experiment can better detect unattended cropland.

Furthermore, through analysis, it can be observed that the percentage of samples with no harvest or plowing in 2023 was significantly higher than in 2022 for all experimental groups. This suggests that unattended cropland in Ukraine showed an expanding trend in 2023 compared to 2022.

In summary, when the two thresholds (n and m) in the self-supervised classification module are both 20%, better results can be obtained in the detection of abandoned cropland in Ukraine in 2022–2023. The overall accuracy (OA) of unused cropland was 93%, and the Kappa coefficient was 0.85. In the aspect of unattended cropland detection, it can ensure that more than one-third of the cropland in the test results is unattended cropland with no harvest and ploughing, and the remaining two-thirds are abnormal cropland whose crop Integral is significantly higher than that of previous years (It is very likely to be an unattended cropland that lacks management).

Distribution of abandoned cropland

The spatial distribution of abandoned cropland in Ukraine for the years 2022–2023, as outputted by the self-supervised classification module in the DCD method with thresholds of n = 20% and m = 20%, is shown in Fig. 8(A) and 8(B).

Fig. 8
figure 8

Spatial distribution of abandoned cropland in Ukraine. (A) 2022, (B) 2023.

Overall, there was a significant increase in the quantity of abandoned cropland in Ukraine during the years 2022–2023 compared to the period of 2015–2021. In 2022, the area of unused cropland was approximately 1.257 million hectares, unattended cropland covered around 0.462 million hectares, resulting in a total of approximately 1.72 million hectares of abandoned cropland. In 2023, the area of unused cropland decreased to approximately 0.404 million hectares, while unattended cropland expanded to approximately 1.002 million hectares, resulting in a total of approximately 1.407 million hectares of abandoned cropland.

Spatial analysis of the unused cropland parcels in Ukraine during 2022–2023 reveals that in 2022, the unused cropland in the southern region of Ukraine still exhibited an aggregation effect, with a notable spatial diffusion trend compared to 2015–2021. The expansion of unused cropland showed a spreading pattern from the southern region to other parts of the country. However, in 2023, the trend of unused cropland in Ukraine shifted from diffusion to dissipation, mainly concentrated in the eastern part of the country and distributed more sparsely.

Conversely, the unattended cropland in Ukraine was primarily distributed in the eastern region in 2022, with a relatively scattered distribution. However, in 2023, there was a clear clustering effect observed in the unattended cropland in the eastern region of Ukraine, with a significantly expanded range compared to 2022.

In conclusion, the area of abandoned cropland in Ukraine nearly doubled in 2022 compared to previous years and was distributed throughout the country. Although there was a slight decrease in the area of abandoned cropland in 2023, it remained at a high level and exhibited a clustering trend compared to previous years.

Discussion

Contributions of the current study

The contributions of this study mainly include:

1. In this study, the distribution of cropland operation patterns across Ukraine in 2015–2023 is presented in detail, with trend analysis of the area and rate of abandonment. About 90% of the 27 administrative districts show an increasing trend in the area of abandonment in 2022. Among them, a significant increase in the rate of abandonment of cropland was observed in the Crimea, Donetsk, Kherson, Odessa and Zaporizhzhya oblasts. This is shown in Fig. 9.

Fig. 9
figure 9

Map of changes in abandonment area before(2015–2021 average) and after(2022) the conflict in Ukraine.

2. Currently, little is known about the ecological and social impacts of abandoned cropland across Ukraine. The map of abandoned cropland developed in this study provides a database that can be used by researchers to explore the factors influencing the abandonment of cropland in Ukraine or to analyze the potential risks of abandonment in Ukraine. For example, as can be seen in Fig. 10(A) Fig. 10(B), the area of abandoned cropland in Odessa Oblast experienced a significant increase followed by a decrease during 2022–2023. Possible reasons for this include 1) the shift of the main areas of the Russian-Ukrainian conflict; 2) the adjustment of Russia’s policy of blockading the Black Sea; and 3) changes in Ukraine’s agricultural policy.

Fig. 10
figure 10

Statistics on changes in the area of abandoned cropland in Ukraine by region. (A) Comparison of 2015–2021 average with 2022, (B) Comparison of 2022 with 2023.

3. This study formulated rules for discriminating cropland management patterns based on the DCD method. This is of some guiding significance for the automatic identification of abandoned cropland in other regions using remote sensing data. Among them, this study introduced the region of interest identification module to adaptively divide the focus area of time series NDVI, which makes the detection method more accurate, flexible and scientific, and improves the detection efficiency. The unused cropland detection module makes the detection mechanism more realistic by introducing a threshold valve, which effectively improves the adaptive ability and detection accuracy. The self-supervision detection module can effectively distinguish different types of abandoned cropland through binary segmentation on the basis of the results of unused cropland detection. In conclusion, the above three modules can provide a convenient path for other researchers engaged in large regions and multiple types of cropland.

Comparison with other studies conducted in Ukraine

Several researchers have already extracted localized areas or some types of abandoned cropland in Ukraine based on remote sensing data sources. For example, Skakun et al.71 used freely available satellite data and machine learning methods to map cropland in 2013 and 2018 in southeastern Ukraine and further derived the corresponding changes in cropland area. Qadir et al.72 used Sentinel-1 (S1) synthetic aperture radar (SAR) imagery to quantify between 2021–2022 the Changes in sunflower acreage in Ukraine. These two studies relied only on data from two periods to identify abandoned cropland, making it difficult to distinguish between abandoned and fallow land.

Deininger et al.73 used 4 years of satellite imagery (2019–2022) of 10,125 Ukrainian VDCs to estimate the impact of the Russian-initiated war on the area and the expected yields of winter crops aggregated from the field level. He et al.74 used the Sentinel dataset based on the GEE platform using the time-weighted dynamic time regularization (TWDTW) method to map the distribution of abandoned lands in six oblasts of eastern Ukraine. Whereas, the area discussed in this paper is the whole territory of Ukraine and the overall accuracy of the maps for cropland categorization in this study ranges from 83 to 96%. In the study conducted by Skakun et al.71, the accuracies of estimation for stabilized non-cropland, stabilized cropland, increased cropland, and decreased cropland were 97.1 ± 1.0%, 96.8 ± 1.2%, 24.6 ± 3.9%, and 40.1 ± 3.7%, respectively, and the overall accuracy for abandoned cropland was 92.5% in the study conducted by He et al.74. It can be seen that the accuracy of the DCD method is comparable and even better than existing research methods, further proving that the method in this paper is suitable for large-scale studies.

Limitations of the current study

The proposed DCD method performs well in detecting abandoned cropland in Ukraine. However, there are several limitations and considerations regarding its applicability, as outlined below.

Time scale of targets

The DCD method proposed in this study aims to detect and segment the management patterns sequence of cropland, with the detection object’s temporal scale being annual. This limitation results in a lack of theoretical models and detection mechanisms for long-term unused cropland.

Under natural conditions, long-term unused cropland often undergoes a series of secondary succession from bare land to forests or grasslands over time39. If the initial period of long-term unused cropland falls within the detection range, the proposed method can only detect the starting year or the first few years of unused and classify them as unused cropland. Therefore, special attention should be given to continuous unused cropland in the detection results, as there is a high probability that they represent long-term unused cropland. However, if the initial period of long-term unused cropland is outside the detection range, there is a high probability that the proposed algorithm will classify such land as normal cultivated cropland.

Therefore, the detected abandoned cropland in Ukraine obtained using the DCD method proposed in this study only includes unused cropland and unattended cropland. This may lead to the possibility of underestimating the actual extent of abandoned cropland in Ukraine. For the detection of long-term unused cropland, a better solution would be to use the CCDC (Continuous Change Detection and Classification) algorithm to detect and segment different stages of cropland, as this approach can better describe and detect the sequences and characteristics of long-term unused cropland75.

Spatial scale of targets

The DCD method proposed in this study requires high time resolution remote sensing data due to its fine temporal scale. However, the existing data of this kind have relatively low spatial resolution, which limits the applicability of the proposed method in terms of spatial scale.

As shown in Fig. 11(C), although the detection results of the proposed method generally align with the shape of the land parcels, there are many jagged patterns along the boundaries. Overlaying the results with high-resolution imagery reveals that this is mainly caused by the low spatial resolution of the data. Ukrainian cropland has larger plot sizes compared to cropland in other regions of the world. Therefore, the proposed method exhibits higher accuracy and better performance in detecting abandoned cropland in Ukraine. However, if this method is applied to regions with a high degree of fragmented cropland, such as mountainous areas in China, the accuracy and performance of the detection will be significantly reduced. To apply this method to regions with a high degree of fragmented cropland, apart from employing strategies like cloud removal76,75,76,79and image fusion80,79,82, techniques such as mixed pixel decomposition83,84and super-resolution reconstruction85,86 offer potential to enhance the applicability at the spatial scale.

Fig. 11
figure 11

The detection results were compared with local details of ground conditions. (A) Map of detection results of abandoned cropland in Ukraine, (B) local enlarged map of results, (C) comparison of results with ground images, (D) ground images.

Conclusion

Firstly, this study obtained reliable temporal and spatial distribution information of abandoned cropland before and after the conflict in Ukraine (2015–2023). Through stratified random sampling and comparison with the visual interpretation results of medium and high resolution images, the accuracy is between 83 and 96%.

Secondly, a DCD method is proposed in this study. The region of interest identification module in the method accurately located the mature period of crops and divided them into growth and harvest periods, effectively improving the accuracy of abandoned cropland detection. The unused cropland detection module in the method could be used independently for unused cropland detection with high accuracy. This module only focused on the curves during the crop growth period, enhancing the efficiency of unused cropland detection. The self-supervised detection module in this method can effectively distinguish the two kinds of abandoned cropland, unused and unattended.

Finally, this study reveals the changes of abandoned cropland before and after the conflict in Ukraine. Through spatial visualization and analysis of abandoned cropland information in Ukraine from 2015 to 2023, it can be concluded that unused cropland in Ukraine during 2015–2021 was mainly distributed in the southern region, with approximately 138,000 hectares of cropland being unused for two or more consecutive years. In 2022, the unused cropland area was approximately double the average unused cropland area before the conflict, and it was widely distributed. Unattended cropland reached 462,000 hectares and was mainly concentrated in the eastern part of Ukraine. In 2023, compared to 2022, the unused cropland area decreased by 67.8%, while the unattended cropland area increased by 116.7%. Both types of abandoned cropland showed spatial clustering, with major clusters in the Crimea region, Kherson Oblast, Zaporizhzhia Oblast, and Donetsk Oblast.

For future research, in addition to further improving the DCD method, it would be beneficial to explore the driving factors of cropland abandonment in Ukraine87. This would help uncover the mechanisms between human factors and cropland abandonment, contributing to the development of a coupled model for the entire process of cropland abandonment in Ukraine and providing valuable insights for the study of abandoned cropland in other regions with similar characteristics.