Introduction

Porcine pasteurellosis is an infectious disease caused by Pasteurella multocida (P. multocida), which is characterized by septicemia and hemorrhagic inflammation1. This pathogen invades the nasal cavity and respiratory tract of pigs, causing pneumonia, focal infection and abscess2,3. P. multocida can be co-infected with a variety of pathogens (e.g., Actinobacillus suis, Salmonella choleraesuis, Hemophilus parasuis, Streptococcus suis, and Bordetella bronchiseptica to increase the prevalence of respiratory diseases in pigs4. Porcine pasteurellosis causes serious economic losses to animal feeding and hinders the healthy development of the pig breeding industry5.

The occurrence of infectious diseases is usually caused by the interaction of various influencing factors6. China has a vast territory, and nature, climate, transportation and breeding management are diverse. Many factors, such as climate, geography, and transportation, may affect the outbreak of porcine pasteurellosis. Porcine pasteurellosis occurs mainly in summer in the south-central region of Chinese mainland7. The pig breeding industry in China is mainly concentrated in the central and southern regions, and this area has similar climatic characteristics, namely high temperature, humidity and precipitation. Complicated transportation and poor management methods have increased the occurrence and spread of diseases. Therefore, it is important to combine multiple types of risk factors, simulating high-risk areas of infectious diseases. Many models, including multi-criteria decision analysis (MCDA), maximum entropy, boosted regression tree are used to explore the relationship between animal infectious diseases and multiple influencing factors, showing good performance in practical applications8,9,10.

MCDA is a mathematical method with important decision-making value. Geographic information system (GIS)-based MCDA is a process that combines the value of geographic data with the preferences of decision makers and translates them into decision maps11. MCDA is often used in land suitability assessments, environmental planning and infectious disease risk assessments12,13,14. In infectious disease studies, MCDA can clearly show the spatial risk area of the disease by GIS, further understanding the complexity of the spatial risk factors of disease occurrence and epidemic characteristics of the disease13. Paul et al.15. produced a Highly Pathogenic Avian Influenza (HPAI) H5N1 risk map in poultry based on MCDA. The assessment in Cambodian showed that Tonlé Sap watershed was identified as an area of high HPAI H5N1 risk with localized hotspots around the cities of Battambang and Kampong Cham, and in the lower part of the basin bordering Vietnam. Thanapongtharm et al.16. used natural host factors, intermediate host type and environmental factors based on MCDA to determine that the risk areas for Nipah virus of pigs were mainly distributed around bat colonies. Tran et al.17. mapped areas suitable for Rift Valley fever amplification and transmission in East African countries (Uganda, Kenya, Ethiopia, and Tanzania) based on MCDA to provide a reference for disease control and risk identification. Alimi et al.18. assessed the risk of malaria in northern South America and found to be higher in rivers and coastal areas.

In the transmission of infectious diseases, a large amount of data related to infectious diseases has spatial distribution characteristics, and this characteristic becomes the premise of GIS application. Constructing a GIS-based surveillance system can effectively manage and monitor infectious diseases, and can analyze the characteristics and spreading trend of epidemics based on relevant geospatial information. GIS technology has been favored by many researchers for its spatial analysis capability. In the monitoring of diseases and pests in cotton, locust prevention and control, and prediction of avian pasteurellosis, it showed that the application of WebGIS to establish a monitoring and early warning system has a good development prospect19,20,21. Meanwhile, the application of GIS technology in the health field is shown to be an inevitable trend for automation and disease information management22.

The aim of this study was to develop a system to predict risk areas of porcine pasteurellosis. The main steps can be divided as follows: 1) Developing a predictive model based on MCDA and assessing risk areas for porcine pasteurellosis in the Chinese mainland. 2) Constructing of a WebGIS-based prediction system that considers user requirements and purposes.

Materials and methods

Porcine pasteurellosis outbreak data

According to the Ministry of Agriculture and Rural Affairs of the People’s Republic of China, main outbreaks of porcine pasteurellosis in the Chinese mainland were observed in the central and southern China from 2008 to 2020 (http://www.moa.gov.cn/). Sichuan, Chongqing and Guangxi had the most severe outbreaks (Fig. 1a). Between 2008 and 2015, outbreaks of porcine pasteurellosis showed an increasing trend. Since 2015, outbreaks have shown a downward trend (Fig. 1b). Summer (June, July and August) was the main outbreak season for the disease (Fig. 1c).

Fig. 1
figure 1

The outbreaks of porcine pasteurellosis in the Chinese mainland from 2008 to 2020. a number of porcine pasteurellosis outbreak in various provinces. b annual number of porcine pasteurellosis outbreak. c monthly number of porcine pasteurellosis outbreak.

Study framework

In this study, a predictive model for porcine pasteurellosis was developed based on MCDA. According to relevant research, seven spatial risk factors of porcine pasteurellosis were determined7,23. The research framework as shown in Fig. 2 and the specific steps are as follows:

  1. 1.

    Different influencing factors have different attributes and units, which need to be standardized for all data. The fuzzy membership function was used to complete the standardization of the data with values between 0 and 110.

  2. 2.

    Principal component analysis (PCA) was used to determine the weight of influencing factors. Since the factors used for evaluation have different spatial resolutions, all factors need to be adjusted to a resolution of 2.5 arcmin (approximately 4.65 km × 4.65 km).

  3. 3.

    Weighted linear combination (WLC) was used to construct the risk maps of porcine pasteurellosis by combining all risk factors16.

  4. 4.

    One-At-a-Time (OAT) was used to calculate the uncertainty map, and mean of absolute change rates (MACRs)10.

  5. 5.

    C# was used as the development language; MCDA was combined with WebGIS technology to construct a system for predicting risk areas of porcine pasteurellosis39.

Fig. 2
figure 2

Study framework to predict risk areas of porcine pasteurellosis based on MCDA.

Factors used to risk map

Railway and highway

Transportation plays an important role in spreading animal infectious diseases by increasing the contact rate between sick and healthy animals. The railway and highway data of the Chinese mainland were obtained from Geographic Data Sharing Infrastructure, College of Urban and Environmental Science, Peking University (http://geodata.pku.edu.cn), China. The railway and highway densities in the Chinese mainland were calculated by using kernel density estimation (KDE).

Temperature and precipitation

Temperature and precipitation are the important factors in the prevalence of porcine pasteurellosis6. The meteorological data were obtained from WorldClim, and the closest data of the current climate conditions were selected24.

Pig density and population density

As the most susceptible animal, pigs play an important role in the development and transmission of the porcine pasteurellosis. Moreover, population density can reflect the main areas of livestock farming and P. multocida is also pathogenic to humans. The population density and pig density data in this study were obtained from WorldPop and the Food and Agriculture Organization of the United Nations (FAO), respectively. The obtained data was resampled and adjusted to be consistent with the extent of the Chinese mainland.

Influence degree

Research on animal infectious diseases based on MCDA needs a leading factor to represent the research object. Therefore, used the number of porcine pasteurellosis outbreaks to reflect the degree of outbreak in various regions. According to previous studies, inverse distance weighting (IDW) is the most suitable method to simulate the impact of porcine pasteurellosis compared with ordinary kriging, local polynomials, and simple kriging25. In this study, we used IDW to interpolate the number of porcine pasteurellosis outbreaks from 2008 to 2020 in various provinces in the Chinese mainland to obtain the influence degree (degree of outbreak) map.

Standardization of factors

After preprocessing all influencing factors, a raster dataset for the Chinese mainland under the resolution of 2.5 arcmin was obtained. According to studies on other animal infectious diseases, fuzzy membership functions were used to standardize all factors10,15,16. Population density, pig density and the influence degree of porcine pasteurellosis were processed by a linear relationship. Railroad density, highway density, average annual temperature and annual precipitation were standardized by sigmoidal relationship. Finally, seven risk factors were normalized to the spatial maps on a 0–1 scale (Supplement 1).

Determining factors weights

Methods in the determination of factor weights include two main categories: subjective assignment and objective assignment. Our study used PCA to determine the weights of each factor. PCA is a kind of recombination of multiple original variables with a strong correlation to generate a few uncorrelated variables and extract information of the original variables. The weight calculation process of the seven factors was as follows:

  1. 1.

    Calculation of the correlation coefficient matrix:

    $$R=\left[\begin{array}{cccc} r11 & r12& ...& r1p\\ r21& r22& ...& r2p\\ ...& ...& ...& ...\\ rp1& rp2& ...& rpp\end{array}\right]$$
    (1)

    rij is the correlation coefficient between the original variables xi and xj (i, j = 1, 2, …, p).

  2. 2.

    Calculation of eigenvalues and eigenvectors.

    The characteristic equation is |λE − R| = 0. E is the unit matrix, calculating all the eigenvalues, arranging them in order of magnitude as λ1 ≥ λ2 ≥ λp ≥ 0, and respectively finding the eigenvectors ei(i = 1,2,…,p) corresponding to the eigenvalue λi.

  3. 3.

    3. Calculation of the cumulative contribution rate of the principal components:

    $$\frac{\sum_{k=1}^{m}{\lambda }_{k}}{\sum_{k=1}^{p}{\lambda }_{k}}$$
    (2)
  4. 4.

    Calculation of the coefficients of the original index in the linear combination of different principal components (Lij):

    $${L}_{ij}=\frac{{e}_{ij}}{\sqrt{{\lambda }_{i}}}(i=\text{1,2},...,m; j=\text{1,2},...,p)$$
    (3)

    eij is the loading of the original index (xj) in the i-th principal component.

  5. 5.

    Calculation of the contribution rate of the principal components (ui):

    $${u}_{i}=\frac{{\lambda }_{i}}{{\sum }_{k=1}^{m}{\lambda }_{k}}(i=\text{1,2},...,m)$$
    (4)
  6. 6.

    Calculation of the co`efficient of the original index in the comprehensive model (dj):

    $${d}_{j}={\sum }_{i=1}^{m}{u}_{i}{L}_{ij}(i=\text{1,2},...,m; j=\text{1,2},...,p)$$
    (5)
  7. 7.

    Normalization of index weight (wj):

    $${w}_{j}=\frac{{d}_{j}}{{\sum }_{k=1}^{p}{d}_{k}}(j=\text{1,2},...,p)$$
    (6)

Construction of risk map

Seven risk factors were weighted using WLC to construct a risk map. The formula is as follows:

$$S= {\sum }_{i=1}^{n}{w}_{i}v({a}_{i})$$
(7)

n is the number of risk factors; \(w\) is the weight value of risk factor i; v is the value functions of factors at layer i (ai); and S is the total value of each small grid26. ArcGIS 10.2 (ESRI, Redlands, CA, USA) was used to construct the risk map.

Sensitivity analysis and validation

One-At-a-Time is a commonly used sensitivity method, also known as OAT. The purpose of using OAT was to examine the influence of the output factors on the output results27. It reflects the degree and regularity of the influence of a single factor on the result by changing the weight of only one factor at a time. The method has a high comparability and the OAT method is simple and easy to implement.

It is necessary to specify a feasible weight deviation range, which can be defined as a range of percent change (RPC) derived from the original standard weight value in basic operations. Various ranges were applied for each criterion, or a single range was used for all criteria. In this study, the RPC was between − 20 and 20%, with a step size of 1%27. The adjusted new weight is calculated using the following formula:

$$W({C}_{m},PC)=W({C}_{m},0)+W({C}_{m},0)\times PC (1\le m\le n)$$
(8)

where W(Cm, PC) is the initial weight of the major changing factor Cm, PC is the percentage of weight change, and n is the number of risk factors.

$$w({C}_{i},PC)=(1-w({\text{C}}_{\text{m}},\text{PC}))*\frac{{W(C}_{i}, 0)}{1-{\text{w}(\text{C}}_{\text{m}}, 0)} (1\le i\le n,i\ne a)$$
(9)

where W(Ci,0) is the weight of i-th criterion (Ci) at the preliminary weight.

Risk factors were compared by calculating the MACRs of the risk map and adjusted-weight map using the following equation:

$$\text{MACRs}={\sum }_{k=1}^{N}\left|\frac{{R}_{K}-{R}_{0}}{{R}_{0}}\right|\times 100\%$$
(10)

Where Rk is the adjusted-weight risk map; R0 is the initial risk map; N is the number of pixels.

The uncertainty map was the standard deviation of the risk maps generated after all factors changed the weights28,29. Moreover, the outbreak data for porcine pasteurellosis from 2021 to 2022 was used to assess the accuracy of the risk map. The predictive performance of the risk map was evaluated by calculating the receiver operating characteristic (ROC).

Construction of prediction system

Figure 3 showed the research framework for WebGIS-based prediction system. C# was used as a development language to develop the system in Microsoft Visual Studio 2012. Model Builder in ArcGIS 10.2 was used to build models, publish and share services dependent on ArcGIS Server. Silverlight and XAML were used to enable the dynamic presentation of simulation features. The data were stored and processed in Oracle 11 g and ArcGIS. Finally, MCDA was combined with WebGIS technology to construct a system for predicting risk areas of porcine pasteurellosis.

Fig. 3
figure 3

Study framework for WebGIS-based prediction system. (This map was generated using ArcGIS version 10.2 (Esri, https://www.esri.com/en-us/arcgis/products/arcgis-pro/overview). The base map of China was sourced from the Chinese National Standard Map Service (http://bzdt.ch.mnr.gov.cn/index.html), approval number GS (2020) 4619.)

Results

Weight of influencing factors

The scree plot showed that application of two principal components could explain most of the variance of the data (Fig. 4). By calculating the characteristic equations of seven factors, the first two principal components could explain 75% of the variance of all components (Table 1). The coefficients of comprehensive model were normalized to calculate the weights of the factors. The influencing degree of porcine pasteurellosis (0.171), pig density (0.162), and highway density (0.160) had the highest weights in the risk map, followed by the annual average temperature (0.145), precipitation (0.144), and railway density (0.138). In contrast, population density (0.076) had lower importance (Table 2).

Fig. 4
figure 4

The number of extracted principal components.

Table 1 Results of principal component analysis.
Table 2 Weight of the factors.

Risk map for porcine pasteurellosis

Figure 5 showed the risk map of porcine pasteurellosis in the Chinese mainland. Highly risk areas are mainly located in the central and southern of the Chinese mainland, including parts of Sichuan, Chongqing, Guangxi, Guangdong, Hunan, and Hubei. The low-risk areas were mainly distributed in the northern and western, including parts of Qinghai, Xinjiang, Mongolia, Tibet, Heilongjiang, and Jilin.

Fig. 5
figure 5

Risk map of porcine pasteurellosis in the Chinese mainland.

Sensitivity analysis

As shown in Fig. 6, the most sensitive factors were annual average temperature, highway density, and pig density, followed by the influence degree of porcine pasteurellosis, railway density, annual precipitation and population density. The sensitivity of annual average temperature was the most sensitive to changes in weights, and its maximum MACRs (6.5532%) was much lower than the corresponding rate of change in weights ( ± 20%), indicating that the results of risk map were relatively stable.

Fig. 6
figure 6

Mean absolute values of the change rate (MACRs) for risk factors.

Uncertainty map and validation

As shown in Fig. 7, the results of the uncertainty map showed that the simulation of the risk map was robust, with the maximum standard deviation less than 0.01. These results indicated that the simulation ability of the risk map can remain stable, when the weights of the influencing factors are changed. The ROC results showed that the prediction model has moderate predictive performance with the area under the curve (AUC) value of 0.80 (95% CI 0.75–0.84).

Fig. 7
figure 7

Uncertainty map (Maximum standard deviation of the adjusted-weight risk map).

WebGIS-based prediction system

MCDA was combined with WebGIS technology to construct a system for predicting risk areas of porcine pasteurellosis. Figure 8 showed the analysis results of the porcine pasteurellosis risk areas in a region of the Chinese mainland in January. This system can predict risk areas based on monthly updates of each factor data, providing users with decision support for disease prevention and control (Supplement 2).

Fig. 8
figure 8

Prediction results of the porcine pasteurellosis risk areas in a region of the Chinese mainland in January (The risk index increases from white to black).

Discussion

In the previous study, we constructed the first monitoring and warning system of porcine pasteurellosis based on WebGIS25. This system could realize the input and output of disease data, visualization of the epidemic situation, and spatial analysis function, but it cannot combine multiple risks to predict the risk areas of diseases. In this study, we developed a predictive model based on multi-criteria decision analysis (MCDA) and assessed risk areas for porcine pasteurellosis in the Chinese mainland. Then we used this model to construct a WebGIS-based prediction system, which can provide decision support for prevention and control of porcine pasteurellosis through monthly updates of risk factors. High temperatures and humid environments can reduce the immunity and resistance of animal organisms to pathogenic microorganisms.

For the weights of each factor, the differences between the weights of factors other than population density were not very significant. Changes in environmental factors are critical to the spread and occurrence of porcine pasteurellosis, especially climatic factors. High temperature and humidity environment can create conditions for the reproduction of mosquitoes and pathogenic microorganisms. Long-term high temperature and humidity environment may lead to the deterioration of the breeding environment and reduce the immunity of organisms7. The death of more than 200,000 saiga antelopes in central Kazakhstan from P. multicida in 2015 was thought to be related to temperature and humidity abnormalities23. Therefore, we need to pay attention to changes in the animal habitat and maintain strong veterinary surveillance to address possible future increases in climate and environmental stressors. Areas with high road and rail densities are at greater risk from disease transmission. Livestock trade, free movement of people, and vehicles can increase the prevalence of disease through pathogen-carrying contaminants30,31. In addition, pathogens can be present on animals and in the feeding environment for long periods, which is another way in which porcine pasteurellosis can be easily transmitted through contact and animal transport. P. multocida can affect a variety of animals and cause many diseases, such as pneumonia in small ruminants, fowl cholera in poultry, and atrophic rhinitis in swine32. As the most susceptible animal to porcine pasteurellosis, pigs occupy a higher weight on the risk map. Similarly, humans can also be infected with P. multocida, but the incidence rate is lower than that of pigs and other animals, such as rabbits, poultry, cattle, and sheep.

The most important influencing factors involved in the incidence of P. multocida in animals are mismanagement, high temperatures, poor air quality, and overcrowding in a limited space33,34. High suitability areas are mainly distributed in the south-central of the Chinese mainland, where the climate is humid and hot all year round and the traffic is complex, providing good conditions for the propagation and spread of pathogens. Most breeding patterns in the south-central are small households, and the proportion of intensive livestock farming is relatively small. Insufficient breeding management is one of the inducing factors involved in the incidence of P. multocida7. The unsuitable areas are mainly distributed in the west and the north, where the climate and geographical conditions are complicated, and most of the farmed livestock are cattle, sheep, and horses that can graze. Some of the people living in this area do not have a high demand for pork due to their ethnic customs. So, the number of pigs is small compared to other livestock. The combination of climate, geography, breeding needs, and other factors can make the western and northern unsuitable for porcine pasteurellosis.

The factor sensitivity analysis was done to evaluate or assist to reduce the uncertainty in the multi-criteria decision analysis by detecting slight changes (± 20%, 1% each time) of the input data. This analysis is very important for the verification and calibration of the numerical model35. In the sensitivity analysis of the factors, the annual mean temperature had a relatively high sensitivity to the results. Climate factors affect the entire region of the Chinese mainland and have high spatial heterogeneities, which may be the main reason for high sensitivity.

In this study, we used MCDA to develop a prediction model and construct a prediction system for porcine pasteurellosis. However, this type of research is still in the exploratory stage and there are still some uncertainties due to the constraints of various objective conditions. First, MDCA, as a fuzzy mathematical method, was often used to deal with uncertainty and ambiguity in research36,37. Applying this method to assess the risk areas of diseases may produce biased results. In our study, the application of MCDA to develop predictive models for porcine pasteurellosis was mainly due to the limitations of disease information. In further studies, as the disease information increases, we will use machine learning methods, such as random forest, support vector machine, and boosted regression tree, to reconstruct the prediction model of porcine pasteurellosis38,39. Compared with MCDA, machine learning methods may have higher prediction accuracy and lower prediction uncertainty by mining patterns in a large amount of complex data. In addition, the quality and accuracy of the original data, the selection of factors, the calculation of weights, and the grading of factors may cause uncertainties in the results. The weight of the factor is one of the main factors causing controversy and uncertainty in the results. It is impossible to have only one set of weights, and the range of weights is different. So, multiple result data sets will be generated28. In further research, we will focus on the issues involved above.

Conclusions

In this study, we developed a predictive model based on multi-criteria decision analysis (MCDA) and assessed risk areas for porcine pasteurellosis in the Chinese mainland. The results showed that risk areas for porcine pasteurellosis were predicted to locate in the south-central of the Chinese mainland, including Sichuan, Chongqing, Guangdong, and Guangxi. The maximum standard deviation of the uncertain map was less than 0.01and the ROC results showed that the prediction model has moderate predictive performance with the area under the curve (AUC) value of 0.80 (95% CI 0.75–0.84). Finally, MCDA was combined with WebGIS technology to construct a system for predicting risk areas of porcine pasteurellosis. These results can provide a theoretical framework for the construction of prediction models in animal infectious disease prediction systems. It also provides decision support for the development and adjustment of control strategies for porcine pasteurellosis.