Introduction

The rapid development of artificial intelligence (AI) technology has afforded various gains to academic circles and industries in the asset pricing field. AI algorithms have superior flexibility to capture the nonlinearity inherent in datasets; thus, such algorithms have been widely employed to perform various tasks in the financial domain, e.g., assessing credit risks (Bussmann et al., 2021; Chen et al., 2022; Baur et al., 2023) and forecasting recession and stock market crashes (Malladi, 2024). In addition, using AI algorithms in property price appraisals enables more precise assessment compared to traditional methodologies (Chen et al., 2022; Baur et al., 2023). Despite this promising potential, several negative effects attributed to the flexible inference of AI have been reported (Desiere and Struyven, 2021; Friedler et al., 2021; Fuster et al., 2022). For example, asymmetric estimation is a key concern inherent to AI algorithms because it can penalize social and/or economic minority groups, and these concerns can be applied to housing price appraisals. Disparities between social groups in housing price appraisals can lead to income inequality (Choi and Green, 2022) and social polarization (Bénabou, 1994) due to asymmetric housing affordability, thereby potentially threatening sustainable urban development and societal equity.

Traditionally, housing prices have been appreciated using hedonic price models (HPM), which have a long history of use in investigating the relationship between housing factors and property prices (McMillan et al., 1980; Chau and Chin, 2003; Hong et al., 2020; An et al., 2023). Despite their simple implementation and understandability, criticisms have contended that HPMs are limited in terms of their ability to capture the complexity in datasets accurately (Chau and Chin, 2003; Hong et al., 2020; Li et al., 2021). As an alternative, AI algorithms have recently been applied to the housing price appraisal task because they can seize the nonlinearity and dynamic patterns of housing prices effectively (Kang et al., 2021; Chen et al., 2022; Tchuente and Nyawa, 2022) due to their flexibility.

However, previous studies have acknowledged the adversarial effects of AI-based assessments, which may increase the disparities between social groups. For example, Friedler et al. (2021) found that machine learning (ML)-based decision-making systems can potentially discriminate according to demographic, social, and/or economic groups. For instance, the Correctional Offender Management Profiling for Alternative Sanctions, a case management and decision support tool used by United States (US) courts to assess potential recidivism, indicates that black defendants are twice as likely to be misclassified as recidivists compared to white defendants (Angwin et al., 2022). In addition, Desiere and Struyven (2021) demonstrated that AI-based profiling is 2.6 times more likely to misclassify foreign jobseekers as high-risk holders than native jobseekers. Similarly, ML technology has been shown to favor white non-Hispanic borrowers with lower average default propensities than black and Hispanic borrowers. Here, minority borrowers with largely dispersed default probabilities are penalized when estimating their risks, which has been attributed to the greater flexibility of ML algorithms (Fuster et al., 2022) compared with traditional statistical models. To the best of our knowledge, although potential inequality across economic and/or racial groups derived from ML-based assessment systems is evident, the discrimination effects between less-educated and well-educated groups in housing price appraisals have yet to be explored thoroughly.

Educational services and the local education level are closely linked to property prices; thus, the relationship between these factors has been investigated (Wen et al., 2017). Most relevant studies have found that education quality has a positive association with housing prices (Nguyen-Hoang and Yinger, 2011; Wen et al., 2017; Wen et al., 2018). Highly educated residents tend to have higher incomes than the less-educated; thus, they have greater purchasing power in the housing market, resulting in increased housing prices in well-educated cities (Guo and Qian, 2021). For less-educated individuals, the housing cost barrier frequently keeps them from living in a well-educated city (Berry and Glaeser, 2005; Choi and Green, 2022) because residents prefer housings nearby high-quality educational facilities, thereby leading to an increase in housing prices (Wang and Li, 2022). In this sense, the inequality between neighborhoods in terms of education can be presumed and empirically proven. In addition, if AI algorithms exaggerate the gaps between advantaged and disadvantaged groups, the unequal effects of housing price appraisals using AI algorithms according to education quality and achievement in urban cities are worthy of extensive examination.

Previous studies have demonstrated the unequal effects of AI algorithms, which tend to favor major social groups’ values over those of minor social groups (Desiere and Struyven, 2021; Friedler et al., 2021; Angwin et al., 2022; Fuster et al., 2022). We recognize that the characteristics of social groups can be the primary driver of the unequal effects caused by AI algorithms and note that such AI-induced inequalities may manifest in housing price appraisals based on educational levels in Korean neighborhoods. In that context, this study also explores the potential for AI-based property valuations to exacerbate inequalities where educational levels are unevenly distributed across neighborhoods.

Thereby, this study examines how AI models may produce unequal price appraisals and how these asymmetric price appraisals are exacerbated across heterogeneous neighborhoods in educational contexts. To elucidate this, we focus on the differences in the results between the AI and traditional HPMs, i.e., the comparative analysis between the nonlinear and linear valuation models (Fuster et al., 2022). In this procedure, we primarily control the education variables concerning the housing price appraisal. The comparative analysis between AI algorithms and traditional HPMs reveals that neighborhoods with lower education levels are penalized by AI models, whereas well-educated neighborhoods are favored. In addition, discrepancies in housing price assessments are exacerbated in accordance with local higher education levels. This indicates the tradeoffs associated with introducing AI algorithms into housing price appraisal, i.e., while taking advantage of flexibility, fairness in housing price assessments is reduced.

The findings of this study contribute to real estate and property valuation studies dedicated to developing impartial and transparent valuation models by identifying implications regarding the negative externalities of AI algorithms. The results also provide insights for decision makers seeking to advance sustainable urban societies and balanced city development. Policymakers can contemplate which social groups are potentially exposed to these unequal effects and which factors may drive inequalities in housing price appraisals.

The remainder of this paper is organized as follows. “Literature review” reviews previous literature. “Study contexts” provides background information on the study sites and South Korea in general. “Data and methodology” explains the data and methodological framework used in the study, including the experimental design. “Results and discussion” details the results regarding the unequal effects of AI algorithms in housing price appraisals and discusses the findings. Finally, the paper is concluded in “Conclusion”.

Literature review

Asymmetric estimations of the artificial intelligence models

Existing studies have applied AI models to establish a credit scoring model (Gunnarsson et al., 2021), property valuation frameworks (Baur et al., 2023), and mortgage default prediction systems (Fitzpatrick and Mues, 2016). This procedure highlights the comparative analysis between linear and nonlinear models; the former can provide basic trend information and a formidable starting point for unseen samples, particularly against nonlinear models such as AI algorithms (Schulz et al., 2020). However, previous studies have primarily focused on enhancing the predictive power of nonlinear models by comparing them with those of linear models.

A growing body of literature has indicated that sophisticated models (whose accuracies are validated against linear models) can produce asymmetric estimations that overestimate major social groups’ values while underestimating those of minor social groups (Desiere and Struyven, 2021; Friedler et al., 2021; Angwin et al., 2022; Fuster et al., 2022). This situation calls for more empirical evidence and time to ensure the trained AI models’ capability in real-world settings, aside from the given test datasets (Yang et al., 2022). In other words, the deployed models require continuous monitoring and updates to be balanced in real-world settings, even after finalizing them during the testing phase (Gama et al., 2014; Paleyes et al., 2022).

If an AI-based property valuation model tends to favor specific social groups compared to linear models that provide a benchmark point (Schulz et al., 2020), it may have long-term implications for housing price appraisals. Housing price valuation systems are rarely updated due to illiquid property transactions (Deppner et al., 2023); therefore, a significant dependency on AI systems, merely validated during testing, can potentially threaten social sustainability and exacerbate societal inequalities over time. Hence, we must determine the possible episodes where asymmetric estimations of AI models can occur for housing price appraisals. Such investigations require focusing on the model validation procedure regarding fairness and balanced estimation for social groups.

Relationship between education variables and housing prices

Various aspects determine housing prices, including housing attributes, neighborhood characteristics, financial circumstances, and education—a key variable concerning housing prices (Wen et al., 2017; Wen et al., 2018). Previous studies have documented that educational resources are vital social assets in urban development (Berry and Glaeser, 2005; Guo and Qian, 2021; Choi and Green, 2022) and significantly influence housing prices and homebuyers’ decisions to purchase dwellings (Guo and Qian, 2021; Wang and Li, 2022). Furthermore, differences in neighborhood education levels could lead to regional disparities and societal inequality (Berry and Glaeser, 2005; Choi and Green, 2022). This implies that heightened property values due to educational privileges could exacerbate existing inequalities and be a significant barrier to social integration; however, existing literature has primarily highlighted the significance of local education levels on housing prices, overlooking how these disparities can arise in housing price appraisals and become aggravated following different educational contexts.

Urban educational resources—including educational facilities and school quality—are key determinants for property valuations and exhibit a nonuniform relationship with housing prices (Bayer et al., 2007; Wen et al., 2018). Thus, education variables can provide critical information for precisely appraising housing prices, particularly for the tail groups characterized by heterogeneous educational environment levels, offering significant implications for AI-based valuation models. HPMs primarily capture trends and linear patterns in housing prices. In contrast, AI models generate flexible and nonlinear price appraisals (Hong et al., 2020; Schulz et al., 2020), making AI-based estimates more susceptible to distortion at the tails of social groups (Fuster et al., 2022).

In this sense, the absence of education variables from a housing variable set can induce asymmetric estimations, causing AI algorithms to systematically over- or under-estimate property values at the tails of educated groups. Moreover, the interrelationship between social variables and asymmetric estimations produced by AI models (Desiere and Struyven, 2021; Friedler et al., 2021; Angwin et al., 2022; Fuster et al., 2022) highlights the need to examine how the absence of education variables can exacerbate unequal effects in housing price appraisals, particularly for marginal educated groups.

Accordingly, the novelty and contributions of this study are threefold. First, we deliver insights into the conditions under which AI-based property valuation models generate asymmetric price estimations, producing unequal effects. Second, this study identifies how these unequal effects can be exacerbated by the heterogeneous educational contexts of different neighborhoods. Finally, this study explores potential pathways through which housing inequality may be intensified via disparities in housing price appraisals.

Study contexts

Education is significantly associated with housing prices and has been considered a primary resource for economic development in South Korea for a long time. In the early 1950s, after the Korean War left the country in ruins, South Korea overcame despair and developed its economy rapidly (Oh, 2007; Ministry of Education, 2015). “Education fever”Footnote 1, which has been prevalent in Korea for several hundred years, has increased the quality of human resources and enabled rapid economic development (Jones, 2013; Dittrich and Neuhaus, 2023). Since the new millennium, the demand for excellent creative human resources has increased, underscoring knowledge-intensive cutting-edge technology industries and changes in the global economy. Thus, South Korea has focused on developing human resources to establish a strong knowledge-based society, thereby fueling the need for education to cultivate highly educated workers and professional industrial human resources (Ministry of Education, 2015). In this context, homebuyers in South Korea tend to prefer neighborhoods with high-quality education services because parents want their children to enjoy an environment that is conducive to quality education.

In addition, education is an effective path to success and has been considered the fairest way of climbing the economic ladder. This actively leads to well-educated parents having higher willingness and ability to pay for education costs than less-educated parents. However, criticisms have been mounting. For example, the intensive education fever in South Korea has exacerbated the polarization of society because it forces parents to invest excessively in private tutoring for their children (Ministry of Education, 2015).Footnote 2 Thus, increased education costs are imputed to housing costs, thereby creating obstacles for lower-income families seeking to reside in well-educated neighborhoods. This corroborates the existing literature documenting that housing cost barriers are related to education (Berry and Glaeser, 2005; Choi and Green, 2022) because better education services and institutions increase neighborhood housing prices (Nguyen-Hoang and Yinger, 2011; Wen et al., 2017). Therefore, we can infer that South Korea has a nonuniform distribution of housing prices depending on heterogeneous local educational levels. As such, South Korea has an appropriate background to investigate whether the unequal effects of AI-based housing price appraisals can be exacerbated according to local educational levels.

Four metropolitan cities in South Korea, i.e., Busan, Daegu, Daejeon, and Gwangju, were selected as the survey areas in this study. Note that Greater Seoul has been investigated extensively and consistently explored, providing substantial empirical evidence that educational services and facilities have positive associations with housing prices (Jang and Kang, 2015; Sung and Ki, 2023). In contrast to Greater Seoul, financial premiums of local educational levels to housing prices have not been explored extensively in the target areas. In addition, these four cities have unique patterns of education environments compared with Greater Seoul due to the educational administration system in South Korea.Footnote 3 Figure 1 illustrates the locations of the study areas.

Fig. 1
figure 1

Study areas: Busan, Daegu, Daejeon, and Gwangju (sourced by paintmaps.com).

Data and methodology

Data

We obtained transaction records, including residential housing prices, from the Ministry of Land, Infrastructure and Transport (MLIT). Although there are various housing types, in this study, we targeted apartments because they include full address information, which enabled us to discern neighborhoods and aggregate amenity variables. In addition, apartments are the predominant residential type of housing in South Korea (Ahn et al., 2020). Thus, apartment datasets can properly represent the spatial dynamics of housing prices in the four metropolitan cities.

The datasets used in this study include various control variables regarding residential housing characteristics, e.g., unit areas, seasonal transaction periods, proximities to the local environmental amenities, and populations. These variables were retrieved from public databases (Statistics Korea, Statistical Geographic Information Service, the Korea Transport Database, and MLIT) and private real estate companies (Kookmin Bank real estate, Naver real estate, and Daum real estate). Data from the years 2018 and 2019 were aggregated, yielding 53,458, 56,606, 24,350, and 44,305 observations for Busan, Daegu, Daejeon, and Gwangju, respectively.Footnote 4

The aggregated datasets were categorized into four groups, i.e., housing characteristics, local amenities, local demographics, and seasonal dummies, as summarized in Table 1. Prior to fitting the datasets to our models, we cleaned the datasets, and the multicollinearity issue was assessed considering the variance inflation factor (VIF). In this process, some variables were dropped in the modeling procedure. In addition, four variables, i.e., transacted prices and three proximity variables, were transformed to logarithmic scale because these variables significantly depart from a normal distribution (Ahn et al., 2020). The descriptive statistics of the variables used in this study are given in Appendix A in the Supplementary Information.

Table 1 Categorized hedonic variables.

A total of 17 variables were confirmed as hedonic variables, including two education variables, to appraise housing prices. The details of the variables are shown in Table 2, where the first column indicates the appellation of a variable denoted in our results. The second column describes the variable’s characteristics. Generally, the constructed variable set aligns with the existing literature on the assessment of housing prices (Ahn et al., 2020; An et al., 2023; Dai et al., 2023).Footnote 5

Table 2 Control variables.

As proxies for education levels in neighborhoods, two education variables, i.e., Univ. grad. and Top school, were considered in this study.Footnote 6 The Univ. grad. variable reflects the ratio of university graduates among adults residing in a neighborhood, which has conventionally been utilized as a proxy for neighborhood education levels in relation to housing prices (Ahn et al., 2020; Lin et al., 2022; An et al., 2023). A higher level of education has served as a requisite credential for quality job opportunities in South Korea,Footnote 7 as discussed in “Study contexts”. As such, it is commonly believed that people with a higher level of education can afford greater housing costs based on their higher wages. Lin et al. (2022) found that university graduates are more likely to settle for higher housing costs to reside in urban cities with better public services and urban infrastructures. Accordingly, the Univ. grad. variable can capture the degree of educational levels in a neighborhood in relation to housing prices.

The Top school variable represents how many students are admitted to the nation’s most prestigious university, i.e., Seoul National University. Hundreds of universities and community colleges exist in South Korea; however, entrance to Seoul National University is highly competitive because admission has a higher potential for high social standing than admission to lower-ranked universities.Footnote 8 This implies that the number of entrants to Seoul National University can reflect the education fever and/or the quality of educational services in a given neighborhood. In this context, Chung (2015) used this variable, i.e., the admittance rate to Seoul National University, as a proxy for school quality in neighborhoods. Thus, our education variable, Top school, can represent the extent of education fever and/or the quality of educational services in a neighborhood.

We conduct a descriptive statistical analysis, using the key variables (Univ. grad. and Top school), to gain preliminary insights for the comparative analysis. We begin by examining the skewness and excess kurtosis of both education variables within each city. Furthermore, we test whether the two statistics conform to those expected under a normal distribution. Table 3 represents each education variable’s skewness, excess kurtosis, and corresponding test results in each city. Table 3 shows that all surveyed areas exhibit asymmetric distributions for both education variables. Educational contexts in neighborhoods are key determinants concerning property prices and housing purchases (Guo and Qian, 2021; Wang and Li, 2022). In this sense, the test results indicate that estimation disparities between HPMs and ML models can arise from asymmetric education levels within a city, as HPMs primarily capture linear trends (Schulz et al., 2020) against the nonlinearity of ML models (Fuster et al., 2022).

Table 3 Skewness and excess kurtosis of education variables.

Next, we employ the \(t\)-test to compare mean values and the \(F\)-test to compare variances of the two education variables in order to determine whether statistically significant differences exist between cities (Yao et al., 2022). Table 4 summarizes the pairwise test statistics for each city, highlighting the heterogeneous characteristics of the education variables across all surveyed areas. Based on these preliminary insights, we treat each city as an individual experimental group (Rainio et al., 2024).

Table 4 \(t\)-test and \(F\)-test statistics for comparisons between the study areas.

Hedonic price models

HPMs have served as the conventional approach to estimate residential housing prices (Chau and Chin, 2003; An et al., 2023). Most studies have utilized this linear method to identify hedonic variables’ marginal effects on housing prices (Hong et al., 2020; Ahn et al., 2020; Qiu et al., 2023) due to its simple implementation and intuitive interpretability. The ordinary least squares (OLS)-based regression form is the representative model for HPMs. This study assessed housing prices based on HPMs and identified the financial premiums of hedonic variables in relation to housing prices. The mathematical formula of the OLS-based HPM in the log-linear formFootnote 9 (Gibbons et al., 2014; An et al., 2024) is expressed as follows:

$${\mathrm{ln}}\,{p}_{i}=c+\mathop{\sum }\limits_{h=1}^{H}{\beta }_{h}{x}_{{ih}}+\mathop{\sum }\limits_{s=1}^{3}{\gamma }_{s}{d}_{{is}}+{e}_{i},$$
(1)

where \({p}_{i}\) denotes the transacted price of an apartment unit \(i\) per square meter, \(c\) is the constant, and \(H\) is the number of hedonic variables, including housing characteristics, local amenities, and local demographics (\({x}_{{ih}}\)). In addition, \({d}_{{is}}\) denotes the seasonal dummy variables (spring, fall, and winter), and \({\beta }_{h}\) and \({\gamma }_{s}\) denote regression coefficients corresponding to hedonic variables (\({x}_{{ih}}\)) and seasonal dummies (\({d}_{{is}}\)), respectively, which are estimated using the OLS method. The residual term is denoted \({e}_{i}\).

Note that the housing unit prices potentially affect the prices of nearby housing units (Qiu et al., 2023); thus, housing prices are spatially correlated (Huang et al., 2017). To alleviate this spatial dependency issue (Ahn et al., 2020) and ensure the robustness of our results, we introduced the spatial lag regression (SLR) model in the assessment of housing prices. Here, a spatial lag term (\({Wy}\)) is input to the hedonic model. The model specification of the SLR is expressed as follows (Anselin, 2013):

$$y=\rho {Wy}+X\beta +\varepsilon ,$$
(2)

where \(y\) represents the \(N\times 1\) vector of the log-transformed housing prices, \(\rho\) denotes the spatial lag parameter, which is estimated by minimizing the root mean squared error (RMSE) in the range between \(-1\) and \(1\), \(X\) represents the \(N\times \left(H+4\right)\) matrix of hedonic variables including three seasonal dummies, and \(\beta\) represents the \(\left(H+4\right)\times 1\) vector of regression coefficients. Finally, \(\varepsilon\) denotes the \(N\times 1\) vector of residuals, which is assumed to be homoscedastic, independent across observations, and distributed normally.

The spatial weight matrix \(W\) takes the \(N\times N\) matrix in the following row-standardized form:

$${W}_{\tau v}=\left\{\begin{array}{l}1/{D}_{\tau v},\quad{D}_{\tau v} < {D}_{{band}}\\ 0,\qquad\quad\,{\rm{otherwise}}\end{array}\right.,$$
(3)

where \(\tau\) and \(v\) denote two housing units’ locations, which can be specified by their longitudes and latitudes; thus, \({D}_{\tau v}\) is the distance between two housing units located in \(\tau\) and \(v\). In addition, \({D}_{{band}}\) is the distance band set by unity. However, the potential bias resulting from simultaneity issue must be addressed because the target variable \(y\) in Eq. (2) can be jointly estimated (Brueckner, 1998). Therefore, we implement SLR depending on the following mathematical formula, which can be obtained from Eq. (2), to remedy biased estimation (Brueckner, 1998; Ahn et al., 2020):

$$\left(I-\rho W\right)y=X\beta +\varepsilon .$$
(4)

Machine learning algorithms

Regarding our core research questions, we employ the random forest (RF) and extreme gradient boost (XGBoost) models to assess housing prices. These models serve as appropriate alternatives to handle the nonlinearity inherent in our datasets based on their flexibility, which has been challenged to linear models (Dou et al., 2023; Swietek and Zumwald, 2023).

The RF model is an ensemble of decision trees that utilizes a number of predictors, i.e., trees, which can be described as a set of \({h}_{t}\left({X}^{d}\right)\), where \({h}_{t}\) represents a tree predictor corresponding to a tree \(t\), and \({X}^{d}\) is the matrix of the hedonic variables (including seasonal dummies). Here, each tree predictor estimates the log-transformed housing prices in this study. The RF model adopts the bootstrap methodFootnote 10 and out-of-bag estimationFootnote 11 in the training process, thereby providing robustness against outliers and unbiased estimations (Breiman, 2001). The final output of the RF model can be obtained by aggregating each estimated value of the tree predictors, i.e., \({h}_{t}\left({X}^{d}\right)\), as follows:

$$\hat{y}=\frac{1}{T}\mathop{\sum }\limits_{t=1}^{T}{h}_{t}\left({X}^{d}\right),$$
(5)

where \(T\) denotes the number of tree predictors, and \({X}^{d}\) is the \(N\times \left(H+3\right)\) matrix of the hedonic variables used to appraise housing prices.

The XGBoost model employs a growing tree in accordance with feature splits and additive tree structures. The addition of a single tree in each iteration supplements the previous predictor’s estimation error. Following the iterations correcting residuals, the XGBoost model can enhance the predictive power incrementally (Chen and Guestrin, 2016). Based on decision rules, the XGBoost model utilizes tree structures \(f\), which map a score considering the variables’ characteristics to each leaf within each tree structure \({f}_{t}\). Given \(T\) trees, the XGBoost model obtains the estimated value \(\hat{y}\) by summing all scores assigned to the leaves in each tree. The XGBoost model attempts to minimize the following objective function, consisting of the loss function and regularization term as follows (Dou et al., 2023):

$${Obj}\left(\phi \right)=\mathop{\sum }\limits_{i=1}^{N}l\left({\hat{y}}_{i},{y}_{i}\right)+\mathop{\sum }\limits_{t=1}^{T}{\rm{\Omega }}\left({f}_{t}\right),$$
(6)

where \(\phi\) is a set of parameters, and \(l\) is the loss function that calculates the estimation errors. In addition, \(N\) is the number of observations, and \({\hat{y}}_{i}\) and \({y}_{i}\) are the estimated and real target values for single observation \(i\), respectively. \({\rm{\Omega }}\) is the regularization term that controls the complexity of the regression predictors (Chen and Guestrin, 2016), which improves generalization, and each \({f}_{t}\) corresponds to the structure of each tree.

Experimental design

Two scenarios were considered in this study: (1) the emergence of unequal effects of the AI-based valuation and (2) its aggravation in housing price appraisals. The unequal effect of the AI-based valuation model can be confirmed when the positive difference between the housing prices assessed by the AI model and the HPM is in the well-educated housing group, whereas the negative difference is in the less-educated housing group. In addition, the unequal effect is exacerbated if it satisfies the following scenario: the difference between values estimated by the AI model and HPM is statistically greater for well-educated housing groups compared to less-educated housing groups.

Before identifying the two scenarios, we first analyze whether local education levels are significantly associated with housing prices across all surveyed areas. This procedure incorporates two education variables in the HPM specifications, and we investigate their regression coefficients. In addition, the valuation models’ predictive power was checked in the assessment of housing prices. Here, we divided the entire dataset into a training set (70%) and a test set (30%) in each of the four cities. To discern the unequal effects of flexible models according to educational levels, we considered three cases: (1) training the model and assessing housing prices by controlling Univ. grad., (2) training the model and assessing housing prices by controlling Top school, and (3) training the model and assessing housing prices by dropping two education variables. After applying these cases to the four models, i.e., a series of HPMs and two ML models, we calculated the difference between the values estimated by the AI model and the HPM, i.e., \({\hat{y}}_{{AI}}-{\hat{y}}_{{HPM}}\).

Then, we sorted the differences in ascending order based on the level of education variable and aggregated lower and upper groups into 5, 10, and 20 percentiles, considering observations. Finally, we conducted \(t\)-tests for within each group and for between lower and upper groups. The \(t\)-test within each group validates the homogeneity between housing prices estimated by AI models and HPMs, and we employed the \(t\)-test between lower and upper groups to validate whether the differences in the housing prices estimated by the AI models and HPMs exhibit different patterns of percentile groups, i.e., education levels. The latter test was conducted to discriminate whether the discrepancy derived from the AI-based valuation model was exacerbated on average according to the neighborhood’s education level.

Results and discussion

Effects of local education levels on housing prices

Using a series of HPMs, we identified the effects of the hedonic variables on housing prices, focusing on two education variables, Univ. grad. and Top school. As shown in Table 5, the \(F\)-statistics appear significant, and the HPMs explain the variance in housing prices fairly well based on the adjusted R2 for all cases. Our key variables, i.e., Univ. grad. and Top school, are positively and significantly associated with housing prices in the four metropolitan cities when each variable is solely incorporated in HPMs. These results imply that the neighborhood’s education level and the quality of educational services have positive financial premiums relative to housing prices. These findings align with the results of previous studies, which found that education levels are positively linked to residential property prices (Nguyen-Hoang and Yinger, 2011; Wen et al., 2017; Ahn et al., 2020; An et al., 2023).

Table 5 Regression results for Busan, Daegu, Daejeon, and Gwangju.

In addition, when both education variables were incorporated in a series of HPMs, they still exhibited significant relationships with housing prices. In other words, aside from the conventional education variable, i.e., Univ. grad., the additional Top school variable has significance in terms of explaining housing prices. Accordingly, we confirmed that higher quality educational services contribute to an increase in housing prices (Nguyen-Hoang and Yinger, 2011; Wen et al., 2017), which further indicates that the two educational variables can jointly explain housing prices.

Assessment of housing prices

Figure 2 illustrates the predictive power of the models for the housing price appraisal task. In this procedure, we estimated the housing prices of the four cities for comparison. Each circle consecutively represents the RMSE value for the following four cases: (1) full variables, (2) omitting Univ. grad. from the full variables, (3) omitting Top school from the full variables, and (4) omitting both Univ. grad. and Top school from the full variables. The gray bar denotes the average RMSE value of the results derived from each model.

Fig. 2
figure 2

Model comparisons for the predictive power.

As shown in Fig. 2, the AI-based valuation models outperformed the HPMs in terms of the RMSE value. By comparing each model’s predictive power, we found that the RF and XGBoost models obtained lower RMSE values than those of the HPMs when assessing housing prices (regardless of the cases and the cities). These findings corroborate previous studies demonstrating that AI algorithms are more accurate than statistical models due to their flexibility in appraising housing prices (Chen et al., 2022; Lorenz et al., 2023).Footnote 12

We also examined how the housing prices estimated by the ML algorithms differ according to education levels using a partial dependency plot, which visualizes the marginal effects of variables on the estimated results (Lenaers and De Moor, 2023; Lorenz et al., 2023). Here, each education variable was divided into five groups with the same size, i.e., 20 percentile groups; thus, a total of 25 groups were created using the Univ. grad. and Top school education variables. Figure 3 shows the modalities of the flexible models when the education levels were controlled in the assessment of housing prices. We visualize the Daegu case when the RF model was used to assess housing prices, and the other cities exhibited a similar pattern.

First, the rightward circles tend to be saturated as the level of Univ. grad. increases, which implies that the ML model tends to estimate higher prices as the number of university graduates increases. Second, the upward circles also tend to be saturated as the degree of Top school increases, which indicates that housings in a neighborhood with higher education fever are estimated to be more expensive by the ML algorithm than neighborhoods with a lower education fever. Third, the right upper circle reveals that well-educated people are likely to reside in neighborhoods that have better educational services with high housing prices. In this context, we posit that the local education level is appreciated by housing prices even when using AI algorithms to assess housing prices.

Fig. 3: Modality of the flexible model according to education levels.
figure 3

Larger circles indicate that a higher number of observations are grouped, and darker circles represent higher median values of the estimated housing prices.

Disparities in housing price appraisals

Following our experimental design, we explore the two scenarios regarding the unequal effects of AI-based housing price appraisals without controlling one or both education variables. Subsequently, this study conducts \(t\)-tests for the within-group and between lower and upper groups.Footnote 13 The results are summarized in Tables 69. In these tables, Panel A aggregates lower and upper groups based on the Univ. grad. variable according to 5th, 10th, and 20th percentiles when Univ. grad. and both Univ. grad. and Top school were omitted in the assessment of housing prices. Panel B aggregates lower and upper groups based on the Top school variable according to 5, 10, and 20 percentiles in the identical manner as Panel A. Each reported value indicates the average difference in each marginal group between the housing prices estimated by the AI algorithm and the HPM, and the \(t\)-test result is reported for within each group. We denote the \(t\)-test results between lower and upper groups along with their test statistics and significance. For example, Panel A.1 shows the difference between the assessed housing prices obtained by two models and the \(t\)-test results for within each group and for lower and upper groups with their test statistics, aggregated by the Univ. grad. variable when the housing prices were appraised without Univ. grad.

Table 6 Differences in estimated housing prices in Busan.
Table 7 Differences in estimated housing prices in Daegu.
Table 8 Differences in estimated housing prices in Daejeon.
Table 9 Differences in estimated housing prices in Gwangju.

The \(t\)-test results for within each group exhibit different estimation patterns between the AI models and HPMs, as evidenced by their significance for the target cities. Overall, less-educated groups have significant negative difference values, whereas well-educated groups have significant positive difference values. These findings imply that the AI algorithm tends to estimate higher housing prices in well-educated neighborhoods than the HPMs. Conversely, housing prices in less-educated neighborhoods are depreciated considerably by AI algorithms compared with the prices appraised by the HPMs.Footnote 14 The \(t\)-test results for between lower and upper groups show that the average difference between housing prices, estimated by the AI algorithms and HPMs, is heterogeneous between lower and upper groups. In other words, these findings indicate that the unequal effects of the AI appraisal model are exacerbated in the assessment of housing prices in metropolitan cities.

The findings reveal that the ML model is asymmetric in favor of well-educated neighborhoods when appraising housing prices. The ML-based valuation model tends to highly appreciate housing properties in better educated and educating areas, thereby incurring the unequal effects across neighborhoods. Conversely, less-educated groups are penalized by ML algorithms in terms of the appraised housing prices. This result can be attributed to the flexibility of the ML algorithm because it better fits the transacted housing prices, which results in dispersed assessments (Fuster et al., 2022).

AI techniques provide cost benefits, managerial efficiency, and differentiated enterprise strategies to financial technology (FinTech) agents; thus, AI-based valuation systems have been actively adopted to improve business quality (Zhang et al., 2021; Poirier, 2024) and pricing precision (Cao et al., 2021). However, AI models can incur dispersed housing price assessments, which generate asymmetric estimations. In this context, when AI-based valuation systems become the dominant instrument against conventional econometrics in housing markets, the unequal effect derived from ML algorithms may present an obstacle to individuals seeking to move into well-educated neighborhoods with greater cost barriers, thereby aggravating housing costs.

Prospects regarding the increase in housing prices may encourage banks to increase loan activities (Li and Tahsin, 2021), although the opposite may also hold true. Thus, the unequal effects of AI algorithms can potentially penalize households in areas with lower education levels (Madrigano et al., 2015) by undervaluing their properties, resulting in reduced loan limits and hindering home ownership opportunities. Residents and prospective homebuyers in such areas are likely to have lower educational attainments and/or place less value on educational environments, possibly due to limited financial resources (Davidoff and Leigh, 2008; Rajapaksa et al., 2020). Therefore, disadvantaged people with limited housing options would face greater challenges in securing loan approval compared with their economically advantaged counterparts (Munnell et al., 1996; Ross and Tootell, 2004), ultimately decreasing the home ownership rate (Gabriel and Rosenthal, 2005; Larsen and Sommervoll, 2004). In addition, home ownership serves as a significant means of wealth transfer between generations, which can also function as a source of social inequality (Galster and Wessel, 2019). In this sense, using only ML algorithms to assess housing prices must be considered carefully because this can exacerbate wealth disparity between classes.Footnote 15

Discussion

Although established models and systems demonstrated their superiority in the given test cases, whether AI models can generate balanced and symmetric estimations in future real-world cases remains unclear (Makridakis et al., 2018). Previous studies have demonstrated that AI models produce asymmetric estimations across heterogeneous social groups (Desiere and Struyven, 2021; Friedler et al., 2021; Angwin et al., 2022; Fuster et al., 2022). Hence, this study identified the conditions under which AI-based property valuation models generate asymmetric price estimations in the context of housing price appraisals.

Compared to benchmarks, our scenarios revealed that a relatively precise model could generate unequal estimations with an appreciation for socially advantaged groups (well-educated neighborhoods) and a depreciation for socially disadvantaged groups (less-educated neighborhoods). This highlights the need for continuous validation and imbalance correction following the deployment of the model (Myllyaho et al., 2021) using traditional benchmarks (Ye et al., 2025).

Fuster et al. (2022) demonstrated the unequal effects of ML models in social groups’ default risk evaluations (derived from race and ethnic characteristics) by comparing ML and linear models; in such a case, some groups may be penalized or rewarded by sophisticated models, including ML algorithms. This outcome can be attributed to the flexibility mechanism of sophisticated models, which can indirectly capture the effect of unobserved and restricted variables (e.g., race) on the outcome (Fuster et al., 2022). When infrequent model updates compound such unequal estimations, an overreliance on seemingly precise models in decision-making processes can lead to an “uneven playing field” as the embedded asymmetric effects accumulate over time.

In this sense, indiscriminately employing AI models can threaten balanced housing price appraisal since housing price valuation systems are rarely updated due to illiquid property transactions (Gallin et al., 2021; Deppner et al., 2023). If unequal estimations are used in future housing appraisals without regular adjustments, property values in well-educated neighborhoods will be consistently over-appreciated, while those in less-educated areas will continuously experience under-appreciation, widening the price gap between educational groups (Choi and Green, 2022).

Relievable unequal effect

This study suggests an alternative approach to alleviate inequality in the assessment of housing prices. As demonstrated by our findings, the unequal effect regarding housing price appraisals is incurred when education variables are omitted; thus, we investigated whether the unequal effect can be mitigated if two education variables are controlled in the explanatory variable set. Here, we consider the case of Daegu using the RF model to appraise housing prices as the representative case because it obtained the highest accuracy in our comparisons.

Figure 4 shows the differences between the housing prices estimated by the AI algorithm and the HPM by aggregated groups. Here, the yellow line indicates the difference between the estimated housing prices omitting two education variables, i.e., Univ. grad. and Top school, in appraising housing prices, and the black line represents the discrepancy between the estimated housing prices with the full variables (including the two education variables). In addition, each circle represents the case of aggregating each group based on the Univ. grad. variable, and the diamonds represent each group aggregated by the Top school variable.

Fig. 4
figure 4

Modality of differences between estimated housing prices.

If a group is susceptible to the flexibility of the ML algorithm, then the discrepancy of the estimated housing prices moves away from zero, as indicated by the red line. As shown in Fig. 4, both tail groups are more affected by the flexible estimation when the housing prices are appraised without the education variables (yellow lines). In other words, the unequal effects are exacerbated in both marginal sides, i.e., less-educated and well-educated neighborhoods. However, when the education variables are incorporated in the housing price assessment, the black lines are much closer to the red line in all groups compared to the yellow lines. Given education variables, the inequality stabilizes because both models leverage the information regarding the effects of education levels on housing prices; thus, the inequality can be partly relieved, as shown in Fig. 4.

Conclusion

This study investigated the unequal effects of introducing AI algorithms to the housing price appraisal task, with a specific focus on four metropolitan cities in South Korea. The findings of our comparative analysis demonstrate that housing prices in less-educated neighborhoods are depreciated considerably by AI algorithms compared to HPMs. Conversely, the housing prices in well-educated neighborhoods are appreciated by the AI algorithms with favored valuations. In addition, the discrepancy between lower and upper groups is exacerbated according to the local educational levels, which is caused by the flexibility of ML algorithms.

The findings of this study have important implications for decision makers. First, when the valuation models are constructed using AI algorithms, to maintain fairness in the assessment of housing prices, agents must consider variables that potentially induce inequality in their modeling procedure. Second, minute and targeted regulatory policies are required to manage the negative externalities derived from utilizing AI models in the assessment of housing prices. Specifically, FinTech agents must validate whether their AI-based valuation systems have asymmetry that results in asymmetric housing price appraisal.

In this study, we primarily focused on apartment buildings due to their dominance in the target study areas. Nonetheless, our research framework, which scrutinizes the unequal effects of AI models, can be extended to other types of housing or other regions. In addition to the education variables considered in this study, other factors that may potentially contribute to residential segregation or social exclusion warrant further investigation. Environmental amenities, such as air quality and urban greenery, can play a significant role; therefore, future research should expand the current research scope to address environmental inequalities across cities. Furthermore, future studies can extend the current research framework by tracking individuals’ educational levels to explore the micro relationship between educational achievements and housing affordability. Concurrently, future studies can consider effective training and modeling strategies to alleviate the tradeoffs between increasing precision and unbalanced pricing when using AI models for the housing price appraisal task.