Introduction

Subjective well-being (SWB) is a vital indicator of overall health and quality of life, reflecting how individuals perceive and experience their lives. As SWB is closely linked to the economic, social, and health circumstances of populations, understanding its determinants is crucial for promoting a thriving society (Layard, 2006). In recent years, there has been a growing interest in understanding the determinants of SWB, with particular attention paid to the environmental factors that shape it. Environmental determinants, which encompass elements of the built, natural, and social environments, such as service accessibility, air quality, noise levels, and social cohesion, play a crucial role in shaping individuals’ quality of life (Kim et al., 2020; Pfeiffer and Cloutier, 2016). Understanding these key environmental determinants is essential for developing effective policies and interventions aimed at enhancing population well-being.

SWB is a multifaceted concept encompassing various dimensions of individuals’ self-perceived quality of life. It is common to divide SWB into two dimensions: hedonic well-being and eudaimonic well-being (Ryan and Deci, 2001). Hedonic well-being, which focuses on life satisfaction and the balance between positive and negative affects, is commonly measured using the Satisfaction with Life Scale (SWLS) (Diener et al., 1985), the Positive and Negative Affect Schedule (PANAS) (Watson et al., 1988), and the Cantril Ladder (Cantril, 1965). Eudaimonic well-being, on the other hand, emphasizes personal growth, purpose, and fulfillment and is often assessed using Ryff’s Scales of Psychological Well-Being (PWB) (Ryff, 1989) and the World Health Organization Quality of Life (WHOQOL) (The Whoqol Group, 1998). Several classic works have summarized the literature on factors associated with SWB based on traditional statistical models (Aslam and Corrado, 2011; Diener et al., 2018; Mouratidis, 2021). These reviews highlight key contributors to SWB, including the fulfillment of basic human needs (e.g., shelter and food security), social and emotional needs (e.g., positive feelings from respect), and environmental factors (e.g., access to green space) (MacKerron and Mourato, 2013; Tay and Diener, 2011). Environmental factors, encompassing the natural, built, and social environments have received growing attention in SWB research (Wang and Wang, 2016). Theoretically, this effect has been explored through frameworks such as “person–environment fit”(Kahana et al., 2003) and “spaces of well-being” (Fleuret and Atkinson, 2007). Empirical studies highlight the impact of built, natural, and social environments on SWB, with factors like service accessibility, greenspace, noise annoyance, and social cohesion playing significant roles (Dittmann and Goebel, 2010; Morrison, 2011; Nisbet et al., 2011).

The field of urban studies has witnessed a notable shift from traditional regression-based analyses to machine learning (ML) approaches. This methodological transition is driven by the advantages that ML techniques offer, such as the ability to handle complex, high-dimensional data and reveal non-linear relationships, and interactions between multiple variables that traditional regression methods might overlook (Hindman, 2015). Decision-tree-based models, such as Random Forests (RF) and Gradient Boosting Decision Trees (GBDT), enable the identification of key environmental predictors while capturing non-linear effects and variable interactions without requiring predefined model structures (Tang et al., 2020). Artificial Neural Networks (ANN) further enhance analytical flexibility by learning intricate, multi-layered dependencies, making them well-suited for studying spatial and temporal variations in environmental determinants of SWB (Grebovic et al., 2023). While traditional models, such as non-linear regression and multi-level modeling, can handle certain complexities, they still require predefined assumptions about variable relationships and interaction terms. In contrast, ML methods adaptively uncover patterns from large, multi-source datasets, reducing the risk of model misspecification and enabling the integration of diverse data types. The key distinction between ML and traditional statistical methods lies in their underlying approach: ML is inherently data-driven, whereas traditional methods are hypothesis-driven. ML approaches prioritize pattern recognition and prediction, allowing models to learn relationships directly from data without the need for prior assumptions. This data-driven nature enables ML to uncover hidden patterns from the data, offering new opportunities to uncover insights that might be overlooked by traditional methods.

Given these advantages, the application of ML approaches in studying well-being determinants presents a significant opportunity to advance our understanding of how environmental factors influence SWB (Osawa et al., 2022; Song et al., 2023; Zhang et al., 2023). However, despite its growing adoption in SWB research, ML is not always methodologically justified, raising concerns about its necessity and appropriate implementation. Many studies apply ML despite small sample sizes, which undermines the reliability and generalizability of complex models (Flint et al., 2019; Vabalas et al., 2019). Unlike machine learning applications in big data contexts, most SWB studies rely on structured survey datasets that may not require complex feature selection or non-linear modeling (Zhang et al., 2018). Furthermore, some studies fail to justify their choice of ML over traditional regression-based methods, particularly when variable relationships are well-established or predominantly linear. Without clear methodological justification and proper reporting, ML risks being misapplied as a novelty tool rather than a rigorous analytical approach. To date, discussions on the adequacy of using ML in SWB research remain limited. This gap is significant because, while ML offers distinct methodological advantages, its application also presents challenges that must be critically examined. A systematic review is therefore needed to synthesize the current usage of ML methods in related fields and evaluate whether ML genuinely provides novel insights or simply adds computational complexity without meaningful advantages. Furthermore, comparing findings from both ML and traditional approaches would clarify the added value of ML and provide researchers with a clearer understanding of its strengths, limitations, and appropriate contexts for use.

This study, therefore, aims to synthesize existing research that employs ML approaches to identify environmental determinants of SWB, while also comparing these findings with those obtained using traditional methods. This comparison allows us to discern the unique contributions of ML. Furthermore, the study reviews the use of ML in the selected studies in terms of their justification for adopting ML, interpretation of the results, and data quality. Specifically, this review seeks to (1) provide an overview of the current state of research about ML-based studies on environmental determinants of SWB, (2) assess the strengths and limitations of using ML approaches, and (3) identify key determinants of SWB based on existing literature. Through this, we aim to offer insights into the current application of ML in understanding environmental determinants of SWB and emphasize the importance of avoiding the misuse or inappropriate application of ML approaches in future studies.

Methodology

This systematic review was conducted following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (Moher et al., 2009). The review focused on identifying and synthesizing studies investigating the environmental determinants of SWB using ML approaches.

Eligibility criteria

The eligibility criteria for selecting studies were defined based on the following inclusion and exclusion parameters:

Inclusion criteria

  1. 1.

    Study subject: Studies that focused on individual-based or regional-based populations.

  2. 2.

    Study outcome: Studies that investigated subjective well-being, including perceived mental health, perceived physical health, happiness, life satisfaction, quality of life, etc.

  3. 3.

    Analytic approach: Studies employing machine learning methods (e.g., random forest, support vector machine, gradient boosting model).

  4. 4.

    Article type: Peer-reviewed research articles.

  5. 5.

    Time of publication: Studies published from the inception of the databases up to March 2024.

  6. 6.

    Language: Studies published in English.

Exclusion criteria

  1. 1.

    Studies employing analytic methods rather than machine learning.

  2. 2.

    Studies lacking measures of environmental determinants.

  3. 3.

    Studies treating subjective well-being as an independent variable rather than an outcome.

  4. 4.

    Studies published in a language other than English.

  5. 5.

    Review papers, editorials, commentaries, and non-peer-reviewed articles.

Search strategy and selection process

A systematic search was conducted in four major electronic bibliographic databases: PubMed, Web of Science, PsycINFO, and Scopus. These databases were chosen for their comprehensive coverage of environmental, psychological, and health-related research. Specifically, PubMed and PsycINFO provide extensive access to health-related literature, crucial for understanding how environmental factors impact well-being from health perspectives. Web of Science and Scopus offer interdisciplinary coverage, including environmental, psychological, and social sciences, facilitating a broad understanding of the topic. The search strategy was designed to capture all relevant studies and included combinations of three groups of keywords related to environmental determinants, subjective well-being, and machine learning. Detailed search strategies and keyword combinations used for each database can be found in S1 of the Supplementary Information.

The initial search yielded a total of 449 articles. After removing 116 duplicate records, 333 unique articles were screened based on their titles and abstracts (Fig. 1). Two reviewers (MY and YZ) independently assessed the articles for eligibility based on the predefined criteria. Disagreements were resolved through discussions to reach a consensus. During the title and abstract screening, 230 articles were excluded for reasons such as irrelevant themes (n = 166), use of non-machine learning analytic methods (e.g., Ordinary Least Squares regression; Structural Equation Modeling; and quantitative research designs etc.) (n = 55), lack of measures of environmental or social determinants (n = 4), and studies where SWB was an independent variable (n = 2). This process resulted in 44 articles being selected for full-text screening.

Fig. 1
figure 1

Study exclusion and inclusion flowchart.

Following full-text screening, another 15 studies were excluded for the following reasons: irrelevant themes (n = 5), use of non-machine learning analytic methods (n = 7) (e.g., Bayesian multilevel ordered logit model, stepwise regression etc.), SWB as an independent variable (n = 1), no full-text access (n = 1), and lack of ML details in the method section (n = 1). Ultimately, 25 articles met all criteria and were included in the final review.

Data extraction and analysis

Data extraction was performed independently by the two primary reviewers (MY and YZ) to ensure consistency and accuracy. Discrepancies in data extraction were resolved through consensus discussions. The two authors followed transparent and systematic approach that included cross-examination and discussions between each step. The extraction process focused on capturing a comprehensive set of details from each study, including the article title and authors, year of publication, sample size (covering both individual and regional-based populations), and sample characteristics such as demographic and geographic details. Additionally, specific machine learning techniques and models employed in the studies were documented, along with the rationale provided by the authors for selecting these ML approaches over traditional methods. The instruments and scales used to assess SWB were carefully recorded and summarized. The extraction process also identified whether the studies reported and intepretated the non-linear relationships between environmental determinants and SWB, highlighting key determinants of SWB based on their relative importance in the ML models.

Given the extensive range of predictive features and SWB outcomes included in the analysis, the studies were deemed to be too heterogeneous for meta-analysis. Consequently, a narrative synthesis was conducted to provide a comprehensive overview of the findings. The studies were initially grouped according to the level of analysis (individual vs. regional) and subsequently according to the type of environmental determinants, based on the differentiation of geographical environments (e.g., built, natural and social environments). The data on the measurement of SWB were coded inductively. All coding decisions were compared between two authors (MY and YZ), with differences being resolved through discussions until consensus was reached.

Author positionality

The biases, histories, and interests of researchers shape their research processes (Bourke, 2014). Acknowledging the positionality of each author is crucial for transparency, as it helps contextualize how research processes and conclusions have emerged from individual perspectives.

The first author, MY, has a background in health geography, specializing in neighborhood health effects and community well-being. This expertise provided a nuanced understanding of how environmental determinants influence SWB, contributing valuable insight throughout the review process. The second author, YZ, has a background in GIS and public health and has published systematic reviews on obesity risks. YZ also specializes in ML analysis and has published journal articles utilizing ML approaches, offering a methodological perspective crucial for analyzing and interpreting complex ML studies. Overall, each author’s background and expertise contributed distinct perspectives and skills, ensuring a comprehensive and balanced analysis in the systematic review.

Results

Summary of included studies

Table 1 summarizes the basic information for the included studies. All studies were published between 2018 and 2024, comprising 21 studies based on individual data and four studies utilizing entire regions as the unit of analysis. The majority of these studies were conducted in China (n = 6), followed by the U.S. (n = 4), the U.K. (n = 3), Spain (n = 3), Italy (n = 3), Canada (n = 2), the Netherlands (n = 2), Japan (n = 1), Jordan (n = 1), Iran (n = 1), Poland (n = 1), Austria (n = 1), Slovakia (n = 1), the Czech Republic (n = 1), Germany (n = 1), Ireland (n = 1), Pakistan (n = 1), and the United Arab Emirates (n = 1). The sample sizes in the individual-based studies ranged from 105 to 30,097 participants. For the region-based studies, the units of analysis included counties, boroughs, communities, and social media posts.

Table 1 Overview of reviewed studies.

Fifteen of the reviewed studies applied machine learning algorithms specifically to address the potential non-linear relationships often overlooked by traditional methods. Additionally, 11 studies reported choosing machine learning approaches over traditional statistical models to achieve more accurate estimations. Furthermore, five studies highlighted the ability of ML algorithms to determine the relative importance of various predictors for SWB as a primary reason for favoring ML rather than traditional methods. Notably, two of the reviewed studies did not clearly specify their rationale for using machine learning approaches. Regarding the reporting and interpretation of ML results, only 8 out of 25 studies illustrated and interpreted the identified non-linear relationships using approaches such as partial dependence plots. Interestingly, 6 studies claimed to use ML algorithms to uncover non-linearity issues, yet did not report or interpret any results regarding non-linearity (Cohen and Stutts, 2023; Lin et al., 2021; Osawa et al., 2022; Samani et al., 2020; Schreuder et al., 2016; Wu et al., 2020). As for the reporting of relative importance, 24 out of 25 studies reported such results. The study by Buizza et al. (2022) did not report results of relative importance as was done by other studies. Instead, they primarily focused on identifying the key predictors through regression tree splits without explicitly providing relative importance scores in a numerical or graphical format, thereby limiting the comparability of their findings to those of other reviewed studies.

Methods for measuring SWB

Based on our literature review, we found that 20 out of 25 papers adopted self-report measurements of SWB. Thirteen of these measured eudaimonic well-being, including ten papers on psychological well-being, two on general health, and one on quality of life. Additionally, ten papers measured hedonic well-being, five of which focused on happiness, two on life satisfaction, and three on other measures. Notably, two papers adopted multiple SWB measurements: one measured both perceived general health and successful aging (Mayo et al., 2021), and another measured happiness, life satisfaction, and psychological well-being (Wu et al., 2020).

Apart from papers using self-reported measurements, two studies utilized professional diagnoses of psychotic symptoms (Antonucci et al., 2021) and respiratory disease exacerbations (Samani et al., 2020) as indicators for SWB. Furthermore, three papers employed objectively measured indicators, including public emotion extracted via semantic analysis (He et al., 2024), urban vitality (Ming et al., 2024), and multiple health indicators reflecting length and quality of life (Wei et al., 2022).

Current implementation of machine learning methods

Machine learning approaches differ from traditional linear modeling primarily in their data-driven nature, allowing them to identify complex patterns and relationships among variables without relying on predefined assumptions. This flexibility enables ML models to explore relationships within large datasets, uncovering key variables and interactions that might otherwise remain unnoticed.

In the reviewed studies, various ML techniques were employed to explore the determinants of SWB. As shown in Fig. 2, the most frequently employed machine learning method in these studies was the random forest (n = 10), followed by artificial neural networks (n = 9), gradient boosting (n = 7), and decision trees (n = 5). Notably, five studies employed multiple machine learning algorithms and reported the best-performing models based on various accuracy indicators, such as the area under the receiver operating characteristic curve (AUC), root-mean-square error (RMSE), and R² (Etminani-Ghasrodashti et al., 2021; Helbich et al., 2020; Lin et al., 2021; Song et al., 2023; Wei et al., 2022).

Fig. 2
figure 2

Machine learning method.

A notable trend of the reviewed studies was the considerable variation in sample sizes and the number of features included in the ML models. The sample sizes ranged from as few as 96 to over 121,000, reflecting a broad diversity in study design and data availability. Interestingly, more than half of the studies (n = 14) utilized sample sizes smaller than 1000, with three studies including fewer than 200 samples (Andrejiová et al., 2019; Antonucci et al., 2021; Wu et al., 2020). This variation in sample size raises questions about the suitability of ML methods in smaller datasets, as ML models typically require larger samples to avoid overfitting and to ensure reliable results. Regarding the number of features included, most studies (n = 21) inputted more than 10 features into their ML models, reflecting an effort to capture the multidimensional nature of SWB. However, four studies included only 2–6 features (Andrejiová et al., 2019; Antonucci et al., 2021; Cohen and Stutts, 2023; Salameh, 2023), which may limit the ability of the models to fully explore the factors affecting SWB. While these studies reported good model performance, they also face a higher risk of overfitting, especially given the small sample sizes. Conversely, one study initially included 2,018 features in its ML model but ultimately identified 183 as relevant predictors, reporting the top 20 most important factors in their findings (Song et al., 2023). This example illustrates the capacity of ML models to handle high-dimensional data, allowing them to analyze a vast number of potential predictors simultaneously. However, it also underscores the critical role of feature selection in ensuring model interpretability, efficiency, and robustness.

The reviewed studies also exhibited substantial variation in the reporting of model performance. Building on the diversity of ML techniques used, the application of multiple performance metrics in five studies (Etminani-Ghasrodashti et al., 2021; Helbich et al., 2020; Lin et al., 2021; Song et al., 2023; Wei et al., 2022) is a notable strength. By reporting different metrics such as accuracy, AUC, RMSE, and R², these studies were able to assess the models from various angles—prediction accuracy, classification performance, and regression error. This approach strengthens the validity of the findings and helps mitigate the limitations of relying on a single metric. For example, AUC is particularly valuable in classification tasks, especially when the data is imbalanced (Wang et al., 2021), while RMSE provides insight into how well the model predicts continuous outcomes (Hodson, 2022). However, there is a noticeable lack of consistency in the metrics used across studies. While accuracy was frequently reported (in 5 studies), it might not always provide a complete picture, especially when dealing with imbalanced datasets or complex models. AUC (in 5 studies) and RMSE (in 8 studies) offer complementary perspectives that can capture different aspects of model performance, but only a few studies utilized these metrics. Moreover, the 5 studies that did not report any performance metrics. Without clear reporting on how well the models performed, it becomes difficult to assess the reliability of the findings.

Key environmental factors influencing SWB

Based on their relative importance, the top three predictors of SWB, regardless of the type, have been selected. Figure 3 demonstrates the selected associations between environmental factors and SWB across the individual-based studies. The reviewed studies cover all aspects of the geographic environment, including the built, natural, and social environments. Associations between the built environment and SWB were more frequently reported. Specifically, the accessibility of services and facilities such as supermarkets, transport, hospitals, and parks positively influenced happiness, psychological well-being, and quality of life (Barykin et al., 2023; Eder et al., 2021; Zhang and Dong, 2023). Additionally, land use mix and population density have been linked to respiratory disease exacerbations and levels of happiness (Samani et al., 2020; Yin and Shao, 2021). Within the natural environment, greater satisfaction with noise level, air quality, and temperature can have a positive influence on happiness (Antonucci et al., 2021), while exposure to air pollution was associated with respiratory disease exacerbations (Samani et al., 2020). Regarding the social environment, feelings of safety were the most commonly reported factor contributing to SWB (Li et al., 2024; Mayo et al., 2021; Schreuder et al., 2016). Additionally, a sense of belonging to one’s neighborhood also plays an important role in enhancing life satisfaction (Morgan et al., 2012).

Fig. 3
figure 3

The identified key environmental factors for SWB.

Several studies have provided evidence of non-linear and threshold effects of environmental determinants on SWB (Li et al., 2024; He et al., 2024; Yin and Shao, 2021). For example, Li et al. (2024) and Helbich et al. (2020) highlighted a non-linear relationship between neighborhood safety and SWB. Their findings suggest that when safety conditions reach a moderate level, further improvements in safety lead to substantial increases in well-being. Similarly, studies on access to green space have shown non-linear patterns, with well-being increasing significantly as green space access improves, but reaching a plateau once individuals have sufficient proximity to parks or natural areas (Huang et al., 2023). This suggests that while initial increases in green space accessibility have a substantial impact on SWB, further increases might not contribute significantly beyond a certain threshold.

Additional factors influencing SWB

Among the three main environmental determinants of SWB identified in individual-based studies, 33 social factors were reported (Fig. 4). These factors served as controls features or variables of interests in the reviewed studies. As these factors were found equally and even more important in explaining individual SWB outcomes (Helbich et al., 2020), we therefore synthesize findings on the social factors for SWB in this review. The social determinants can be categorized into four main groups: sociodemographic factors, emotional predictors, family predictors, and social capital.

Fig. 4
figure 4

The identified key social factors for SWB.

For sociodemographic factors, age, gender, and income were commonly reported as key determinants of SWB. Age, in particular, influenced happiness (Huang et al., 2023), quality of life (Etminani-Ghasrodashti et al., 2021), and workplace health problems (Andrejiová et al., 2019). Additionally, income had a positive impact on happiness (He et al., 2024; Huang et al., 2023; Yin and Shao, 2021), as did having a spouse (Osawa et al., 2022).

Emotional factors, including avoidance, loneliness, negative self-focus, positive re-evaluation, and self-efficacy, were also identified as significant predictors of SWB. For example, lower levels of loneliness and a greater sense of safety were beneficial for successful aging, particularly among older individuals living with HIV (Mayo et al., 2021). Furthermore, Salameh (2023) demonstrated that higher levels of self-efficacy, bolstered by social capital, can significantly enhance psychological well-being. On the other hand, perceived stress had a negative impact on SWB, emphasizing the importance of stress management in maintaining overall well-being (Morales-Rodríguez et al., 2021).

Family-related factors, including family autonomy and control, family cohesion, family social support, maternal care, maternal overprotection, and parental care, play a crucial role in predicting life satisfaction and psychological well-being. For instance, inadequate parental care and poor family cohesion were associated with a higher risk of suicidal behavior (Cohen and Stutts, 2023). These family dynamics are vital in shaping individuals’ emotional and psychological health (Antonucci et al., 2021; Cohen and Stutts, 2023; Morgan et al., 2012).

Factors related to social capital, such as school engagement, maintained social capital, bonding social capital, and social support, were also identified as important determinants of SWB. Social capital, especially in the form of strong social networks and support systems, significantly contributed to life satisfaction and psychological well-being (Morgan et al., 2012). In addition, school engagement and social support positively influenced life satisfaction and psychological well-being among students and workers (Buizza et al., 2022; Cohen and Stutts, 2023).

Several studies highlight the non-linear relationships and threshold effects of the social determinants of SWB (Helbich et al., 2020; Huang et al., 2023). Huang et al. (2023) analyzed accumulated local effects plots and identified a U-shaped relationship between age and SWB, with older adults reporting higher levels of SWB than middle-aged individuals. They also observed a non-linear association between income and SWB, wherein the positive impact of income plateaued beyond a certain threshold (140,000 CNY/year). Similar patterns were reported by Helbich et al. (2020), who found that middle-aged individuals were at the highest risk of depression, and the protective effect of income against depression diminished for those in the “high” and “very high” income brackets. These findings suggest that while financial stability is crucial for well-being, an excessive emphasis on economic growth does not necessarily lead to further improvements in SWB. Furthermore, the U-shaped relationship between age and SWB highlights the complex variation of SWB across life stages. Middle-aged individuals may experience heightened stress due to work and family responsibilities, whereas older adults, despite potential health concerns, may benefit from greater emotional stability, life satisfaction, and social support, ultimately contributing to higher SWB.

Discussion

This systematic review highlights the significant potential of ML approaches in understanding the determinants of SWB. A comprehensive analysis of 25 studies revealed that ML offers powerful tools to explore environmental determinants of SWB, providing insights that traditional statistical methods might overlook. The ability of ML to handle large, complex datasets and identify non-linear relationships marks a methodological shift in SWB research. However, the application of ML in SWB research is not without challenges, including issues related to model interpretability, the need for large datasets to ensure generalizability, and inconsistencies in performance reporting. Additionally, some studies employ ML without clear methodological justification, potentially prioritizing novelty over rigor. This review summarizes these challenges while illustrating the advantages of ML over traditional methods. We discuss how ML enhances the detection of non-linear relationships and context-dependent interactions, allowing for a deeper understanding of SWB determinants. Furthermore, we emphasize the methodological considerations necessary to improve the robustness, transparency, and applicability of ML-based SWB research, ensuring meaningful advancements in the field.

Summary of the key findings and comparison with traditional research outcomes

This review identified several key determinants of SWB across the studies, categorized into environmental and social factors. Key environmental determinants identified include service accessibility, population density, air quality, and safety. These factors significantly influence both positive and negative emotions and experiences. These findings partially align with those from traditional statistical review studies, which similarly emphasized the importance of green space, neighborhood infrastructure, and safety as primary contributors to SWB (Andalib et al., 2024; Clark et al., 2007; Kodali et al., 2023). While the positive effects of green space have been consistently reported, the associations between other built and social environmental factors and SWB are less consistent (Clark et al., 2007; van Kamp et al., 2003). For example, while some studies suggest that population density exacerbates stress and diminishes well-being (Mouratidis, 2021), others indicate the opposite, highlighting potential benefits such as improved social interactions (Želinský et al., 2021). This inconsistency may stem from the limitations of traditional statistical methods, which, although capable of providing clear and actionable recommendations based on linear relationships, often fail to capture the complex, non-linear, and interactive effects between multiple variables.

In addition to environmental factors, the social determinants, such as sociodemographic factors, emotional predictors, family predictors, and social capital were also frequently reported by the reviewed studies. Age, income, and health status were commonly reported as key factors influencing SWB (Andrejiová et al., 2019; He et al., 2024; Huang et al., 2023; Li et al., 2024). The findings from ML-based analyses align with those obtained from traditional models, where income and health are consistently identified as primary contributors to SWB (Diener et al., 2018; Tay and Diener, 2011).

ML-based studies offer key advantages over traditional statistical methods by uncovering non-linear relationships, identifying intricate variable interactions, and enhancing predictive accuracy. For example, Huang et al. (2023) demonstrated that green space accessibility influences SWB non-linearly, with benefits plateauing beyond a certain threshold, emphasizing the limitations of traditional linear models in capturing saturation points. Furthermore, Helbich et al. (2020) identified pronounced variable interactions for social cohesion, age, employment, and education in predicting depression, illustrating the complex interactions that ML models can uncover. In contrast, traditional statistical methods tend to assume additive effects, potentially leading to oversimplified conclusions about the influence of environmental and social factors on SWB. For instance, Helbich et al. (2020) compared traditional regression models with ML-based approaches and found that regression models failed to capture the non-linear relationship between various physical and social characteristics (e.g., social cohesion and green space) and well-being. Similarly, Wei et al. (2022) demonstrated that linear models often underestimate the interplay between population health outcomes and the predictors, which ML techniques could more accurately model. These findings highlight that ML approaches provide a more sophisticated understanding of the multi-dimensional and context-dependent factors influencing SWB, allowing researchers to move beyond the restrictive assumptions of traditional statistical frameworks.

Furthermore, ML-based studies enhance SWB research by providing detailed rankings of feature importance, offering insights into the most influential factors shaping well-being. Unlike traditional regression models, which determine variable significance through p-values and effect sizes (β-coefficients), ML models rank predictors based on their contribution to overall model performance. This approach enables a hierarchical and context-dependent assessment of influence. For instance, Zhang et al. (2023) applied a RF model to identify key environmental determinants of SWB among the elderly. Their findings revealed that access to parks was the most critical factor in promoting well-being, whereas land use mix—previously shown to have inconsistent associations with SWB in regression-based studies—was not a significant contributor. Similarly, Lin et al. (2021) compared GBDT with ordinal logistic regression and found that ML models assigned greater importance to air quality and temperature satisfaction in predicting individual happiness. Moreover, Song et al. (2023) utilized ML-based feature importance rankings to identify key predictors of SWB, demonstrating the effectiveness of ML in capturing influential factors across different population subgroups.

Challenges of current ML implementation in SWB research

While the reviewed studies demonstrate the potential of ML in advancing SWB research, several critical gaps must be addressed. A primary concern is the interpretability of ML models. While models like artificial neural networks and gradient boosting offer high predictive accuracy, they often operate as “black boxes,” making it difficult to interpret the results and understand the underlying mechanisms driving the predictions (Boelaert and Ollion, 2018). To address these concerns, eXplainable AI (XAI) techniques, such as SHapley Additive Explanations (SHAP), Local Interpretable Model-agnostic Explanations (LIME) and Partial Dependence Plots (PDP), are gaining traction (Lundberg and Lee, 2017; Ribeiro et al., 2016). These methods provide interpretable results by offering insights into how individual variables influence model predictions, thus enhancing transparency. Notably, only eight of the 25 reviewed studies presented the identified non-linear relationships, with even fewer providing detailed interpretations of these findings. Studies such as Huang et al. (2023) and Osawa et al. (2022) have demonstrated the usefulness of these techniques in improving transparency and facilitating the responsible application of ML in SWB research.

Additionally, the reviewed studies highlighted issues related to data quality and availability. Sample sizes varied widely, ranging from 96 to 121,270 participants, with 14 studies using samples of fewer than 1000. Smaller sample sizes can lead to model overfitting, where the model becomes too closely tailored to the training data, capturing noise or random fluctuations rather than the underlying patterns. As a result, the model may perform exceptionally well on the training data but poorly on new, unseen data, leading to unreliable predictions and reduced generalizability. On the other hand, increasing model complexity through large sample sizes and a higher number of variables makes ML models more computationally intensive and difficult to manage, particularly when datasets contain missing values or inconsistent variable definitions. Studies such as Song et al. (2023) emphasize that while ML can efficiently handle large datasets, excessive model complexity may reduce interpretability and limit practical applications in SWB research. In their study, the initial feature set consisted of 2018 variables, but through systematic feature selection, only 183 were ultimately identified as relevant predictors, with the top 20 reported in their findings. While ML algorithms can technically incorporate hundreds or even thousands of features, an excessive number of input variables can introduce noise, increase computational complexity, and lead to overfitting. Therefore, systematic feature selection is essential to retain only the most meaningful predictors and improve model efficiency. For instance, Wei et al. (2022) adopted a Percentile-based Variable Selection approach to filter out less influential variables based on their relative importance, reducing the feature set from 58 to 29. This process not only improved computational efficiency but also enhanced model interpretability by focusing on the most relevant predictors of SWB.

The inconsistent reporting of model performance metrics complicates the evaluation of ML applications in SWB research. Studies used a variety of metrics, including accuracy, AUC, RMSE, and R², but there was no standard approach—some studies relied on a single metric, while others reported multiple. This inconsistency hinders direct comparisons between studies in SWB research. Notably, five studies did not report any performance metrics, making it difficult to assess the reliability and generalizability of their findings. Clear and consistent reporting is crucial to ensure replicability and enable meaningful comparisons across studies.

Finally, a significant challenge for current ML studies on SWB is the lack of rigorous justification for selecting ML approaches over traditional statistical models. The review revealed that two out of the 25 studies failed to clearly explain why ML was chosen, and among studies claiming using ML approach to reveal the potential non-linear relationship, 6 studies did not provide results interpreting the non-linear relationships. An uncritical assumption that ML models are inherently superior to traditional regression methods should be approached with caution, as ML-generated associations do not always produce theoretically robust results. For instance, a regional study in London examining environmental influences on mental health identified fire-related variables as key predictors of well-being, despite a lack of clear theoretical justification for this association (Wu et al., 2020). This highlights the importance of integrating domain knowledge into ML research to ensure that findings are both interpretable and theoretically grounded. Researchers must clearly articulate their rationale for selecting ML models, demonstrating the specific advantages gained over traditional methods. Empirical results that substantiate these advantages should be provided, ensuring that the choice of analytical method aligns with the research objectives and enhances the study’s overall quality.

Future directions

The findings of this review suggest several avenues for future research. There is a clear need for the development of ML models that strike a balance between predictive accuracy and interpretability. Techniques such as explainable AI could be further explored to enhance the transparency of ML models in well-being research, allowing for more actionable insights. Additionally, longitudinal studies that leverage ML to track the evolution of well-being determinants over time could provide deeper insights into how changes in social and environmental factors influence well-being (Huang et al., 2023). Furthermore, standardizing performance metrics can improve comparability and reproducibility across studies. While some studies reported accuracy, others used metrics like AUC, RMSE, and R², leading to inconsistencies in model evaluation. Ensuring the consistent reporting of these metrics will facilitate meaningful cross-study comparisons. Finally, enhancing data quality is critical for improving the robustness and generalizability of ML findings. Utilizing high-quality, multi-source datasets can mitigate biases and reduce overfitting, ensuring that ML applications in SWB research produce reliable and actionable insights.

Strengths and limitations

This systematic review provides a comprehensive synthesis of the existing literature, focusing on the application of ML techniques in understanding the determinants of SWB. By systematically categorizing and analyzing studies that use advanced ML methods, this review highlights the growing importance of these techniques in social science research, offering a clear overview of current trends and methodologies.

However, this review also has limitations. First is the potential for publication bias. The review primarily includes peer-reviewed articles, which may overrepresent studies with positive findings or those that demonstrate the successful application of ML techniques. Additionally, the review is limited by the scope of the databases searched and the inclusion criteria used, which may have excluded relevant studies published in languages other than English or those not indexed in the selected databases. As a result, the grey literatures, such as unpublished manuscripts and government reports, as well as the relevant studies published in languages other than English or those not indexed in the selected databases may have been excluded. To address these limitations, future research should consider incorporating unpublished studies, such as preprints, reports, and government documents, and utilize a broader range of databases to ensure a more comprehensive representation of the field.

Conclusion

In conclusion, this review offers important insights into the application of ML techniques in SWB research, shedding light on both the opportunities and challenges these methods present. While ML provides substantial benefits in analyzing complex datasets and uncovering non-linear relationships, the review also emphasizes significant challenges, including overfitting, data quality, and difficulties in interpretation. By bringing these issues to the forefront, this review paves the way for future research to refine ML approaches, thereby enhancing their utility in advancing our understanding of SWB and informing the development of more effective and equitable policies.