Introduction

Societal trust is a well-established enhancer and facilitator of public health. Differential levels of societal trust correlate with differential public health outcomes for a range of interventions, with vaccine acceptance1, preventative screening2, and diagnostic testing3 all being positively associated with increased trust in healthcare providers, science, and/or the government. The role that trust—defined as the belief and conviction that an entity will do what they say they will4,5—plays in compliance and adherence to public health guidelines becomes even more pronounced in acute crises such as contagious disease outbreaks, where guidance may change rapidly as new information is learned and the scale of the outbreak evolves. In 21st-century epidemics, including SARS-CoV-1, H1N1, and Ebola, higher trust in government, media, and medical organizations was linked to increased adherence to public health recommendations6,7, decreased anxiety8, and higher vaccine uptake9.

This association between increased trust and improved public health outcomes has also been observed in the COVID-19 pandemic at both the individual and population levels. Higher societal trust correlated with not only better individual uptake of preventative measures10,11,12,13 but also better compliance with country-level public health measures and better COVID-19 health outcomes. In Europe, “lockdown”-style policies designed to slow and prevent transmission reduced mobility by 14–16% more in regions with high political trust than in areas with low political trust14 and globally, countries with over 40% of individuals reporting a high level of generalized interpersonal trust achieved a dramatic reduction in cases and deaths following spikes in COVID-19 infections15. Many studies have furthermore tied higher levels of institutional and interpersonal trust to lower infection fatality rates and fewer COVID-19 and excess deaths per million16,17,18,19,20, as well as higher vaccine confidence21.

The results from these studies, while affirming the importance of trust for COVID-19 health outcomes, also raise questions about the nuances of this multiplex relationship. COVID-19 outcomes may be impacted not only by varying degrees of trust, but also by different types of trust—such as interpersonal vs. institutional—that, due to their different underlying mechanisms, might act on metrics like mortality rates and vaccine acceptance in contrasting ways22,23,24,25,26,27. This is particularly the case for vaccination rate: studies on the relationship between trust and vaccination hesitancy and uptake have previously and conflictingly suggested that those hesitant to receive the COVID-19 vaccine have significantly lower levels of institutional trust than interpersonal trust25; that increased interpersonal trust has a negligible negative association with the probability of an individual receiving a COVID-19 vaccine28; and that increased interpersonal trust has a significant positive association with COVID-19 vaccination29. Better disentangling and understanding the tensions between institutional and interpersonal trust is crucial to optimizing public health responses to future infectious disease outbreaks and other public health emergencies. Key unanswered questions include: what types of trust, both institutional and interpersonal, are most strongly associated with COVID-19 outcomes at a country level? Do we see meaningful differences in COVID-19 outcomes across countries with varying levels of interpersonal and institutional trust?

This study aims to answer these questions by analyzing the relationship between measures of various types of trust and COVID-19 outcomes of interest. Specifically, we examine the relationships between specific types of trust—both institutional and interpersonal—and three quantitative COVID-19 health outcomes. We use trust items from Wave 7 (2017–2022) of the World Values Survey30 and link them with three COVID-19 outcomes: confirmed COVID-19 deaths per million (2020–2022), vaccination rate of at least one dose per 100 people (2020–2023), and estimated excess deaths per million (2020–2021), sourced from publicly available data from JHU CSSE (Johns Hopkins University Center for Systems Science and Engineering)31 and the World Health Organization (WHO)32. The World Values Survey (WVS) is a worldwide research program that periodically conducts cross-national, cross-sectional surveys measuring sociocultural values. All of the WVS survey data is publicly available on the WVS website30. The WVS data is increasingly leveraged as a resource in public health research to illuminate the intersections between social values and health outcomes17,33,34,35,36,37. Using questions from the WVS survey section “Social Capital, Trust and Organizational Membership,” we form aggregated metrics that capture the countrywide level of trust in various entities, including trust in domestic government, in the WHO, in people one knows personally, and in people one meets for the first time. Additionally, we incorporate five publicly available country-level markers of development as potential confounders—gross domestic product (GDP) per capita, average life expectancy from the World Bank38, average years of formal schooling for individuals aged 15 to 64 from the Barro-Lee dataset (processed by Our World In Data)39, the percentage of a country’s population that lives in urban areas from Our World In Data40, and the Freedom Score democracy index from Freedom House41—for their potential to impact both our trust covariates and our COVID-19 outcomes.

We assess the ecological relationship between country-level trust measures and COVID-19 outcomes through regression and clustering techniques using data from 61 countries worldwide after adjusting for covariates. To evaluate which of the 23 individual trust items are most strongly associated with COVID-19 outcomes, we regress our country-level COVID-19 outcomes on each trust item one at a time, adjusting for country-level features. Then, to compare national-level COVID outcomes across these trust items jointly, we consider two-step clustering and regression methods to first assign each country to a cluster (regardless of outcome), then regress COVID-19 outcomes on countries’ trust-based cluster membership. Lastly, we use a one-step method to simultaneously cluster and regress our COVID-19 outcomes on trust, where the clusters are themselves informed by the outcome of interest. Figure 1 visualizes this process and our various methodological approaches. From this analysis, we draw conclusions about the relationship between the enumerated types of trust and COVID-19 outcomes and suggest opportunities for incorporating measures of trust into public health modeling.

The rest of the paper is organized as follows. In the results section, we first describe the general structure of the individual trust items we extract from the WVS and present descriptive analyses of these items and our COVID-19 outcomes of interest. We then report our findings from our ecological regression (ordinary regression conducted at the country-level as opposed to individual survey respondent level), adjusting for country-level confounders, and identify how many individual trust items are significantly associated with COVID-19 outcomes. We then report on the two different approaches for our clustering and regression analysis (two-step and joint), demonstrating that clustering countries in terms of vectors of trust is a useful grouping for understanding COVID-19 outcomes. These two approaches allow us to both identify specific kinds of trust that are potentially informative for COVID-19 outcomes and how overall country trust profiles inform COVID-19 outcomes. We supplement our data-driven analysis with a sub-scale analysis where the trust items are thematically grouped, as opposed to data-driven clusters (see Supplementary Material). We conclude with a discussion of our findings and implications for future research, particularly the possibility of using country-level trust metrics to improve nowcasting exercises from compartmental disease transmission models. In methods that follow the results, we further detail the data extraction and summarization from the WVS and the exact analytical methods used for our analysis.

Fig. 1
figure 1

A flow chart describing the data sources used for analysis, our study objective, our various study methodologies, and our analytical outputs.

Results

Trust items

Of the 33 items in the standard questionnaire of the WVS Wave 7 that ask participants about their trust or confidence in a variety of entities, we selected 23 thematically relevant items for our research questions of interest. These 23 items capture various types of both institutional and interpersonal trust. Supplementary Table S1 contains a complete list of the 33 WVS questions and those included in our analysis. All items have ordinal response categories for the level of trust (ex. “Do not trust at all,” “Trust completely”) that we converted to a numerical value, with a higher number signifying greater trust. We then calculated the proportions of individuals responding in each category for each country and used these proportions and our numerical category values to create weighted country-level score metrics for each question; see Methods for further details on this process. All trust items are coded such that the lowest possible score means that all individuals in a country reported having the lowest level of trust in the entity questioned, and the highest possible score means that all individuals in a country reported having the highest level of trust in the entity questioned.

We conducted our primary analysis on individual trust items following a data-driven approach that would not over- or under-weight any trust item, as would happen in an analysis of sub-scales formed by combining/aggregating unequal numbers of items. However, it is a common practice to create such contextual, qualitatively driven aggregated sub-scales of trust, grouping thematically related trust items together. In order to supplement our primary, single-item analysis, we also conducted a parallel sub-scale analysis that grouped 20 of the 33 questions into seven distinct trust subdomains, based on a qualitative thematic analysis of the question topics (Supplementary Table S2.1): trust in international organizations (4 items), trust in domestic governance (6 items), trust in the WHO (1 item), trust in universities (1 item), trust in media (2 items), trust in community (3 items), and trust in strangers (3 items). Further details on how these subdomains were calculated are available in Supplementary Material Secion 2.

Descriptive statistics

Of the 66 countries participating in WVS, 61 were included in our analysis (see Methods for further details and Supplementary Figure S1 for a map of country coverage). These 61 countries include most countries in Asia and North and South America, with more limited coverage in Africa and Europe. Table 1 provides summary statistics for our covariates, country-level features, and outcomes for all 61 countries included in our analysis. The trust items with the greatest standard deviation were those asking about domestic governance (such as question 71, trust in government, and question 73, trust in parliament); these questions also had the means closest to the center of the score’s range. Those with the smallest variation were those addressing interpersonal trust with “in” groups like family, neighbors, and friends; these trust items also had the highest averages. Our country-level features and outcomes of interest all demonstrate considerable spread across countries.

Table 1 Summary statistics (mean and standard deviation) for all trust item scores (from WVS Wave 7), country-level features (for the year 2019, sourced from the World Bank and Our World in Data), and COVID-19 outcomes (sourced from the WHO and JHU CSSE) at a country-level, for all 61 countries included in our dataset.

Our correlation analysis (see Supplementary Figure S2) of all trust items, country-level features, and outcomes suggested strong multicollinearity between trust items, particularly those concerned with institutional trust. In general, the direction of this correlation was positive: trust appears to beget trust. Within our outcomes, we observe a strong positive correlation between confirmed COVID-19 deaths per million and estimated excess deaths per million (Pearson correlation coefficient \({\uprho\:}=0.69\), p-value < 0.05), though there is low correlation between vaccination and death rates. Within our country-level features, we observe a high positive correlation between all variables (all with \({\uprho\:}>0.50\), p-value < 0.05). Despite this high positive correlation, the variance inflation factor suggested low multicollinearity (below 3 for all covariates). Our correlation analysis also identifies negative correlations between trust items and national death rates, and positive correlations between most trust items and vaccination rate.

Ecological regression

To explore how different kinds of trust impact country-level COVID-19 outcomes, we developed a series of ecological regression models for our outcomes of interest. Our analysis consisted of “single item” regression models—models where we regress our outcomes on one trust item, adjusting for country-level features—for all 23 questions in our analysis (Table 2). Within single item regression, adjusting for multiple testing via a Bonferroni correction, five questions (22% of all questions) were significantly negatively associated (p-value < 0.002) with COVID-19 deaths per million; six (26% of all questions) were significantly positively associated and one (4% of all questions) was significantly negatively associated with vaccination rate per 100 people; and thirteen questions (56% of all questions) were significantly negatively associated with estimated excess deaths per million.

For our subscale analysis, we conducted both “single subdomain” regression models as above and additionally explored a joint regression of all seven subdomains and confounders at once. These analyses returned similar findings and clearly highlighted the tension between interpersonal and institutional trust’s impacts on COVID-19 outcomes, as discussed in the introduction. While increased “trust in community” (a measure of interpersonal trust) was associated with a decrease in excess deaths per million (−157.30 fewer deaths per million for one increased unit of trust, 95% CI [−283.19, −31.42]), it was also associated with a decrease in vaccination rate (−1.23 fewer vaccinations per 100 people for one increased unit of trust, 95% CI [−2.20, −0.26]) in our joint subdomain regression models (Supplementary Table S2.4). In comparison, increased “trust in universities” (a measure of institutional trust) was significantly associated with only desirable outcomes (fewer COVID-19 deaths per million, more vaccinations per 100 people). Complete results of the analysis at this aggregated sub-scale level are summarized in Supplementary Tables S2.2–4.

Table 2 Results from our series of single item regression models, where each COVID-19 outcome of interest is regressed on one trust item, adjusting for GDP, life expectancy, educational attainment, urbanicity, and Freedom Score.

Two-step clustering and regression

To explore how countries may be grouped based on their levels of trust, and how these groupings may relate to COVID-19 outcomes, we sought to identify trust-based clusters and regress COVID-19 outcomes on these clusters. We first conducted a two-step clustering and regression method, where the clustering on the 23 trust items and regression of COVID-19 outcomes were done sequentially. We used two clustering methods for this two-step analysis: k-means clustering and Gaussian mixture model (GMM) classifications. Based on various statistics used for identifying the optimal number of clusters, \(k=3\) components were selected for both methods (see Methods for further details). For our GMM analysis, we used the maximum posterior probability of cluster assignment to assign countries to clusters, as there was a high degree of certainty in cluster assignment: the maximum posterior probability of cluster assignment was above 0.95 for all 61 countries.

In both methods, there was decent visual separation between clusters and balanced distribution of cluster membership across countries (see Fig. 2). There was also decent (66%) agreement between the two clustering methods in terms of country assignments, though the GMM clusters were more imbalanced than those identified by k-means, with the “medium trust” cluster containing only seven countries. To interpret these clusters, we compared the average trust subdomain scores across the k-means and GMM clusters (Table 3). Both methods divided into groups of countries with “high trust,” “low trust,” and “medium trust” in the trust items. The interpretation of these clusters is largely the same across these two methods: the “high trust” cluster has the highest average overall trust and the highest level of institutional trust in both methods, and “low trust” the lowest; the “medium trust” cluster, for both methods, is more ambiguous, with the highest average interpersonal trust levels but middle-of-the-pack institutional trust. In terms of the country-level features, which were not used in cluster formation, across both methods, the “medium trust” clusters have the highest average GDP, years of education, and Freedom Score; the “high trust” clusters have the lowest average urbanicity and Freedom Score; and the “low trust” cluster has the lowest average GDP. Cluster interpretation is bolstered by analyzing specific country assignment; for example, China, a country recognized for reporting high levels of overall trust42 is assigned in both methods to the “high trust” cluster. See Supplementary Table S2 for a full list of country cluster assignments.

Table 3 Average trust item scores and mean country-level feature values across clusters in both methods.
Fig. 2
figure 2

Visualization of the three clusters identified by k-means (left) and GMM (right) on the first two principal components of the 23 trust items.

To assess the influence of trust cluster membership on COVID-19 outcomes, we fitted separate regressions of each outcome of interest on cluster membership indicators, adjusting for country-level features. Table 4 contains the point estimates and confidence intervals for all regression outputs, where the coefficient estimates compare the mean difference in outcome in one cluster compared to our reference cluster (the “high trust” cluster), as well as the mean outcomes and standard errors for each cluster, for both k-means and GMM. For all three outcomes across both methods, there was a statistically significant difference (all p-values < 0.01) in the mean expected outcome between a “high trust” country and a “low trust” one, with the “low trust” countries having significantly worse health outcomes (more COVID-19 and excess deaths, lower vaccination rates) than the “high trust” ones. The effect size of this difference was formidable: for GMM, the membership of a “low trust” cluster was associated with 1200.6 more COVID-19 deaths per million (95% CI [510.9, 1890.3]), 2289.1 more excess deaths per million (95% CI [971.1, 3607.2]), and a 16.6 decrease of vaccination rate per 100 people (95% CI [−27.7, −5.6]) in comparison to the “high trust” cluster. Though there was always a significant difference between the “high trust” and “low trust” countries across clustering methods, the membership of a “medium trust” cluster versus a “high trust” did not result in a statistically significant difference for any of our outcomes of interest (other than vaccination rate in the k-means clusters). Similar findings were observed for our supplemental sub-scale analysis (see Supplementary Tables S2.5 and S2.6 and Supplementary Figure S2.2).

Sensitivity analysis for clustering

In all our adjusted models, we use a core set of five confounders measured for each country at a national level. To assess the robustness of our results to the choice of confounders, we expanded our initial set of five confounders by adding three more potential confounders suggested by a reviewer that may be relevant to our analysis: hospital beds per 1000 people43, the percentage of the population over 6544, and the government effectiveness index45,46. We then assessed the robustness of our results to choice of confounders we adjust for by plotting the adjusted effect estimate (adjusted mean difference in outcomes in high and low trust group for k-means clusters) across all possible 28 choices of covariate adjustment, sifting through 256 regression models (Supplementary Figs. 4–6). We also calculated the E-value47 to assess the effect of unmeasured confounders. The results show that our reported estimates remain robust across adjustment choices and that the unmeasured confounder must have a very strong effect on both COVID-outcomes and trust to negate the significant results we report as indicated by a large E-value.

Table 4 Results from regression on cluster membership, adjusting for country-level features, with the reference group being the “high trust” cluster, and the mean COVID-19 outcomes and standard deviations per cluster across both methods.

Joint clustering and regression

Lastly, we carried out joint clustering and regression for each of our three outcomes of interest using a Bayesian profile regression model through Markov chain Monte Carlo. This method, where the outcomes are non-parametrically linked to our covariates through cluster membership via simultaneous clustering and regression, addresses uncertainty in cluster membership more directly than our previous methods48. As the outcome informs the clustering and regression in this method, we built three distinct models for our three outcomes of interest that produced three distinct cluster profiles. In order to reduce multicollinearity between the 23 trust items within the regression, only the top ten (or top five, for vaccination rate) trust items were used in each model, selected based on the single item regression p-values and effect size. The output of these models was then postprocessed to identify representative clusters that serve as a summary for all clustering explored throughout the simulations. The results of this postprocessing and clustering are in Table 5. See Methods for further details on the steps of this analysis.

In contrast to both two-step methods, our joint models identified \(K=2\) as the optimal representative clustering for the deaths per million outcomes, eliminating any kind of “medium trust” cluster, but kept a “medium trust” cluster for vaccination rate (\(K=3\) for this outcome). Most of the countries identified as “medium trust” in the two-step methods were assigned to the “high trust” cluster in the joint models, reinforcing the idea that the largest gap is between “high trust” and “low trust” countries (see Supplementary Table S2). The average silhouette width, which evaluates clusters by measuring how similar a subject is to other subjects in its cluster49 was above 0.8 for the deaths per million outcomes, suggesting strong cluster cohesion and separation. However, the average silhouette width was only 0.59 for vaccination rate, suggesting there is more ambiguity between clusters in this model.

Similar to the two-step clustering, the “high trust” cluster countries have higher average trust item scores than those in the “low trust” cluster. These differences are more pronounced for the deaths per million models, where there is very little overlap between the highest posterior density intervals of each cluster’s average trust item scores. As seen in the two-step methods, these “high trust” and “low trust” clusters had highly different mean outcomes across all three models. Similar findings were observed for our supplemental sub-scale analysis (see Supplemental Table S2.7).

Table 5 Results from the representative clusters identified in postprocessing of our bayesian profile regression models.

Discussion

Our analysis reveals several important nuances of the relationship between trust and COVID-19 and supports the use of societal trust metrics in COVID-19 modeling. Our study has several key strengths. To our knowledge, no other country-level analysis focusing on both different types of trust and multiple COVID-19 outcomes exists in extant literature. By leveraging the WVS data, we achieve a cross-national evaluation of how both institutional and interpersonal trust relate to COVID-19 outcomes at a country level. We examine the relationship between trust and estimated excess deaths, a COVID-19 outcome that has been understudied in this context, despite excess deaths often being a more reliable measurement of COVID-19-related mortality due to its accounting of underreporting and cause of death misclassification50.

While the majority of the single item regressions suggested that increased trust was associated with better health outcomes—lower deaths, higher vaccination rates—this was not uniformly observed. While increased trust, both institutional and interpersonal, was always associated with a decrease in both confirmed COVID-19 and estimated excess deaths per million, this was not the case for vaccination rate. Though all of the statistically significant institutional trust items were positively associated with vaccination rate, the one statistically significant interpersonal trust item, trust in one’s neighborhood, was negatively associated, i.e. greater trust in this setting was associated with a decrease in vaccination rate. As discussed in the introduction, existing research on the direction of interpersonal trust’s influence on vaccination rate—a conceptually distinct outcome from deaths per million51,52—has yielded conflicting results. Our analysis provides further evidence of a negative association between interpersonal community trust—particularly trust in one’s neighborhood—and vaccine uptake. It is possible that high community trust correlates with cultural resistance to institutional authorities, as people prioritize local community structures over centralized governance53,54. It may also reflect a misplaced trust that one’s neighbors and friends would not infect each other. This high community trust may reduce willingness to get vaccinated, as seen in our analysis.

Both our two-step and joint clustering analyses found that trust-based cluster membership was statistically significant for all outcomes of interest, particularly between the “high trust” and “low trust” clusters. When it came to cluster formation, institutional trust items played the most significant role in differentiating each cluster. In particular, the most extreme across-cluster differences in trust appeared in domestic governance (the government, political parties, and elections). Average interpersonal trust levels were less important and less variable across clusters. Notably, however, they did not align with the trend we observed in the institutional trust items. In both two-step methods, the “high trust” clusters did not have the highest levels of interpersonal trust—instead, the “medium trust” clusters had the highest averages, and “high trust” and “low trust” interpersonal levels were often comparable. Here again, we see that institutional and interpersonal trust do not necessarily act in tandem. Our results suggest further research is needed to fully untangle the ways interpersonal trust influences different public health outcomes, separate from institutional trust.

Across our analyses, there was a clear difference in COVID-19 outcomes between “high trust” and “low trust” countries. The distinction between “high trust” and “medium trust” was more ambiguous, indicating there may not be a monotone positive ordering in this relationship. Only k-means clustering identified a significant difference between “high trust” and “medium trust” countries, and only for one of our three outcomes of interest: vaccination rate. This ambiguity also appears in our joint clustering and regression approach. When we included country-level features and outcome of interest in our clustering model, there was only one outcome, vaccination rate, for which including a “medium trust” cluster was chosen for inclusion. While one potential explanation for this ambiguous “medium trust” cluster is potential nonlinearity to the relationship between trust and COVID-19 outcomes, we believe the explanation for this likely lies in differences between country-level features/covariates. In the two-step approach and the joint clustering approach for vaccination rate, the “medium trust” countries differ significantly in terms of country-level features: these countries have a markedly higher per capita GDP and a higher Freedom Score than countries in either the “low trust” or “high trust” clusters (see Tables 3 and 5). These differences, particularly the greater material resources afforded by a higher GDP nation, could be effectively closing the gap between “medium trust” and “high trust” countries when it comes to COVID-19 outcomes—perhaps a relative surplus of wealth makes up for a relative dearth of trust. The mediating effect of a country’s wealth and infrastructure on the relationship between trust and health outcomes suggested by our findings offers an interesting avenue for further research. Yet regardless of this potential effect, our analysis demonstrates that material resources alone do not determine positive public health outcomes. Despite their more comparable per capita GDP and Freedom Score, the “low trust” and “high trust” clusters have significantly disparate COVID-19 outcomes in both two-step approaches. Trust does matter, and in ways we can quantitatively capture.

These clear associations between trust and COVID-19 outcomes—even when using conceptually distinct approaches—open the door to the possibility of meaningfully incorporating these trust metrics into disease transmission modeling. Many models now incorporate various time-dependent and time-invariant elements of human behavior for more accurate estimations of disease transmission55,56,57. This offers many potential pathways for country-level trust metrics’ inclusion. For example, trust metrics could be included as coefficients in the regression models, such as MERMAID58, used to estimate the rate parameters for compartmental models. These parameter estimates are increasingly based not only on specific properties of the pathogen (like duration of infectiousness) but also on external factors, such as government policy or emotional contagion59. Our study suggests that trust, including institutional trust, has the potential to be meaningfully included as external, constant effect metrics in such models.

Our study has several limitations. As the WVS is an ecological study, we must beware the ecological fallacy of treating these across-country results as indicative of within-country trends. WVS Wave 7 was interrupted by the COVID-19 pandemic, leading to about a dozen countries’ surveys being conducted after the pandemic had begun—an event which could have influenced the responses of individuals in those countries. The WVS also had limited coverage in Africa and Europe. Many European countries participate in the European Values Study60, a survey done in conjunction with the WVS but without several of the key questions, such as trust in the WHO and universities, used in our analysis. This more limited global coverage restricts the generalizability of our findings and their applicability to certain regions. Lastly, our outcome of estimated excess deaths per million, taken from the WHO, was estimated with large uncertainty intervals50. Our study does not consider the impact of this uncertainty on our findings.

Institutional and interpersonal trust have played crucial roles in the unfolding of the COVID-19 pandemic. By considering the impact of trust in disease transmission models, prioritizing a better understanding of interpersonal trust’s influence on public health outcomes, and attaining high levels of trust in science and public health, we can further improve our responses to disease outbreaks and save lives.

Methods

Data sources

We used data from the WVS Wave 7 as our source for all trust-related covariates. The WVS Wave 7 is the seventh iteration of the cross-national, cross-sectional social survey periodically conducted by the World Values Survey, a non-commercial research program. For Wave 7, over 90,000 adult respondents across 66 countries and regions were interviewed (primarily via face-to-face interviews, with some phone interviews for remote regions) with a standardized questionnaire covering social, ethical, religious, and economic values. Respondents were selected via random probability representative samples within each country surveyed, with a minimum sample size of 1,000 for all countries. Data collection began in mid-2017 and, due to COVID-19-related delays, was completed in July 202330. Further details of how we analyzed the WVS dataset are below.

Our primary COVID-19 outcomes of interest were confirmed COVID-19 deaths per million from January 1, 2020 to December 31, 2022, sourced from JHU CSSE COVID-19 Data31, the vaccination rate of at least one dose per 100 people from 2020 to 2023, and excess deaths per million in 2020 and 2021, the latter both sourced from the WHO COVID-19 Surveillance Database32,50,61.

Our confounding variables, GDP per capita (in current $USD) and total life expectancy (in years), were sourced from the World Bank38; educational attainment (average years of schooling for individuals aged 15–64) was sourced from the Barro-Lee dataset, processed by Our World In Data39; urbanicity (percentage of the country’s population living in urban areas) was sourced from Our World In Data40; Freedom Score, a measure of democracy, was sourced from Freedom House41. The 2019 metrics were used to capture the status of each country when the COVID-19 pandemic first began.

This covariate, outcome, and country-level feature data resulted in a complete dataset for 61 countries of the 66 surveyed by WVS. For Taiwan, different data sources were necessary for outcome data: its excess deaths estimate was sourced from the Economist62. Missing GDP per capita and life expectancy data from the World Bank were alternatively sourced from the International Monetary Fund63 and the CIA World Factbook64; educational attainment was sourced from a report from Taiwan’s Ministry of Education65; and urbanicity was sourced from Worldometer66. Four surveyed regions (Hong Kong, Macau, Puerto Rico, and Northern Ireland) are accounted for in their country-level outcome data (China, the United States, and the United Kingdom, respectively). The final excluded country was Egypt, due to the high levels of nonresponse for WVS survey questions.

Defining and extracting trust covariates from WVS wave 7

We sought to define trust based on the questions in the WVS Wave 7 survey that included the key words “trust” or “confidence” and were part of the standard questionnaire given to all participating countries. In social science theory, confidence and trust are widely recognized as adjacent, though not equivalent, concepts. Confidence is the belief that an entity will do what they are expected to do and trust is a thornier belief made up not only of confidence in an entity but also faith in an entity’s integrity and competence4,5. In WVS Wave 7, “trust” is used exclusively when asking about interpersonal relations and “confidence” is used exclusively when asking about institutions. We will use the term “trust” to refer to all of the WVS questions, with “interpersonal trust” corresponding to the questions actually labeled “trust” in WVS and “institutional trust” corresponding to the questions actually labeled “confidence.” This decision is motivated by the similarity between the definitions of these two concepts and existing research based on the WVS that also uses the term “trust” to refer to both items labeled “trust” and “confidence”17.

Thirty-three questions met these criteria, coded \(\text{t}=57\dots\:89\), all from the “Social Capital, Trust and Organizational Membership” subsection of the survey. Thirty-two of the 33 questions followed a parallel response structure of four ordinal and four nominal responses, while one question (Q57, “Most people can be trusted”) had two ordinal and four nominal response categories. These three data structures, alongside an example of what the data looks like for one country’s individual responses to one question, are shown in Fig. 3.

Fig. 3
figure 3

The ordinal and nominal response options for our survey questions of interest, alongside an example of this data for the United States and the question “How much confidence do you have in the government?”

Not all of the 33 eligible questions from the WVS were deemed relevant to our research question of interest, primarily because the mechanism through which trust in this entity would act upon our COVID-19 outcomes appeared conceptually ambiguous, such as with trust in the women’s movement (question 80). These thematically irrelevant questions run the risk of introducing non-informative noise to our model, as they do not pertain to our research aim. In order to clearly operationalize our research goal and remove these spurious questions, we conducted a qualitative analysis of the 33 available items. This process identified key thematic groups through which trust could act upon COVID-19 outcomes: trust in “in” interpersonal groups, such as one’s family; trust in “out” interpersonal groups, such as those of another nationality; trust in media, such as the press; trust in bodies of science, such as universities; trust in domestic governance, such as parliament; and trust in international governance, such as the United Nations. These thematic groups were used to identify questions to include in our analysis; some other individual questions were also selected based on prior literature suggesting a correlation between trust in these entities and trust in public health and medicine. For example, environmental protection is frequently classified as part of public health67 leading to the inclusion of question 79 (“Confidence: The Environmental Protection Movement”). In the end, 23 of the 33 questions were selected for our analysis. See Supplementary Table S1 for a full list of the available WVS questions and those used in our analysis.

Each country (\(\text{j}=1\dots\:61\)) in Wave 7 has some \({\text{n}}_{\text{j},\text{t}}\) individual responses to each question \(\text{t}\). As our outcomes of interest are at a population level, we sought to aggregate our trust item data from the individual level to the population level so that each country had one data point for each item. For our aggregation, we excluded the nominal non-responses and focused on the ordinal categorical data.

The total n given by the WVS dataset includes counts of non-responses. For our sample proportions, we will use an adjusted n:

$${n}_{j,t}^{\text{*}}=n-\left({n}_{-1,j,t}+{n}_{-2,j,t}+{n}_{-4,j,t}+{n}_{-5,j,t}\right)$$

where \({n}_{i,j,t}\) with \(\text{i}=\left[-5,-4,-2,-\text{1,1},\text{2,3},4\right]\) corresponds to the possible non-response or response categories \(\text{i}\) for country \(\text{j}\) at question \(\text{t}\). This adjusted \({n}_{j,t}^{\text{*}}\) thus accounts only for response values.

Each country’s count data \(\left({n}_{1,j,t},{n}_{2,j,t},{n}_{3,j,t},{n}_{4,j,t}\right)\) follow a multinomial distribution with parameters \({n}_{j,t}^{\text{*}}\) and \(\left({{\uppi\:}}_{1},{{\uppi\:}}_{2},{{\uppi\:}}_{3},{{\uppi\:}}_{4}\right)\) where \({n}_{j,t}^{\text{*}}={{\Sigma\:}}_{k}{n}_{k,j,t}\:\)is the fixed number of responses (we assume each individual response is independent and identically distributed) and there are \(\text{k}=4\:\)categories of response, such that \({{\uppi\:}}_{k}\) is the probability of observing response \(\text{k}\) and \({{\Sigma\:}}_{k}{{\uppi\:}}_{k}=1\). The multinomial log-likelihood is:

$$\text{L}\left({{\uppi\:}}_{j,t}\right)={{\Sigma\:}}_{k}{n}_{k,j,t}\text{l}\text{o}\text{g}{\left({\uppi\:}\right)}_{k,j,t}$$

Our maximum likelihood estimates are the sample proportions, calculated as \({p}_{j,t}^{k}=\frac{{n}_{k,j,t}}{{n}_{j,t}^{\text{*}}}\text{\:and}\)

\({n}_{j,t}^{\text{*}}={{\Sigma\:}}_{k}{n}_{k,j,t}\). Thus

$${\sum\:}_{\text{k}=1}^{4}{\text{p}}_{\text{j},\text{t}}^{\text{k}}={\sum\:}_{\text{k}=1}^{4}\frac{{\text{n}}_{\text{k},\text{j},\text{t}}}{{\text{n}}_{\text{j},\text{t}}^{\text{*}}}=1.$$

Using these sample proportions, we can calculate a weighted question score as follows:

$$T{S}_{j,t}={p}_{j,t}^{1}\text{*}4+{p}_{j,t}^{2}\text{*}3+{p}_{j,t}^{3}\text{*}2+{p}_{j,t}^{4}\text{*}1$$

where the weights are the inverse of the recorded structure such that the higher the score, the higher the level of trust. This procedure is used for all \(\text{t}\) questions with a four-category ordinal structure, regardless of the wording of the responses, such that the “trust” and “confidence” questions are comparable. Question 57 was similarly aggregated according to its two-category structure, with \({p}_{1}\) as 2 (“Most people can be trusted”) and \({p}_{2}\) as 1 (“Need to be very careful”):\(\text{T}{S}_{j,57}={p}_{j,57}^{1}\text{*}2+{p}_{j,57}^{2}\text{*}1\)

This resulted then a \(61\times\:23\) matrix of countries and question scores used for analysis.

There is substantial within-question variation of country trust scores, particularly for questions about both domestic and international governmental organizations (see Supplementary Figure S3). The most stable questions are those about charities and interpersonal trust, particularly trust in members of one’s community or family. Our exploratory analysis also revealed a series of complexities around missing data in the survey. Country-level non-response was rather low: only a small number of questions were entirely omitted from surveys in a small number of countries. However, individual item non-response, where an individual was asked a question but chose not to respond, varied considerably. While mean individual non-response was below 10% across countries for over three-quarters of the 23 questions, this was not always the case. This was seen particularly in Egypt, where the mean non-response was over 30% and went as high as 74.7%. This led to Egypt’s responses being excluded from further analysis. No other countries were excluded for missingness, and the few remaining question scores with non-response rates above 40% were replaced via single imputation with the median score from countries with missingness below 40%.

Statistical analysis: single item regression

To understand how individual trust items were associated with COVID-19 outcomes, we conducted ordinary least-squares regression. The associations between trust and our COVID-19 outcomes of interest were assessed by a series of single item regression models, adjusting for our five country-level features. For each trust item and each outcome of interest, the following model was used:

$$\text{E}\left({Y}_{r}\right)={{\upbeta\:}}_{0,\text{r}}+{{\upbeta\:}}_{1,r}\text{X}+{{\upbeta\:}}_{2,r}\text{G}\text{D}\text{P}+{{\upbeta\:}}_{3,r}Life\:expectancy+{{\upbeta\:}}_{4,r}Education$$
$$+{\beta\:}_{5,r}Urbanicity+{\beta\:}_{6,r}Freedom\:Score$$

where the subscript r denotes that the model is built with respect to the r-th COVID-19 outcome of interest (confirmed deaths per million, COVID-19 vaccination rate per 100 people, estimated excess deaths per million), and X denotes the question of interest (any of the 23 items included in our analysis).

None of the outcomes were transformed for regression. Measures of GDP, life expectancy, education, urbanicity, and democracy were included to control for potential confounding from healthcare infrastructure, population health, and overall country development. While a multiple linear regression model was considered, the high levels of collinearity between the trust items, coupled with the relatively low sample size (\(N=61\)) made even sparse regression techniques like elastic net regression unstable and ultimately uninformative.

Statistical analysis: deterministic and probabilistic trust clustering and regression

We sought to classify countries based on their level of trust and analyze whether cluster membership impacted COVID-19 outcomes (i.e. whether being “high trust” was associated with better COVID-19 outcomes than being a “low trust” country). We used a two-step process that first identified clusters based on trust item scores and then regressed our COVID-19 outcomes of interest on cluster membership, adjusting for our country-level features. The first clustering approach used was k-means clustering, which minimizes within-cluster variances for a pre-specified number of \(\text{k}\) clusters. A variety of visual and analytical tools—including assessing the AIC (Akaike Information criterion) and BIC (Bayesian information criterion), gap statistics, and elbow method results—were used to select \(\text{k}=3\) clusters based on this algorithm.

The second two-stage approach was clustering using finite Gaussian mixture modeling. Here, we assume that the datapoints for each country are generated by a distribution made up of \(k\) groups and \(k\) corresponding components. This distribution is of the form

$$\text{f}\left({\text{x}}_{\text{i}};{\Psi\:}\right)={{\Sigma\:}}_{k=1}^{K}{{\uppi\:}}_{k}{f}_{k}\left({x}_{i};{{\upbeta\:}}_{k}\right).$$

where \({\uppsi\:}=\left({{\uppi\:}}_{1}\dots\:{{\uppi\:}}_{K-1},{{\upbeta\:}}_{1}\dots\:{{\upbeta\:}}_{k}\right)\) is the vector of all model parameters, \({f}_{k}\left({x}_{i};{{\upbeta\:}}_{k}\right)\) is the \(\text{k}\)-th component’s density function for \({x}_{i}\) with parameters \({{\upbeta\:}}_{k},\:\)and \({{\uppi\:}}_{k}\) is the mixing probability of belonging to component \(\text{k}\) such that \({{\Sigma\:}}_{k=1}^{K}{{\uppi\:}}_{k}=1\). The data’s log-likelihood is then

$$logL\left({\Psi\:}\right)={{\Sigma\:}}_{k=1}^{K}{{\Sigma\:}}_{j=1}^{n}{z}_{kj}(log{{\uppi\:}}_{k}+log{f}_{k}({x}_{i};{{\upbeta\:}}_{k}\left)\right).$$

where \({z}_{kj}\) is the indicator for whether \({y}_{j}\) belongs to component \(\text{k}\). As we assume each component’s density is Gaussian, we can write \({f}_{k}\left({x}_{i};{{\upbeta\:}}_{k}\right)\sim\text{N}\left({{\upmu\:}}_{k},{{\Sigma\:}}_{k}\right)\). The expectation-maximization algorithm can then be used to obtain estimates for \({{\upmu\:}}_{k},{{\Sigma\:}}_{k},\) and \({{\uppi\:}}_{k}\). This is the approach taken in the R package ‘mclust’, which was used for this analysis68. Through an analysis of BIC, ICL (integrated completed likelihood), and log-likelihood values, \(k=3\) components were selected.

For both methods, ordinary least-squares linear regression was used to assess the statistical significance of cluster membership in relation to our three COVID-19 outcomes of interest and trust subdomain scores. Our outcomes were individually regressed on a categorical variable for cluster membership, adjusting for country-level features, as shown below:

$$\begin{aligned} \text{E}\left({Y}_{r}\right) & ={{\upbeta\:}}_{0,\text{r}}+{{\upbeta\:}}_{1,r}I({C}_{i}=Low)+{{\upbeta\:}}_{2,r}I\left({C}_{i}=Medium\right) \\ & \quad +{{\upbeta\:}}_{3,r}GDP+{{\upbeta\:}}_{4,r}Life\:expectancy+{{\upbeta\:}}_{5,r}Education+{\beta\:}_{6,r}Urbanicity+{\beta\:}_{7,r}Freedom\:Score \end{aligned}$$

with the subscript \(r\) denoting that the model is built with respect to the \(r\)-th COVID-19 outcome of interest (confirmed deaths per million, excess deaths per million, COVID-19 vaccination rate per 100 people), \({C}_{i}\) referring to cluster membership of country \(i\), and \({{\upbeta\:}}_{0,\text{r}}\) representing the “high trust” cluster coefficient.

Statistical analysis: Bayesian profile regression

After our two-step clustering, we sought to employ a method that could perform simultaneous clustering and regression and thus better incorporate uncertainty in cluster membership into the model. To do this, we used Bayesian profile regression as implemented in the R package ‘PReMiuM’69. Based on Dirichlet process mixture modeling (DPMM), this framework uses Markov Chain Monte Carlo methods to non-parametrically link the outcome and covariates through cluster membership while adjusting for country-level features48,70. Though this DPMM-based method is not a direct joint analog to our two-step methods (which use finite mixture models for GMM), we chose to leverage this gold standard method to be able to preserve the continuous nature of our covariates. As the outcome variable is considered in the clustering produced by this method, we ran three distinct joint profile regression models for each of our COVID-19 outcomes of interest, so that, in contrast to our two-step findings, we had different clustering assignments for each outcome.

For each COVID-19 outcome of interest \({y}_{i}\), our model is \({y}_{i}={\theta\:}_{{Z}_{i}}+{\beta\:}^{T}{W}_{i}+\:{\epsilon}_{i}\), where \({\theta\:}_{{Z}_{i}}\) is the random effect coefficient of cluster membership, \({\beta\:}^{T}{W}_{i}\) is the term encapsulating the fixed effects (our five country-level features) whose coefficients \(\beta\:\) do not change with cluster membership, and \({\epsilon}_{i}\) is the normally distributed error term. As both our covariates (the trust item scores) and outcomes are continuous, we assumed a mixture of Gaussian distributions for covariates and a Gaussian distribution for our response \({y}_{i}\), whose mean \({\mu\:}_{i}\) is represented by \({\mu\:}_{i}={\theta\:}_{{Z}_{i}}+{\beta\:}^{T}{W}_{i}\). A blocked Gibbs sampler was then used to generate estimates from \(h\) runs. From these runs, partition around medoids on the dissimilarity matrix was used to identify the ‘best’ partitions for a list of possible number of clusters, which was then maximized via average silhouette width to identify an overall ‘best’ and representative partition. For further details, consult the ‘PReMiuM’ manual69.

To run the models for each outcome, we scaled and centered all covariates and country-level features. Because the number of parameters (23 trust items and five country-level features) was large relative to our sample size, we chose a subset of the trust items to use in our analysis to mitigate multicollinearity. We selected these subsets by ranking the single item regression outputs by p-value and magnitude of association to identify the most important trust items for each outcome. For the two deaths per million outcomes, the top ten questions were used; for our vaccination rate, only the top five were used to guarantee a stable model. The distribution of vaccination rate across countries—already more ambiguous than death rates when clustered by trust, as seen in our two-step analysis—likely contributed to the difficulty in finding a stable model for this outcome-influenced regression. Once the questions were selected, we used a burn-in period of \(h=5000\) runs with 10,000 recorded sweeps, default priors provided by ‘PReMiuM’, and an initial number of clusters as two. This clustering was not fixed and changed throughout the model runs; the final output for the two death rates identified the optimal number of clusters as two, but the final output for vaccination rate identified the optimal number as three. Postprocessing returned representative clusters summarizing all clustering explored throughout the algorithm. Using these representative clusters, we calculated highest posterior density intervals for all trust subdomain scores and the empirical mean outcomes for each cluster. Finally, we back-transformed these results from their scaling and centering for ease of comparison with our two-step findings.