Introduction

Historically, diseases have often been named after the animal that first presented with the pathogen (e.g., monkeypox, swine flu, bird flu); the location of first diagnosis or report (e.g., Spanish flu, Marburg virus, Ebola Zaire virus); or the discovering scientist or affected patient’s name (e.g., Hodgkin’s lymphoma, Alzheimer’s disease, Chagas disease)1,2,3. However, since the initial release of specific guidelines in 2015, the World Health Organization (WHO) has recommended avoiding such practices and instead favoring symptomatology or the causative pathogen when naming new diseases4. This recommendation is responsive to rising xenophobia and avoidance related to location-centric names (e.g., anti-Mexican sentiment following the "Mexican swine flu" moniker for the 2009 H1N1 pandemic5), culling of animal populations wrongly believed to be reservoirs for a disease (e.g., pig culling in Egypt due to the 2009 H1N1 “swine flu” pandemic6), and occasionally, the prejudicial past of a discovering scientist (e.g., Rett syndrome, named after an Austrian neurologist who was a member of the Nazi party in his youth7). In some cases, a disease name may not even accurately reflect its origins. This scenario is best illustrated by the Spanish flu, which likely originated in North America, but was first reported in Spain due to limited communication during World War I8. Further, the species of discovery may not be the reservoir species, adding to confusion. The case of mpox fits into this scenario: the virus was discovered in 1958 in a captive cynomolgus monkey in Denmark, yet rodents are far more likely than monkeys to be the true reservoir species9,10. As such, during the COVID-19 pandemic, particular care was taken in naming the disease caused by the novel coronavirus (SARS-CoV-2), precisely by using the new WHO naming conventions. Despite this care, the magnitude of the COVID-19 pandemic highlighted pervasive xenophobia and racism tied to the origins of its causative pathogen and the importance of appropriate risk communication11,12. While authors of scientific publications generally used the formal disease name ‘COVID-19’, prominent figures in politics and the media have not similarly followed suit, often emphasizing the origins of COVID-19 by using inflammatory language to refer to the virus. One such example is US President Trump’s reference to SARS-CoV-2 as the “Chinese virus” in 202013. Given the use of such stigmatizing language in tweets and press briefings, both online anti-Asian sentiment and physical violence against Asians and Asian Americans rose dramatically13,14,15,16,17,18,19.

While the COVID-19 pandemic raged on, an epidemic of mpox (formerly known as ‘monkeypox’) simultaneously spread to over 50 countries in 2022–202320. Mpox is native to west and central Africa, with only a few sporadic outbreaks recorded in other countries up until 2022—when an unprecedented epidemic, comprised of multiple country-level outbreaks, unfolded21,22,23. Sexual transmission of mpox was observed for the first time during the 2022–2023 epidemic24. Given the high visibility of this mode of transmission throughout this epidemic, comparisons with the early HIV/AIDS epidemic were drawn, leading to increased stigmatization of gay men in particular25,26,27. Additionally, given the epidemic’s unprecedented global reach, the presentation and naming of the disease were more visible than ever before. The name ‘monkeypox’ not only constituted a misnomer, but also perpetuated an offensive stereotype of African populations, among whom the bulk of mpox infections had previously occurred28,29,30. Because of this egregious comparison and the use of African patients to portray mpox lesions early in the epidemic, Africans and other Black populations unjustly faced the initial burden of discrimination and stigma associated with mpox31. However, the heightened transmission among gay, bisexual, and other men who have sex with men (GBMSM) throughout the 2022–2023 epidemic subsequently made GBMSM and other LGBTQ + populations the main target of discrimination. In order to combat pre-existing and newly emerging stigma, in August 2022 the WHO advocated for renaming the disease. Following this call, the name ‘monkeypox’ was formally changed to ‘mpox’ on November 28, 2022, allowing for one year of simultaneous use of the two names before the former was fully phased out32,33.

The transition from ‘monkeypox’ to ‘mpox’ is the first known example of a formal name change for an infectious disease in the Digital Age, designed to explicitly address prejudice and misinformation. It echoes former experiences with AIDS in the early 1980s, when the disease was referred to as "Gay-Related Immune Deficiency" or GRID34,35. Similar to mpox, which was named after its first identification in a monkey9, ‘GRID’ was named after its first presentation in gay men in Los Angeles in 198036, resulting in tremendous stigmatization of the LGBTQ + community. Ultimately, in August 1982, the Centers for Disease Control introduced a non-stigmatizing name for this novel disease: Acquired Immune Deficiency Syndrome (AIDS). Nevertheless, the damage associated with the initial name had long-term repercussions—and this stigma persists today37. As such, the 2015 WHO recommendation endeavored to avoid the use of stigmatizing language for newly-named diseases4. Yet no formal and sufficiently broad guidance has been provided to date for retroactively changing existing disease names en masse, despite countless examples with prejudicial connotations that would require such intervention. In the absence of formal retrospective naming guidelines, the 2022–2023 mpox epidemic not only provides an interesting case study but also creates a precedent for the WHO and its national counterparts to build upon.

Unfortunately, despite the formal guidance released in 2022, the global adoption of the newly recommended term ‘mpox’ has been inconsistent. Indeed, in a more recent outbreak of mpox, which was declared a Public Health Emergency of International Concern (PHEIC) in August 2024, news sources across many countries still used the now-outdated term ‘monkeypox’38,39,40,41,42,43,44,45. We view the use of ‘monkeypox’ in lieu of ‘mpox’ as perpetuating racist and bigoted ideas by continuing to use this outdated and offensive terminology, as mpox historically most impacted Black individuals but now disproportionately affects LGBTQ + individuals. Thus, while our objective was to understand how stigmatizing attitudes and language relate to social factors such as acceptance of diverse identities, the use of stigmatizing language can also be interpreted as a reflection of existing societal stigma. In this study, our goal was to understand which factors influenced whether a country or territory was a timely adopter of the name change from ‘monkeypox’ to ‘mpox’. Specifically, we leveraged Google Trends (GT) to examine national-level Internet search query intensity of the terms ‘mpox’ and ‘monkeypox’ between November 2022 and December 2023, the time period over which the WHO recommended the dual use of both ‘mpox’ and ‘monkeypox’. Previously, GT has been used to study infectious diseases such as seasonal influenza, Zika, SARS-CoV-2, and measles, among others—from predicting and monitoring their spread, to understanding spatiotemporal variation in health-seeking behaviors, to contextualizing symptom searches and developing tailored risk communication plans, and more46,47,48,49,50,51,52,53,54,55,56.

Our main analytic goal was to understand the drivers of the adoption of the term ‘mpox’ at the national level. To that end, we defined two key outcomes: (1) a binary variable reflecting whether the average Google search intensity was higher for ‘mpox’ than for ‘monkeypox’ during our study period; and (2) a continuous variable defined as the fraction of the average Google search intensity for ‘mpox’ and ‘monkeypox’ attributable to ‘mpox’ during our study period (i.e., \(\frac{{s_{mpox} }}{{s_{mpox} + s_{monkeypox} }}\), where \(s_{mpox}\) and \(s_{monkeypox}\) represent the search intensity for ‘mpox’ and ‘monkeypox’, respectively). Through outcome (1), our objective was to identify the characteristics of countries and territories where a majority of Google search users had adopted the term ‘mpox’. Yet we recognize that the broad adoption of a new disease name by the general public is a time-intensive process. Following a change in disease name, adoption typically happens incrementally rather than immediately. Through outcome (2), we thus sought to better understand factors associated with incremental improvements in the uptake of ‘mpox’, even when a country or territory was not considered a ‘majority adopter’. We constructed regression models to associate these two outcomes with static national-level factors such as type of political regime, robustness of sociopolitical and health systems, level of pandemic preparedness, extent of gender and educational inequalities, temporal evolution of confirmed local mpox cases during the 2022–2023 epidemic, and distribution of mpox cases since its discovery. To our knowledge, no analyses have investigated these national-level factors as they relate to ‘mpox’ uptake, using GT or otherwise. Through this study, we hope to provide insights not only into national-level factors that were most conducive to the public’s adoption of the destigmatized name ‘mpox’ in 2022–2023—as reflected by the intensity of Google search queries—but also into improving the adoption of future changes in disease names.

Results

Over the study period, a majority of Google search users adopted the term ‘mpox’ in 9% (n = 17) of the 188 countries and territories considered in our analysis. Hereafter, we refer to these countries and territories as ‘majority adopters’. In other words, in these locations, the average Google search intensity for ‘mpox’ was higher than that for ‘monkeypox’ between November 1, 2022 and December 1, 2023. Among majority adopters, the median average search intensity for ‘mpox’ (on a scale from 0 to 100) over the study period was 9.1 (IQR: 5.9–14.5); in contrast, the median ‘mpox’ search intensity in other locations was 0.0 (IQR: 0.0–2.3). We analyzed how the intensity of Google searches for ‘mpox’ and ‘monkeypox’ correlated with a set of 18 covariates hypothesized to affect adoption of the term ‘mpox’. In doing so, we identified three consistent national-level factors significantly associated with the adoption of the term ‘mpox’: (1) any prior history of mpox in the country or territory, (2) greater LGBTQ + acceptance, and (3) no senior leader who recently propagated infectious disease misinformation. Importantly, our results were generally robust to the imputation strategy chosen. We present key summary statistics for all measured covariates, both pre- and post-imputation, in Table 1. Additionally, we compare the relative impact of three imputation strategies—as described in the Methods section—on the outcome and explanatory variables used in our models (Table 1, columns 2–4). Further, a visual representation of the imputation process is available in Appendix Figure S5.

Table 1 Key covariates pre-imputation (column 2), after imputing only the LGBTQ + Global Acceptance Index (GAI) covariate (column 3), and after imputing all covariates affected by missing data (column 4).

Across all univariable logistic regression models, ten variables were significantly associated (p < 0.05) with our binary outcome, i.e., majority ‘mpox’ adoption (Fig. 1a). Meanwhile, in univariable linear regression models, we observed 18 significant associations (p < 0.05) with our continuous outcome, i.e., the fraction of searches attributable to ‘mpox’ (hereafter referred to as the ‘mpox’ search proportion) (Fig. 2a). When including only those variables with univariable p-values below our model inclusion threshold of 0.1 in the two corresponding multivariable models, we identified three consistently significant relationships across our two outcomes: two positive associations and one negative association. First, countries and territories that had ever reported an mpox case in humans or in mammals, either as part of the 2022–2023 epidemic or in a prior outbreak, were both more likely to adopt the new disease name and more likely to register a higher search proportion for ‘mpox’. Second, locations with a higher LGBTQ + Global Acceptance Index (GAI) score had both greater ‘mpox’ adoption and higher levels of the ‘mpox’ search proportion. Third, countries and territories where senior leaders had recently propagated misinformation about infectious diseases registered both lower ‘mpox’ adoption and lower levels of the ‘mpox’ search proportion (see Figs. 1b and 2b for the results of our multivariable logistic and linear regression models, respectively). More specifically, for every one-unit increase in the LGBTQ + GAI score of a given country or territory (on a scale from 0 to 10), we estimated 1.57 times higher odds [95% Uncertainty Interval (UI): 1.25;2.00] that a majority of Google search users in that location would adopt the term ‘mpox’, while accounting for other national-level covariates (Fig. 1b). Likewise, via our multivariable linear regression model, we estimated a 2.9 percentage point [95% UI: 1.7;4.9] increase in the ‘mpox’ search proportion for every one-unit increase in the LGBTQ + GAI score (Fig. 2b). Further, countries and territories with a prior history of mpox were 10.69 times more likely [95% UI: 1.54;15.62] to be majority adopters of the term ‘mpox’. In our continuous outcome model, we similarly found a 10.3 percentage point [95% UI: 6.6;13.6] increase in the ‘mpox’ search proportion in such locations. Lastly, countries and territories where senior leaders previously propagated misinformation regarding infectious diseases were 74% [95% UI: 53%;74%] less likely to adopt the new disease name. Likewise, through our multivariable linear regression model, we estimated a 11.0 percentage point [95% UI: -19.9;-1.7] decrease in the ‘mpox’ search proportion in such locations. Notably, the variable pertaining to misinformation propagation by senior leaders was present in only four of the ten imputation models for the binary outcome, but in all ten imputation models for the continuous outcome (Appendix Tables S3 and S4, respectively). While the absolute values of the point estimates obtained after fitting a logistic versus linear regression model differed in magnitude (i.e., OR versus \(\beta\)), their relative values were comparable. Indeed, in our binary outcome model, the OR for whether a country or territory ever had a case of mpox was approximately 6.8 times greater (10.69/1.57) than that for the LGBTQ + GAI score; similarly, in our continuous outcome model, the \(\beta\) coefficient for ever having had a case of mpox was approximately 3.6 times greater (10.3/2.9) than that for the LGBTQ + GAI score. In sum, regardless of the outcome measure (i.e., binary versus continuous), the same three key variables were identified through our statistical analyses: prior history of mpox, LGBTQ + acceptance, and prior history of misinformation propagation by senior leaders. Importantly, we observed highly consistent patterns across our sensitivity analyses for the binary and continuous outcome models: all showed a significant positive relationship with LGBTQ + acceptance; 7 of 10 eligible binary models and 11 of 11 eligible continuous models showed a significant positive relationship with prior history of mpox; and 6 of 12 binary and 11 of 12 continuous models showed a significant negative relationship with prior history of misinformation propagation by senior leaders (see Appendix Sect. 5 for more details).

Fig. 1
figure 1figure 1

National-level factors associated with ‘mpox’ majority adoption (binary outcome) in univariable (a) and multivariable (b) models.

Variables that had p-values below our inclusion threshold of 0.1 in the corresponding univariable logistic regression models (a) were included in the multivariable model (b). Highly collinear variables were removed before fitting the multivariable model. Models were all built using the fully imputed data set (i.e., Approach 3). On the figure, an asterisk (*) denotes a binary variable. Note that our multivariable model included a total of 8 variables (2 binary variables and 6 continuous variables). Notably, the logit-transformed variable ‘female access to mobile phones’ was included in only 4 of the 10 models (one for each of the 10 considered imputations); thus, the 95% uncertainty interval (UI) for the corresponding odds ratio (OR) is computed over these 4 models. Similarly, the binary variable ‘senior leaders used misinformation’ was included in only 3 of the 10 models; the 95% UI for the corresponding OR was also computed over these 3 models. The goodness-of-fit of the multivariable logistic regression model (b) was assessed using a pseudo R2 metric, namely the McFadden’s R2(ranging from 0 to 1), as well as the AIC, both averaged across all ten imputations. The average R2 was estimated at 0.184 and the average AIC was estimated at 107.8. Mean pseudo R2 values and AIC values corresponding to each univariable logistic regression model (a) appear in Appendix Table S3.

Fig. 2
figure 2figure 2

National-level factors associated with ‘mpox’ search proportion (continuous outcome) in univariable (a) and multivariable (b) models.

Variables that had p-values below our inclusion threshold of 0.1 in the corresponding univariable linear regression models (a) were included in the multivariable model (b). Highly collinear variables were removed before fitting the multivariable model. Models were all built using the fully imputed data set (i.e., Approach 3). An asterisk (*) denotes a binary variable in the model. Note that our multivariable model included a total of 12 variables (3 binary variables and 9 continuous variables). The goodness-of-fit of the multivariable linear regression model (b) was assessed using the adjusted R2 metric (ranging from 0 to 1) and the AIC, both averaged across all ten imputations. The average R2 was estimated at 0.291 and the average AIC was estimated at -129.9. Mean adjusted R2 and AIC values corresponding to each univariable linear regression model (a) appear in Appendix Table S4.

Beyond these three consistent associations emerging from both our binary and continuous outcome models, we identified two distinct significant positive associations, specific to each outcome. Through our multivariable logistic regression model for the binary outcome, we identified an additional relationship between a higher ratio of female-to-male access to the Internet and ‘mpox’ majority adoption (OR = 1.17 [95% UI: 1.01;1.41]). Through our multivariable linear regression model for the continuous outcome, we identified an additional relationship between a higher 2021 Global Health Security Index (GHSI) Overall Score and ‘mpox’ search proportion (\(\beta \hspace{0.17em}\)= 0.003 [95% UI: 0.001;0.005]). Notably, the strength of these two outcome-specific associations was less pronounced than that of the three consistent associations we uncovered and described above. Thus, caution is warranted when interpreting the role of greater female access to the Internet or higher GHSI overall scores in ‘mpox’ adoption; further research is needed to better understand the unique contribution of these factors at the national level.

Discussion

Our study suggests that countries and territories with higher LGBTQ + acceptance, as measured by the GAI, were more likely to adopt the term ‘mpox’ over the term ‘monkeypox’ in the year following the WHO recommendation of the name change. In other words, empirical evidence from the 2022–2023 epidemic indicates that locations where the public is more accepting of queer communities were also more likely to demonstrate a thoughtful use of destigmatizing language when conducting Google searches. Importantly, this finding was consistent across all sensitivity analyses we conducted, emphasizing its robustness to the choice of imputation strategy and model type. In contrast to historical mpox outbreaks that predominantly affected west and central Africa and had no reported sexual transmission, the 2022–2023 mpox epidemic has had a global reach and has largely been identified as a disease predominately afflicting the GBMSM community66,67. As a result, this community has faced new backlash and stigma, now related specifically to mpox26,27,68,69. Indeed, when interviewed, several members of the GBMSM community emphasized a fear not only of mpox itself, but also of the stigma they were likely to face69. Therefore, our study underscores the need to increase awareness and acceptance of marginalized populations. As exemplified in previous outbreaks—most notably HIV/AIDS— stigma associated with sexual transmission not only impacts a person’s quality of life, but can also limit their access to care and reduce their propensity to accurately report their behaviors. These negative repercussions can in turn lead to delayed treatment, poorer health outcomes, and sustained disease propagation70,71,72,73. More recently, a sentiment analysis of tweets suggested that mpox misinformation had contributed to heightened stigma of LGBTQ + populations, resulting in increased violence, harassment, and isolation74. Thus, in order to increase awareness and acceptance of marginalized populations going forward, strong risk communications combating bigotry, prejudice, and stigma are critical early in infectious disease crises75,76. Moreover, focused anti-bullying and anti-prejudice campaigns in schools may contribute to improving the acceptance of LGBTQ + communities and other stigmatized populations and, ultimately, to a more empathetic future generation.

Another strong correlate of ‘mpox’ search intensity—in terms of significance in both the binary and continuous models as well as in our sensitivity analyses—was countries or territories that had ever had a confirmed case of mpox, in humans and other mammals alike. We hypothesize that people living in places where cases had previously been identified had increased exposure to news and social media about mpox, thus increasing their awareness of the new disease name. This assumption aligns with prior research about the impact of news media on infectious diseases (e.g., mpox, H1N1, Ebola, COVID-19), which has demonstrated that increased exposure to quality information through news and social media can increase vaccine uptake75,76, reduce disease transmission77, and encourage protective behaviors78,79,80. Future research evaluating media communications regarding the change in disease name—from ‘monkeypox’ to ‘mpox’—may provide valuable insights into how different news sources referred to the disease over time; whether any political, geographical, or temporal trends manifested; and what messaging at what time point corresponded to higher uptake of the term ‘mpox’. In fact, the recent August 2024 mpox outbreak illustrates in near real-time the mixed adoption of the new disease name by mainstream news and social media38,39,40,41,42,43,44,45,81,82. Future investigations of the use of ‘mpox’ by the media throughout the 2024 outbreak and resulting PHEIC could provide meaningful insights into the adoption of this term—allowing researchers to generate more specific guidance for future media campaigns targeting changes to infectious disease nomenclature.

Finally, we identified a third strong correlate of ‘mpox’ adoption: in locations where senior leaders had recently propagated infectious disease misinformation, adoption of the new disease name by the general public was significantly lower. Such misinformation has been increasingly identified as a barrier to vaccination83,84,85,86, seeking timely care83,87,88, and positive behavioral change83,89; has eroded trust in governments and authorities90,91,92; and has ultimately worsened health outcomes88,91,93. During the COVID-19 pandemic, misinformation was employed by politicians in the US and abroad to promote the use of untested medicine such as ivermectin, azithromycin, chloroquine, and hydroxychloroquine94,95,96; the avoidance of many public health guidelines like vaccination or mask wearing97,98,99; and dangerous practices like ingesting or injecting bleach100,101, resulting in surges in product purchases, price hikes, and in some cases, serious injury or death94,102,103. Several strategies have been employed to combat the growing threat of misinformation: emphasizing quality sources of information; capitalizing on trusted leaders, officials, and scientists to convey accurate information; correcting and calling out misinformation; and increasing the frequency of both proactive, accurate messaging and correcting inaccurate information104. Currently, consequences incurred by leaders or politicians who spread misinformation vary by country or territory and by region. First, they may face financial and legal consequences, including penalties such as fines and, in some cases, imprisonment105. Second, social media platforms themselves may take action and either restrict user privileges (e.g., retweeting, sharing, or posting) or ban such leaders outright; yet many countries still struggle to walk the line between action and censorship106,107. While elected officials who propagate misinformation should undoubtedly be sanctioned, no one course of action is likely to suit every location. For example, countries where social media organizations are headquartered generally have the financial means to strengthen online governance efforts by supporting dedicated investments in content monitoring and corrections, provided the willingness exists. In contrast, journalists and media organizations based in countries with less democratic ideals do not have similar options105. In such contexts, the Center for International Media Assistance recommends that global health organizations instead invest in media and digital literacy efforts to protect the public from the threat posed by misinformation and thereby facilitate the adoption of appropriate disease-related terminologies105. In the future, researchers should—in collaboration with dedicated counter-misinformation oversight committees, when possible—seek to understand how effective each strategy is at reducing the spread of misinformation in a given country or territory and to provide evidence that guides local decision making. Given that sensationalized and stigmatizing naming of diseases has been weaponized in the past (most notably during COVID-19), focusing on combating misinformation, regarding both infectious diseases in general and the choice of naming conventions more specifically, will hopefully improve the adoption of carefully chosen, destigmatizing names for current and future pathogens.

Limitations

There are several limitations to this study. First, our main analysis relied on English as the lingua franca of scientific communication and our sensitivity analysis (Appendix Sect. 5.3, Figures S10 and S11) considered the five additional United Nations (UN)/WHO languages (i.e., Arabic, Chinese, French, Russian, Spanish). These five languages—while widely used—account for only a small fraction of all the languages spoken across the world; as such, we are likely missing search terms for ‘monkeypox’ expressed in other languages, resulting in an incomplete capture of Google search interest in the disease globally. While we acknowledge the diversity of languages worldwide and its importance in public health, around 4 billion people speak at least one of the 6 considered UN/WHO languages as a primary or secondary language, representing about half of the world’s population108,109.

Second, consistent with the WHO nomenclature announcement, we only searched the single terms ‘mpox’ and ‘monkeypox’ rather than more specifically for the composite terms ‘mpox virus’ and ‘monkeypox virus’, which remain the official taxonomic names. However, given the boolean methodology used by GT, search interest data collected for the terms ‘mpox’ and ‘monkeypox’ constitute a superset that would also account for the terms ‘mpox virus’ and ‘monkeypox virus’, respectively110.

Third, at the time of our analysis, foreign language data for two countries—Ecuador and Madagascar—were unavailable. To address this limitation, we attempted a counterfactual sensitivity analysis for the binary outcome of majority adoption. Specifically, we generated all possible permutations (i.e., assuming that each of Ecuador and Madagascar could have been either a majority adopter country or a non-adopter country) and re-fitted the logistic regression model for each scenario. The corresponding results were largely similar results to those from our main analyses, underscoring the robustness of our key findings (Appendix Sects. 3.8, 5.10, and Figure S24).

Fourth, this study is based exclusively on Internet searches conducted by users via Google Search alone, thus excluding those attributable to other search engines. Future research examining searches for ‘mpox’ and ‘monkeypox’ on other search engines (e.g., Bing, DuckDuckGo, Yahoo) would thus provide valuable comparative insights. Importantly, over 90% of Internet users worldwide use Google as their primary search engine—with the exception of China and Russia—suggesting that GT are capturing the majority of global Internet searches111,112.

Fifth, for the variable ‘ever had a case of mpox’, there were few countries and territories that had never reported an mpox case that were also ‘majority adopters’. As such, the uncertainty intervals for this variable were wide, especially in some of the sensitivity analyses regarding the chosen imputation strategy. Despite this imprecision in point estimates for odds ratios and beta coefficients, the variable appeared as significantly associated with ‘mpox’ adoption across many of our sensitivity analyses; as such, we believe this is a robust finding that warrants discussion and further exploration.

Sixth, this analysis is correlative in nature and not causative; as such, we cannot conclude that the societal factors we identified as significantly associated with ‘mpox’ adoption effectively lead to greater ‘mpox’ adoption. Indeed, the use of stigmatizing language is also a reflection of the state of tolerance in society, making it equally possible that greater LGBTQ + acceptance, for example, is rather an indication of a country with lower levels of existing stigma. Future research connecting survey data about tolerance in society with GT data would provide additional insights into the state of existing stigma in a given country or territory and thus help in disentangling this relationship.

Seventh, while GBMSM and LGBTQ + individuals were the most stigmatized during the 2022–2023 epidemic, the original name ‘monkeypox’ was more discriminatory towards Black individuals. Although our measure of LGBTQ + acceptance does not capture racial inequity, we hypothesize that many countries with higher social acceptance also have higher levels of racial acceptance57,113, though this may not be universally true; further research is needed to test this hypothesis.

Eighth, our main analysis employs a multivariate random normal model using the popular MASS package in R, which does not use random seeding—a critical feature to ensure that results can be reproduced114,115. As such, we acknowledge slight variability in our main model results across iterations. To address this limitation, we implemented sensitivity analyses using two alternate R packages, namely rockchalk114,116 and mvtnorm117,118. Notably, the use of the MASS package resulted in fewer statistically significant covariates and wider uncertainty intervals than analyses using the two other packages, leading us to view our main results as more conservative. Analyses using rockchalk and mvtnorm appear in Appendix Sect. 5.7, Figures S15-S18; the corresponding results largely corroborate our main conclusions using the MASS package.

Ninth, we implemented univariable screening as a preprocessing step and applied a variable inclusion threshold of 0.1. In other words, variables having a sufficiently strong association with the outcome, with a corresponding p-value < 0.1, were included into our multivariable models; others were not included. Notably, this preprocessing step was repeated for each of the ten imputations. As with any statistical threshold, a trade-off exists when choosing a relevant threshold for inclusion. Just as the statistical significance threshold “p-value < 0.05” is conventionally used as a compromise between type I and type II error rates, yet remains fundamentally context-dependent119,120, thresholds for variable inclusion used in the literature have varied. For instance, 0.05, 0.1, and 0.25 have all been previously considered121,122,123,124,125,126,127. In medical sciences in particular, the same p < 0.05 threshold is often used for both multivariable model inclusion and variable significance123,124,125. In this analysis, we chose a more conservative threshold of p < 0.1 for variable inclusion in the main models to avoid omitting variables deemed relevant by domain experts while simultaneously avoiding high multicollinearity and maintaining model parsimony. In order to assess the robustness of our regression models to the chosen variable inclusion threshold, we conducted three threshold-based sensitivity analyses: (1) we considered a lower (i.e., more stringent) inclusion threshold of 0.05, a conventional value used in univariable screening to favor parsimonious multivariable outcome models and match the level of statistical significance123,124,125, (2) we considered a higher (i.e., less stringent) inclusion threshold of 0.25, a value that has been frequently used in univariable screening to allow for the inclusion of a wider number of variables and thus balance model parsimony with the risk of omitting an essential covariate121,122, and (3) we ignored inclusion thresholding altogether, i.e., we included all non-collinear variables into our multivariable models. This third approach effectively corresponds to an inclusion threshold of p < 1.0. The results of these sensitivity analyses appear in Appendix Sect. 5.9, Figure S21 for p < 0.05, Figure S22 for p < 0.25, and Figure S23 for p < 1.0. We similarly considered the impact of using automated variable selection methods—namely, L1 regularization via LASSO regression—to fit parsimonious multivariable models. These additional sensitivity analyses were highly confirmatory of the results obtained in our main analyses; corresponding results appear in Appendix Figures S19 and S20.

Lastly, we employed a suite of 18 covariates in our regression models, with varying degrees of missingness. We used multiple imputation with as many complete variables as possible, but acknowledge differences in resulting sample sizes and covariate data distributions between the complete-case analysis (Approach 1, n = 154), the GAI-only imputation analysis (Approach 2, n = 166), and the full imputation analysis (Approach 3, n = 188). The most notable difference was observed for the gross domestic product (GDP) variable. Since this covariate was available for all 188 considered countries and territories, we could contrast its overall distribution to that in the analyses restricted to 154 (Approach 1) and 166 (Approach 2) countries and territories, respectively. Specifically, we observed a lower median GDP in both the GAI-only imputation analysis ($6609.60) and the complete-case analysis ($6609.60) than overall ($6991.30). This observation suggests that health system quality, pandemic preparedness, gender equitability, and social acceptance, which are all correlated with GDP, were likely not missing completely at random. Nonetheless, all of our sensitivity analyses regarding the chosen imputation strategy yielded similar results, reinforcing the robustness of our conclusions.

Conclusions

The first part of the twenty-first century has been marked by a surge of zoonotic disease transmission, outbreaks, and pandemics, including a recent mpox PHEIC declared in August 2024. Given increasing population connectivity and mobility, deforestation and development encroaching on zoonotic reservoirs, and climate change, this trend is likely to continue. As such, the potential for stigmatizing language and xenophobia, as well as their downstream repercussions, are becoming increasingly visible. We need to plan now for ways to counter this stigma. Learning from those countries and territories that have been most successful at instituting ‘mpox’ in lieu of ‘monkeypox’ offers a promising starting point. With focused efforts to improve social acceptance—particularly in schools—as well as more sanctions and targeted messaging to counteract misinformation—particularly on social media—we hope for greater acceptance not only of new disease nomenclature but also of stigmatized populations in future outbreak scenarios.

Methods

Data sourcing and preprocessing

Google Trends

We obtained the trajectory of Google search intensity for ‘mpox ‘ and ‘monkeypox ‘ over time through Google Trends (GT), a publicly-available online platform that provides anonymized, categorized, and aggregated data. We included 188 countries and territories with available GT data on either term throughout the study period (November 1, 2022 to December 1, 2023), out of 249 distinct countries and territories captured by the platform overall. Given that English is currently the lingua franca in science, thus serving as the language of disease communication and nomenclature, we relied on Google search queries made in English in our primary analysis. However, we acknowledge the inherent power imbalance of Anglocentrism128,129,130,131,132,133. Therefore, in an effort to also understand the dynamics of public interest in ‘mpox’ versus ‘monkeypox’ in countries and territories where English is not the primary language, we considered the five other official UN/WHO languages: Arabic, Chinese, French, Russian, and Spanish. We report the results of this sensitivity analysis, aimed at characterizing the intensity of Google searches that users made in their native language, in Appendix Sect. 5.3.

GT returns the proportion of Google search activity accounted for by each search term in a given location (e.g., metropolitan area, state, country or territory) at a given point in time (e.g., day, week). This quantity, ranging between 0 and 100%, is defined as the number of term-specific searches in that location at that time divided by the total number of searches over that same region and timespan134,135. By collecting data for both search terms simultaneously (i.e., ‘mpox’ and ‘monkeypox’), we were able to compare their respective search intensity on the same spatiotemporal scale for the period ranging from November 1, 2022 to December 1, 2023. In sum, using both search terms simultaneously allowed us to capture the comparative trajectories of ‘mpox’ and ‘monkeypox’ search intensity over thirteen months of Google searches performed by the general public following the WHO announcement of the name change made in November 2022.

Outcome variables

We derived two outcome variables defined at the national level. Both variables were based on the average Google search intensity for the terms ‘mpox’ and ‘monkeypox’ in a given country or territory over the study period. The first outcome, a binary variable, captured whether a country or territory had a greater average search intensity for ‘mpox’ than for ‘monkeypox’ (1 if yes, 0 otherwise), allowing us to distinguish ‘mpox’ majority adopters from non-adopters among the 188 countries and territories we considered. The second outcome, a continuous variable, was defined as the fraction of search intensity attributed to ‘mpox’ over the study period (i.e., the ratio of the search intensity for ‘mpox ‘ over the total search intensity for ‘mpox’ and ‘monkeypox’ combined, expressed as a percentage value); hereafter, we refer to this variable as the ‘mpox’ search proportion. While we defined two distinct outcomes to test the sensitivity of our findings to the use of metrics with different granularity (i.e., binary versus continuous), our overall analytic goal was to identify the consistent factors driving the adoption of the new name ‘mpox’; thus, our results are presented and interpreted jointly.

Covariates

In this study, we sought to identify national-level factors contributing to the adoption of the term ‘mpox’ globally. To achieve this goal, we compiled a suite of 18 publicly-available national-level covariates to model their associations with ‘mpox’ search intensity (Table 1). The considered national-level covariates span seven broad categories: LGBTQ + acceptance (1 variable), political context (4 variables), healthcare access and quality of care (1 variable), global health security (8 variables), country wealth (1 variable), average educational attainment (1 variable), relative timing of 2022–2023 mpox epidemic (1 variable), and prior history of mpox outbreaks (1 variable). In Appendix Sect. 3.9, we define each variable in greater detail and provide our rationale for its inclusion in the outcome models. Except where transformations were needed to achieve normality assumptions (4 variables), all covariates were kept in their original scale for ease of interpretation. All transformations are further described in Appendix Sect. 3.1.

Given varying patterns of data missingness across the 188 countries and territories included in the analysis (Appendix Figure S2), we used a multiple imputation approach with predictive mean matching over ten imputations. We considered three different combinations of data selection and imputation methods: (Approach 1) selecting countries and territories with no missing data and performing a complete-case analysis; (Approach 2) selecting countries and territories that either had complete data or were missing only the LGBTQ + GAI variable and imputing it where necessary; and (Approach 3) imputing all variables with missing data. Approaches 1, 2, and 3 resulted in the inclusion of 154, 166, and 188 distinct locations, respectively. Results presented in the main text capitalize on the fully imputed dataset (i.e., Approach 3), while results relying on the two other imputation strategies (i.e., Approaches 1 and 2) appear in Appendix Sects. 5.1 and 5.2, respectively.

Modeling

Following data extraction and preprocessing, we built univariable and multivariable regression models separately for each of our two outcomes. The binary outcome—whether a location was a majority adopter (1) or not (0)—was modeled using logistic regression models with time-invariant covariates. The continuous outcome—‘mpox’ search proportion—was modeled using generalized linear regression models with the same time-invariant covariates. First, we evaluated the strength of the univariable association between each of the two outcomes of interest (i.e., majority adoption, defined as a binary variable, and the proportion of searches attributable to ‘mpox’, defined as a continuous variable between 0 and 1) and each of the 18 considered covariates, using univariable logistic and linear regression models, respectively. Notably, the choice of these 18 covariates was guided by expert opinion. Multiple imputation was used to handle missing data; in total, ten imputation rounds were performed. For each imputed dataset, we built a separate logistic regression model and a separate linear regression model. We used individual univariable model p-values to decide whether to retain a given variable in the corresponding multivariable model, built upon that imputed dataset. For variable selection, we used an inclusion threshold of p < 0.1; this threshold was chosen to balance model parsimony and multicollinearity, while also allowing the inclusion of a broader set of variables deemed relevant by domain experts—namely, those that exhibited weaker associations than the classical cutoff of p < 0.05 used in medical sciences for both variable inclusion and significance123,124,125. Further, prior to model fitting, we removed variables that were highly collinear (even when they satisfied the inclusion threshold of p < 0.1). (Table 1) We subsequently implemented a multivariable regression model adjusted for the remaining subset of covariates. We then accounted for uncertainty across the ten iterations of the imputation process by drawing from a multivariate random normal distribution via the MASS package in R115. A variable was included in our final analysis if it was included in at least one of the ten multivariable models; details regarding the number of imputed datasets and multivariable models for which a variable was selected (out of ten in total) appear in Appendix Tables S3 and S4. To summarize our findings across all ten imputations, we used the median to derive a point estimate for the odds ratio (OR) or β coefficient as well as the 2.5th and 97.5th percentiles to obtain the corresponding 95% uncertainty interval (UI).

Sensitivity analyses

In addition to the analyses presented in the main text of this manuscript, we conducted eight types of sensitivity analyses to assess the robustness of our results to data and modeling decisions. First, we compared our main results with those emanating from the two alternative imputation strategies described above, namely the complete case analysis (Approach 1, see Appendix Figures S6 and S7, Sect. 5.1) and the imputation of only the LGBTQ + GAI variable (Approach 2, see Appendix Figures S8 and S9, Sect. 5.2). Descriptive statistics for these imputation approaches appear in Table 1 and Figure S5. Second, we evaluated Google searches made in languages other than English (Appendix Figures S10 and S11, Sect. 5.3). Third, we evaluated the impact of restricting our dataset to only the countries and territories that ever had a confirmed case of mpox in order to understand the role of prior exposure to this disease on the adoption of the term ‘mpox’ by the public (Appendix Figures S12 and S13, Sect. 5.4). Fourth, we implemented a Tobit model, which is particularly well-suited for zero-inflated data. In doing so, our goal was to address a large proportion of zeros in our continuous outcome measuring the ‘mpox’ search proportion (Appendix Figure S14, Sect. 5.5). Fifth, we used a Bonferroni correction to account for post-hoc testing of multiple hypotheses at once (Appendix Sect. 5.6). Sixth, we assessed the robustness of our findings to the use of different packages to draw from a multivariate random normal distribution, specifically rockchalk and mvtnorm (Appendix Figures S15-S18, Sect. 5.7). Seventh, we used LASSO regression for variable selection and model fitting in lieu of univariable screening (Appendix Figures S19 and S20, Sect. 5.8). Lastly, we varied the threshold for univariable screening from p < 0.1 to p < 0.05 (Appendix Figure S21, Sect. 5.9) and from p <  0.1 to p < 0.25 (Appendix Figure S22, Sect. 5.9). Additionally, we fitted the model with no screening at all (i.e. p < 1.0) (Appendix Figure S23, Sect. 5.9).

Role of the funder

All analyses were conducted using R version 4.3.1136. This research was supported in part by the National Institute of General Medical Sciences, National Institutes of Health (R35GM146974); the National Science Foundation (SES2200228, SES2230083, & IIS2229881); and the MIT-Harvard Broad Institute Eric & Wendy Schmidt Center. The funding sources had no involvement in the study design; in the collection, analysis, and interpretation of data; in the writing of the report; or in the decision to submit the paper for publication.