Abstract
The 2022 global mpox epidemic was caused by transmission of MPXV clade IIb, lineage B.1 through sexual contact networks, with New York City (NYC) experiencing the first and largest outbreak in the United States. By performing phylogeographic analysis of MPXV genomes sampled from 757 individuals in NYC between April 2022 and April 2023, and 3,287 MPXV genomes sampled around the world, we identify over 200 introductions of MPXV into NYC with at least 84 leading to onward transmission. These infections primarily occurred among men who have sex with men, transgender women and nonbinary individuals. Through a comparative analysis with HIV in NYC, we find that both MPXV and HIV genomic cluster sizes are best fit by scale-free distributions, and that people in MPXV clusters are more likely to have previously received an HIV diagnosis and be a member of a recently growing HIV transmission cluster. We model MPXV transmission through sexual contact networks and show that highly connected individuals would be disproportionately infected at the start of an epidemic, which would likely result in the exhaustion of the most densely connected parts of the network, and, therefore, explain the rapid expansion and decline of the NYC outbreak. By coupling the genomic epidemiology of MPXV and HIV with epidemic modeling, we demonstrate that the transmission dynamics of MPXV in NYC can be understood by general principles of sexually transmitted pathogens.
Similar content being viewed by others
Main
In May 2022 a global mpox epidemic was identified, starting with the diagnosis of a case in the United Kingdom, which was swiftly followed by localized outbreaks across the world. International transmissions of mpox virus (MPXV), particularly clade IIb, lineage B.1, seeded these outbreaks, and these transmission events were distinguishable due to an elevated evolutionary rate attributable to APOBEC3-associated hypermutation1,2,3,4,5,6. By the end of 2022 the global case count of this epidemic exceeded 80,000 and the United States was the country with the largest number of cases, at 30,092 (ref. 7), with the earliest importations into the United States likely to be coming from Western Europe4. In the United States, New York City (NYC) experienced the first and largest major outbreak, with 3,821 cases diagnosed by the end of 2022 (refs. 8,9). NYC also conducted the first mpox vaccination campaigns in the country, initiated in June 2022 (ref. 10).
During the 2022 epidemic, MPXV is believed to have primarily spread through sexual contact networks among men who have sex with men (MSM)11,12, whereas previous outbreaks were the result of zoonotic, household and, in rare cases, hospital-based transmissions13. These outbreaks were typically self-contained and with limited human-to-human transmission. Epidemic modeling suggests that the rapid global emergence, expansion and decline of mpox during the 2022 epidemic could be explained by the transmission of MPXV through heavy-tailed sexual contact networks14,15,16,17, a heterogeneous network in which a subset of individuals have a disproportionately high number of contacts. Such networks account for the transmission patterns of other sexually transmitted pathogens, including human immunodeficiency virus type 1 (HIV-1)18,19,20,21, hepatitis C virus22 and Treponema pallidum pallidum (the etiological agent of syphilis)23,24,25. Phylodynamic analyses of sexually transmitted pathogens, including HIV, have shown that they recapitulate these heavy-tailed sexual contact networks19,20,21,22; however, these hypothesized dynamics have not yet been confirmed for MPXV by empirical analyses.
The mpox outbreak in NYC peaked shortly after the initiation of its vaccination campaign, in which two doses were administered 1 month apart, but it is unclear whether the subsequent declining incidence of mpox was due to vaccination, behavioral changes, sexual contact network dynamics, or some combination thereof26. Long-term public health surveillance in NYC provides a distinct opportunity to examine whether the transmission dynamics of MPXV are consistent with those of other sexually transmitted infections, specifically HIV-1, for which molecular surveillance data have been collected for two decades and sexual transmission dynamics are well understood18,19,20,21,27,28. Here, we used phylogeographic and epidemiological modeling approaches to better understand the factors governing the trajectory of the mpox outbreak in NYC in the context of the global epidemic and the intersection between the MPXV and HIV transmission networks.
Results
Emergence of mpox in NYC
The mpox outbreak in NYC rapidly grew in the summer of 2022 (June–July; Fig. 1a), while the rest of the United States did not experience a marked increase in cases until the end of July. Mpox cases in NYC peaked in late July, approximately 1 month before the peaks in the rest of the United States and globally. A substantially higher percentage of diagnosed mpox infections were sequenced in NYC through to the end of April 2023 (19.7%; n = 760) than in either the rest of the United States (4.6%; n = 1,221) or the rest of the world (4.1%, n = 2,338), enabling a fine-grained phylogeographic analysis of the MPXV outbreak in NYC. More than 99% of the MPXV genomes from NYC were produced by the NYC Department of Health and Mental Hygiene (DOHMH) Public Health Laboratory (PHL).
a, The number of cases of mpox per week in NYC, the United States and globally, where ‘United States’ refers to all of the United States excluding NYC and ‘global’ refers to all countries other than the United States. b, The number of sequenced genomes per week in NYC, the United States and globally. c, Time-calibrated phylogenetic tree of MPXV. Genomes are represented by circles at the tips of the phylogenetic tree, with tips and branches colored by location.
To characterize the introductions and emergence of MPXV in NYC, we conducted a Bayesian phylogeographic analysis of MPXV sampled from April 2022 to the end of April 2023, with 757 high-quality genomes from NYC (756 sequenced by the NYC PHL), 1,127 from the rest of the United States, and 2,160 from the rest of the world29. We found that MPXV was likely to be cryptically circulating in NYC and the rest of the United States for 1–2 months before the earliest cases of the 2022 mpox outbreak were identified on 19 May and 17 May in NYC and the broader United States, respectively (Fig. 1c), in line with previous results4. We inferred that global importations of MPXV into NYC started in April, with a peak in mid-June (Fig. 2a), and identified a median of 174 global importations into NYC in total (95% highest posterior density [HPD], 160–191). Based on the available sampling, there were substantially fewer importation events into NYC from the broader United States (median = 59; 95% HPD, 43–75), with both importations from and exportations into the United States peaking in late June to early July (Fig. 2a). However, there were many exportations of MPXV from NYC to the rest of the United States (median = 170; 95% HPD, 143–193; Fig. 2b).
a, The estimated number of MPXV importations into, and exportations out of, NYC per week. The center line indicates the median, and the shaded areas show the 95% HPDs of the estimates. b, Cumulative number of MPXV importations into and exportations out of NYC. The violins represent the 95% HPDs and the black bars indicate the medians. c, Estimated importation times for the 84 inferred MPXV NYC transmission clusters. The left limit of the solid horizontal line is the median time of the parent node of the MRCA of each transmission cluster, and the right limit is the median time of the MRCA (tMRCA) of the transmission cluster. The left limit of the dotted line is the lower bound of the 95% HPD of the time of the parent node, and the right limit is the upper bound of the 95% HPD of the tMRCA. d, Distribution of MPXV transmission cluster sizes in NYC.
Global importations seeded the start of the epidemic in the rest of the United States (Extended Data Fig. 1). Then, starting in July, most importations into the broader United States came from NYC. By the end of 2022, there were nearly as many importations from NYC into other areas of the United States as from the rest of the world.
MPXV transmission clusters
Our phylogeographic analysis allows for the identification of NYC transmission clusters: distinct importations of MPXV into NYC that result in identifiable onward transmission within NYC (that is, at least two individuals with closely related sequenced MPXV genomes). We inferred 84 clusters in total, and many of these clusters could be reliably identified (Extended Data Fig. 2), because they often were defined by at least one substitution (Extended Data Figs. 2 and 3). We can be reasonably confident in these phylogenetic relationships because the underlying mutation tree falls in the 'near-perfect' tree space: it has a short overall length (0.0291 substitutions per site) in which most nodes are defined by only one or two mutations, leading to an expected false-positive rate for any internal node of 0.0097 (ref. 30). The importations of MPXV into NYC that led to the largest transmission clusters are estimated to have occurred in late April or early–mid-May 2022 (Fig. 2c). However, most imports, especially later in the outbreak, led to singletons: introductions characterized by a single viral genome without any identifiable onward transmission in NYC (Extended Data Fig. 4).
We assessed the association between demographic attributes obtained during interview with NYC DOHMH epidemiologists and MXPV transmission risk reported to public health surveillance, and the frequency of membership in NYC MPXV transmission clusters among 699 individuals (699/756; 92.5%) for whom this information was reported. This population mostly consisted of MSM (Table 1). We observed increased odds of membership in an MPXV transmission cluster for Black or African American individuals (P = 0.038) relative to Latino/Latina or Hispanic individuals, whereas white individuals (P = 0.005) had decreased odds of clustering. We also observed decreased odds of MPXV clustering for people reporting recent foreign travel history (P = 0.001). An explanation for this inferred negative association between clustering and foreign travel is found in our definition of clusters as unique introductions into NYC. Although each transmission cluster must have, by definition, been initiated by someone with recent travel history, introductions that did not lead to onward transmission would include only an individual with foreign travel history. (Fig. 2d and Extended Data Fig. 4). Membership in a cluster did not vary significantly by gender, sexual orientation, age or vaccination status.
Associations between MPXV and HIV transmission clusters
The distribution of MPXV transmission cluster sizes is highly right-skewed (Fig. 2d), matching previously described patterns of HIV transmission cluster size distributions among MSM19,20,21. To better characterize these MPXV transmission cluster sizes, we fit a set of distributions to them and found that scale-free distributions provided the best fit (Akaike information criterion < 550; Extended Data Table 1). To validate the fit of these distributions, we simulated transmission cluster sizes from them and calculated how similar the simulated cluster sizes were to the inferred cluster sizes. Scale-free distributions consistently fit the inferred MPXV transmission cluster sizes better than distributions that were not scale free, with a Yule–Simon distribution providing the best fit (Kullback–Leibler [KL] divergence = 0.336).
We then compared the cluster size distributions of MPXV and HIV in NYC. We first inferred transmission clusters from HIV partial-pol sequences sampled since 2001 from 76,910 people living with HIV (PLWH) in NYC using HIV-TRACE, identifying 4,259 HIV transmission clusters consisting of 17,456 individuals. We found 416 HIV transmission clusters that grew by at least one member in 2021 or 2022, consisting of 4,097 people. Of these people in growing clusters, 1,711 (40.2%) had a virus with a recent genetic link to another person with an HIV diagnosis in 2021 or 2022. Fitting distributions to the HIV transmission cluster sizes, we found that the HIV transmission cluster sizes in NYC were also right-skewed (Extended Data Fig. 5) and best fit by scale-free distributions (Extended Data Table 1). Both MPXV and HIV transmission clusters in NYC follow scale-free distributions, recapitulating transmission across scale-free sexual transmission networks with heavy-tailed partnership distributions31, suggesting that their transmission dynamics were likely to have emerged from similar forms of transmission.
We queried the HIV surveillance database in NYC to determine whether persons living with HIV and in the HIV transmission network were disproportionately represented in MPXV clusters (Table 2). A substantial proportion of the individuals with an mpox diagnosis had a reported HIV diagnosis (n = 328/756; 43.4%), and we found a significant association between a reported HIV diagnosis and membership in MPXV transmission clusters (logistic regression; odds ratio = 1.58; P = 0.012). The majority (310/328, 94.5%) of people with both an HIV and mpox diagnosis received their HIV diagnosis prior to 2022, highlighting the different time frame in which these viruses spread among the same people (Extended Data Fig. 6). Individuals who were clustered in both the MXPV network and the HIV network did not share connections in their networks, suggesting that the contact networks for MPXV and HIV were non-overlapping, in line with the different time periods in which individuals received MPXV and HIV diagnoses.
Clustering in the HIV network is associated with more densely connected sexual networks and rapid viral transmission18,27,32. We did not observe an association between mpox diagnoses with reporting of HIV sequences or clustering in the HIV transmission network. However, the frequency with which people with an mpox diagnosis clustered in the HIV network (n = 96/187; 51.3%) is substantially higher than would be expected; when resampling the HIV network matching transmission risk and diagnosis year of people with an mpox diagnosis, one would expect only 67 individuals (35.8%; 95% quantile range: 55–80) to be clustered in the HIV network. These results indicate that people with both HIV and mpox diagnoses are part of more densely connected sexual networks.
This association of mpox cases with active sexual networks is further demonstrated by their increased occurrence in recently growing HIV transmission clusters (Fisher’s exact test; P = 0.019). Nearly all individuals with an mpox diagnosis and membership in a growing HIV cluster (that is, ≥1 new member diagnosed in 2021–2022) were also part of an MPXV cluster (n = 31/32). Furthermore, all 15 individuals with an mpox diagnosis and a genetic link to someone diagnosed with HIV in 2021 or 2022 in the HIV transmission network belonged to an MPXV cluster, although the small dataset offers limited statistical power (Table 2).
We next assessed the association between demographic attributes and MXPV transmission risk reported to public health surveillance and the presence of a reported HIV diagnosis among the 699 individuals for whom this information was reported (Extended Data Table 2). We observed lower odds of having a reported HIV diagnosis among individuals identifying as heterosexual (P < 0.001), compared with gay men, according to NYC DOHMH interviews. Compared with Latino/Latina and Hispanic people, we observed higher odds for a reported HIV diagnosis among Black and African American individuals (P = 0.01) and lower odds for white individuals. Furthermore, all age groups above 34 years old had increasingly higher odds of a reported HIV diagnosis (P < 0.001), highlighting the intersection between MPXV and HIV among people who have been living with HIV for years or decades.
Decline of the NYC mpox outbreak
To quantify when and why the 2022 mpox outbreak in NYC began its decline, we inferred the time-varying reproduction number (also known as the effective reproduction number, Re) using case counts, generation time and incubation period. The epidemic in NYC reached its peak at the end of July 2022 (Fig. 1a), before most of the first vaccine doses of a two-dose vaccine series were given (Fig. 3a). We found that Re peaked in late June and began decreasing as the rise in infections slowed and before fewer than 1,000 doses of the first vaccine dose were administered (that is, <1% of the 102,183 first vaccine doses delivered in NYC in 2022) (Fig. 3). Furthermore, Re decreased to below 1 by August 2022, before 200 doses of the second vaccine dose were given (that is, <1% of the 52,374 second vaccine doses delivered in NYC in 2022).
a, Number of mpox vaccines administered in mid-late 2022. b, Inferred effective reproduction number (Re) of MPXV based on the case data when incorporating generation time and incubation period into the inference (EpiNow2, blue), solely on the case counts (REACT, yellow), or MPXV genomes in a phylodynamic approach using an episodic birth–death-sampling model (EBDS; red). The dashed line indicates the inflection point (that is, when the epidemic is in decline; Re = 1). Lines and shaded regions represent the median and 95% confidence interval, respectively. The EBDS model assumes week-long intervals with a uniform Re per interval.
Generation time and incubation period can vary throughout an epidemic33, thereby resulting in model misspecification. Therefore, we also inferred Re using solely case counts, and found that Re could have dropped below 1 up to 1 week later (Fig. 3b), but still before a moderate proportion of the second vaccine was given.
The identification of local transmission clusters through our phylogeographic inference provides an alternative approach to measure the changes in Re during the NYC mpox epidemic that relies on genomics and inferred local transmission. We constructed an episodic birth–death-sampling model in a Bayesian framework that jointly analyzed all imported NYC clusters with at least 10 genomes (n = 13). We found that Re decreased below 1 by mid-July, approximately 1–2 weeks before the dates inferred by primarily using case data, in line with many of the introductions of MPXV into NYC after June that did not have identifiable onward transmission (Extended Data Fig. 4). Additionally, across all models, Re had declined to near 1 by the time that infections probably peaked and fell below 1 by the time that reported cases began declining (Fig. 3).
To confirm these results, we inferred the growth rate of mpox (Extended Data Fig. 7), given that growth rate has been shown to be less sensitive than Re to model misspecification34. The growth rate also began declining by the end of June, when fewer than 1,000 doses of the first vaccine dose had been given, and decreased below zero by August. Growth rate inference was consistent both with and without the generation time and incubation period parameterization. When focusing on local transmission clusters, we again found that the growth rate decreased below zero approximately 1–2 weeks prior to the inferred dates when using the other approaches.
Heavy-tailed sexual contact network drives outbreak dynamics
To determine whether a heavy-tailed sexual contact network could explain the decline of the outbreak before there was substantial infection- or vaccine-derived immunity, we simulated mpox epidemics based on MSM sexual contact networks, under varying assumptions for the infectious period and the risk of transmission between sexual partners (secondary attack rate [SAR] during the infectious period). By the time Re dropped below 1 in our simulations, we estimated a cumulative incidence among MSM of below 2% when assuming an SAR of 0.1 (Extended Data Fig. 8a); a higher SAR produces slightly higher average cumulative incidence, but with a larger range (Extended Data Fig. 8b). Assuming an MSM population in NYC of approximately 235,000 (Methods), we observed an empirical cumulative incidence of 0.52–0.96% (n = 1,230–2,248) at the time Re dropped below 1 (Fig. 3). If we conservatively assume a 60% mpox case ascertainment rate4,35, there would have been an empirical cumulative incidence of 0.87–1.59% at this time. These results are broadly consistent with our simulations, suggesting that the dynamics of the mpox outbreak in NYC were likely governed by a heavy-tailed sexual network, with an SAR between 0.1 and 0.4 (Extended Data Fig. 8).
From these epidemic simulations, we found that the most highly connected individuals (that is, people with more sexual partners than at least 99% of the sexual contact network) become infected early in the epidemic and nearly 40% of these individuals were infected by the time the empirical Re dipped below 1 (Fig. 4a). This dynamic results in more than 25% of all infections consisting of highly connected individuals by the time Re declined to 1 (Fig. 4b); across varying connectivity thresholds, a disproportionate number of the cumulative cases are attributed to the highly connected individuals. The rapid infection of highly connected individuals in the sexual network at the start of the epidemic suggests that their connectivity was a major contributor to the initially high Re and that the early saturation of densely connected parts of the sexual network contributed to the epidemic ending with low cumulative incidence.
a, Proportion of highly connected susceptible individuals remaining after observing given cumulative cases. b, The proportion of cumulative infections, consisting of infected highly connected individuals. The high-connectivity threshold refers to the top percentile of network connectivity. Lines and shaded regions represent the median and 95% quantile range, respectively. The red and blue blocks represent when the inferred growth rate and Re reached their inflection points based on the empirical data. The left and right limits of the blocks are, respectively, the lowest lower limit and the highest upper limit across the 95% confidence intervals of the methods based on case counts detailed in Fig. 3 and Extended Data Fig. 7.
Discussion
Long-standing molecular surveillance in NYC provided an opportunity to investigate the associations between the MPXV and HIV transmission networks and whether MPXV recapitulated the dynamics of a sexually transmitted pathogen such as HIV. We found that earlier introductions of MPXV into NYC resulted in, on average, larger transmission clusters than later introductions, following model-based expectations36, and that the largest clusters were likely to have been seeded before the first dozen cases were identified in NYC. Our comparative analysis of MPXV with HIV and epidemic modeling showed that transmission through a heavy-tailed sexual contact network can explain the epidemic trends observed in NYC.
Sexual transmission networks, particularly among MSM, are characterized by heavy-tailed degree centrality distributions14,37,38, which are also observed in HIV transmission networks20,21. Considering the concordance between HIV and MPXV transmission networks and the similarities in their cluster size distributions, the MPXV sexual network is likely to be a heavy-tailed partnership distribution as well. The extent to which these MPXV and HIV cluster-size distributions truly represent scale-free contact networks is a matter of ongoing debate39,40. However, cluster size distributions depend on the structure of the contact network, and heavy-tailed degree distributions are known to produce power-law cluster size distributions31.
Epidemic simulations developed to predict and explain the course of the 2022 mpox epidemic assumed a heavy-tailed transmission network, based on previously characterized sexual networks14,15. By using genomic epidemiology and building upon these simulations, we confirmed this assumption is valid and demonstrated that the 2022 mpox outbreak in NYC was consistent with the rapid and early infection of highly connected individuals in the sexual contact networks. Although these highly-connected individuals make up a small proportion of the overall contact network, their immunity, whether from vaccination or natural infection, would disrupt chains of transmission. The decrease in transmission cluster sizes as the outbreak progressed is therefore likely the result of infection-derived immunity of these highly connected individuals, when fewer than 2% of the at-risk MSM population are believed to have been infected15.
Both MPXV and HIV spread through heavy-tailed sexual contact networks in NYC, primarily consisting of MSM. However, the differing natural histories and evolutionary histories of these viruses produced transmission networks that intersected, but did not recapitulate each other. The preponderance of HIV diagnoses among MPXV cases were from the 1990s and 2000s, resulting in older individuals being disproportionately infected with both viruses. Hence, people living with HIV acquired MPXV from sexual partners who were not members of their original HIV transmission cluster. In the years separating HIV and MPXV transmission events, sexual contacts are likely to have shifted, while an overall scale-free sexual contact network structure was maintained. Although we conclude that the structure of this network led to the decline of the 2022 mpox outbreak in NYC, the saturation of this network has not produced a concomitant decline in HIV cases. The ability of HIV to produce chronic infections and transmit over years and decades in this same population suggests that shifting sexual contact networks are key to its persistence. Future work on the role and timing of sexual network restructuring may improve long-term prevention strategies for HIV and other sexually transmitted pathogens.
Contact network heterogeneity, changes in behavior, and vaccines could each individually affect MPXV transmission dynamics. Although there were undoubtedly changes in sexual behavior globally after the World Health Organization (WHO) declared mpox a public health emergency of international concern (PHEIC) on 23 July 202241,42,43, these changes would have occurred after Re was already near or below 1 and therefore could not have led to the decline of the NYC outbreak. Instead, the behavioral changes in NYC that would have contributed to limit the transmission of MPXV are likely to have occurred early in the 2022 outbreak44,45,46,47. A previous genomic epidemiological investigation found that vaccination had limited impact in curbing the North America mpox outbreak, with fewer than 10% of high-risk individuals developing vaccination-induced immunity before Re reached 1. By coupling genomic epidemiology with epidemic modeling and focusing on NYC, which experienced the largest outbreak in North America and one of the earliest vaccine rollouts, we underscore the limited role that vaccination played in curbing the 2022 mpox epidemic and provide a mechanistic explanation for its trajectory.
Although NYC rapidly distributed mpox vaccines to those at greatest risk of infection, the rise and fall of the epidemic appears to have outpaced the delivery of the vaccine. Mpox vaccination requires two doses 1 month apart, and immunocompetent individuals are considered fully immunized 2 weeks after the second dose48,49. A substantial fraction of at-risk people in NYC did not reach this point until September 2022, suggesting that the mpox vaccination campaign most probably occurred too late to substantively affect the trajectory of the NYC mpox outbreak. Even though the mpox vaccination campaign was probably not responsible for slowing and ultimately curbing the 2022 epidemic, vaccination – even a single dose – is likely to have protected many individuals from infection and severe disease50,51,52,53. Importantly, the lack of a resurgent mpox outbreak in NYC in the subsequent year, despite the continued circulation of MPXV among MSM7, suggests that vaccination may have played a role alongside natural (infection-induced) immunity in preventing resurgent community transmission.
These results should be interpreted in the context of several limitations. First, uneven sampling and sequencing coverage could bias the inference of viral importations and onward transmission. Specifically, genomic sampling peaked in NYC before the cases did (Fig. 1a,b); however, considering that NYC has the highest proportion of cases sequenced, it is unlikely that we overestimated the number of introductions of MPXV into NYC. Nearly all of the sequenced NYC cases were from sexual health clinics, which, although not necessarily representative of citywide cases, are likely to represent a sentinel population with which to understand sexually transmitted pathogens in NYC. Second, generation times, serial intervals, and delays to diagnosis of MPXV probably varied over the course of the epidemic, as evidenced by different estimates across regions and timeframes54,55,56,57,58. We therefore inferred the growth rate in addition to the effective reproduction number and used multiple approaches for the inference of each, arriving at consistent results for when the outbreak ended. Last, for our simulations, we restricted the population to MSM, assumed that the sexual partnership distribution estimated for the MSM population in the United Kingdom was a sufficient proxy for that in NYC, and did not account for network structure or changes in behavior. However, heterosexual transmission of MPXV has not been shown to be a major factor in the 2022 mpox epidemic, and a more heterogeneous population than the one modeled or substantive shifts in behavior could result in an epidemic that could end with even lower cumulative incidence.
The WHO declared an end to mpox as a PHEIC in May 2023; however, there have since been numerous outbreaks of MPXV, specifically clade Ia and clade Ib in west and central Africa, as well as continued transmission of clade IIb throughout the world. As a consequence of the clade I outbreaks and an upsurge of mpox in the Democratic Republic of the Congo and other countries in Africa, the WHO again declared mpox a PHEIC in August 2024. These 2024 outbreaks have impacted a broader demographic than the 2022 epidemic, including an increased proportion of cases among children and sex workers59,60, and could indicate that MPXV is capable of spreading across differently structured contact networks. It is therefore critical to deploy targeted and rapid vaccination strategies to mitigate current and future mpox outbreaks.
Methods
Ethics review
The activities pertaining to this study at NYC DOHMH are routine analysis of public health surveillance data and thus not subject to institutional review board (IRB) approval. The IRB of the University of California San Diego determined the MPXV and HIV analysis to be of minimal risk and as not qualifying as human subjects research per the Code of Federal Regulations, Title 45, part 46, and the UC San Diego Standard Operating Procedures and Policies.
Incidence and vaccine data
The number of reported mpox cases per country was downloaded from Our World in Data (OWID; https://ourworldindata.org/; last accessed on 6 February 2024).
The number of reported mpox cases and vaccines administered in NYC were generated by the NYC DOHMH (https://github.com/nychealth/monkeypox-data and https://github.com/nychealth/mpv-vaccine-data?tab=readme-ov-file, respectively; last accessed on 6 February 2024).
Genomic data and initial processing
We generated 1,144 MPXV sequences from 759 individuals as part of public health surveillance by the NYC DOHMH PHL61, which are available for download on NCBI as part of BioProject PRJNA949682.
We downloaded all available MPXV sequences from GISAID (Global Initiative on Sharing All Influenza Data; last accessed 23 May 2023) while excluding the NYC sequences matching ours from NCBI. From these genomes, we subsequently excluded any genomes sampled before 28 April 2022 or after April 2023 and those that did not belong to the clade II, B.1 lineage, resulting in one additional genome from NYC, 1,221 from the rest of the United States, and 2,338 from the rest of the world. A 2021 Maryland sample (NCBI accession number ON676708) was added to this dataset for use as an outgroup.
Genomes with fewer than 180,000 non-'N’ sites were subsequently excluded. We aligned the remaining MPXV genomes against the clade II reference genome (NC_063383) and masked the inverted terminal repeat regions, as well as repetitive and low complexity regions, using squirrel62 (https://github.com/aineniamh/squirrel). Given that some regions of a subset of genomes were still poorly aligned, we removed stretches of sequences with many Ns. Specifically, we iteratively examined each sequence using a window of size 100 nucleotides, and if the start and end of the window were each an N and more than ≥25% of the window were Ns, we masked the given window. Genomes that had either fewer than 10 (n = 1) or more than 90 (n = 10) substitutions relative to the reference genome were considered outliers and were excluded.
Sequences from individuals from NYC with multiple high-quality genomes were filtered to include only the earliest genome from each individual, and if there were multiple genomes with the same collection date for a given individual, the genome with the fewest ambiguities was chosen. The final dataset consisted of 4,044 genomes, with 757 genomes from NYC (756 generated by PHL), 1,127 from the broader United States, and 2,160 from the rest of the world.
The percentage of cases sequenced is based on the number of cases from OWID and the number of genomes from each region from 28 April 2022 to 28 April 2023. The number of genomes was calculated before filtering, with the exception of only counting each case from NYC with multiple genomes once.
The accession IDs for all genomes used can be found in Supplementary Table 1 (all) and Supplementary Table 2 (GISAID).
Phylogenetic analysis
We inferred a maximum likelihood tree with MAPLE v0.3.3 (ref. 63) using a general time-reversible model. We extracted the B.1 clade subtree and then converted it from a bifurcating tree to a multifurcating tree using the ape package64 in R v4.3.2 (ref. 65).
We performed time-calibration of the resulting multifurcating tree using a recently implemented model in BEAST v1.10 (ref. 66), which replaces the traditional likelihood with a more efficient likelihood approximation67 based on a Poisson model. In this approach, a starting tree scaled to substitutions per site is provided and the tree search is constrained such that only node heights and polytomy resolutions are sampled. We used a starting clock rate of 5.0 × 10−5 and a relatively uninformative gamma prior with a shape of 0.001 and scale of 1,000, as well as a nonparametric coalescent tree68 prior with 26 evenly placed grid-points between the most recently sampled sequence and 2 years prior to that time. We simulated 15 Markov chain Monte Carlo (MCMC) chains of 5 × 108 to 1 × 109 generations, subsampling every 25,000 iterations to continuous parameter log files and every 250,000 iterations to continuous tree files. The first 10–20% of each chain was discarded as burn-in for the analyses, and the chains were then combined using LogCombiner. The resulting parameter log and tree files were resampled every 500,000 and 5,000,000 states, respectively. Model convergence and mixing was assessed in Tracer v1.7.1 (ref. 69), and all effective sample sizes were >150.
To reconstruct the importation dynamics of MPXV, we used an asymmetric discrete-trait analysis model70 implemented in BEAST v1.10 with samples assigned to NYC, United States (excluding NYC), and global (excluding the United States in its entirety) locales. We used the resampled empirical tree distribution described above and simulated one MCMC chain of 1 × 106 generations, subsampling every 100 iterations to a continuous parameter log file and every 500 iterations to a continuous tree file. We discarded the first 10% of the chain as burn-in and resampled the trees every 1,500 states. A maximum clade credibility (MCC) tree was then generated with TreeAnnotater v1.10 using the subsampled trees. For individual posterior tree samples, an introduction into a region is defined when a node is assigned to a particular location with high probability from the posterior distribution and its parent node has a different location71. For the MCC tree, we define an introduction into a region when a node has a posterior probability >0.5 for the given location and its parent node has a posterior probability of ≤0.5 for the same location. Based on an established approach to identify the cluster sizes resulting from an introduction36,72, we performed a depth-first search starting at all internal nodes that correspond to an introduction of MPXV, traversing forward in time until a node corresponding to a different region (that is, an exportation) is encountered or there are no more nodes remaining to be explored. Transmission clusters are defined as introductions that led to at least two genomes, and introductions resulting in one genome are labeled singletons. When showcasing the time of importation for each transmission cluster, we include the entire 95% HPD intervals of the time of the most recent common ancestor (MRCA) of the transmission cluster and the time of the parent node of this MRCA.
We next sought to determine how many identifying mutations preceded each imported transmission cluster. Given that the only difference between the mutation tree and the time tree topologies is in the resolution of polytomies, we are able to convert the MCC tree from BEAST into a mutation tree with analogously resolved polytomies. Using the same approach described earlier except with this mutation tree, we identified introductions into NYC and the size of the imported transmission clusters (note that these are identical as when determined using the MCC tree). We then inferred the branch length leading to the MRCA of each transmission cluster in terms of the inferred number of substitutions per site. We rounded the number of inferred substitutions to the nearest whole number (Extended Data Fig. 2).
We further explored the transmission dynamics of MPXV through the application of an episodic birth–death sampling (EBDS) model in BEAST. This phylodynamic approach enables the investigation of dynamic changes in birth, death and sampling rates of individuals over discrete time epochs73. We conducted a joint phylogeny analysis, in which we inferred conditionally independent trees for all clusters consisting of at least 10 genomes (n = 13), all derived from the same EBDS model.
We implemented an EBDS model with a cut-off of 10 months, inferring time-varying birth and sampling rates across 40 epochs, each representing 1 week. The model assumes a constant death rate over time, no intensive sampling events at epoch transitions, and the removal of lineages upon sampling. Gaussian Markov random field priors are used for the birth and sampling rates to accommodate the substantial variability in effective reproduction number over time. The prior distribution for the constant death rate follows a log-normal distribution, with mean and standard deviation values estimated to yield a 95% confidence interval for infectious duration from 2 to 4 weeks.
For tree inference, we applied a general time-reversible nucleotide substitution model for all trees and a strict molecular clock model for each cluster, incorporating continuous-time Markov chain reference priors74 for the evolutionary rates.
We simulated 10 MCMC chains of 1 × 109 generations each, facilitated by Hamiltonian Monte Carlo transition kernels75, and subsampling every 1 × 105 iterations to continuous parameter log files. The first 10% of each chain was discarded as burn-in for the analysis, the resulting parameter log files were resampled every 1 × 106 states, and the chains were then combined using LogCombiner.
The effective reproduction number for each epoch is computed as
and the growth rate is calculated by
where λ, μ and ψ are functions of time (t) representing the birth rate, death rate and sampling rate, respectively.
Phylogenetic trees were processed using TreeSwift v1.1.42 (ref. 76) and visualized using baltic v0.2.2 (ref. 77).
Demographics analysis
Individual demographic and risk characteristics were abstracted by NYC DOHMH from medical records, which were later updated by NYC DOHMH epidemiologists during interviews. Correlates of membership in MPXV transmission clusters were assessed using logistic regression in R v4.0.2. Of the 756 people with a high-quality sequenced MPXV genome and in the PHL surveillance network, demographic and transmission risk metadata were available for 708 individuals. To achieve statistical convergence, we excluded all groups with fewer than 10 individuals (that is, 2 cis-women, 3 individuals with unknown gender, and 4 individuals residing in Staten Island) from our regression analysis, resulting in 699 included individuals.
HIV transmission network
Because HIV has been circulating in NYC since the 1970s and we are interested in the comparison of more recent transmission clusters, we used HIV-TRAnsmission Cluster Engine (HIV-TRACE)78 to infer these clusters based on genetic distance between these virus genomes, rather than identifying clusters using phylogeographic inference. The first reported sequence for each person was aligned to the HXB2 reference PR + POL sequence (positions 2253–5094) and trimmed to PR/partial-RT to positions 2253–3749. Sequences with ≥5% ambiguous nucleotides and those with ≤0.015 substitutions per site from the HXB2 reference sequence were excluded. Pairwise TN93 genetic distances was compared among the first reported viral sequences from 76,910 individuals residing in or receiving HIV care in NYC reported to the NYC DOHMH as of January 2024; an HIV transmission network was constructed using a distance threshold of 0.015 substitutions per site (resolving distances between ambiguous nucleotides for sequences with ≤1.5% ambiguities). For each clustered individual, we determined, first, whether their cluster had added a case diagnosed in 2021; and second, if they were directly genetically linked to a virus from a case diagnosed in 2021.
Matching between individuals with an MPXV and HIV diagnosis reported to DOHMH was performed based on a 36-key deterministic matching algorithm79. The relationship between MPXV cases and HIV cases reported to the NYC DOHMH was assessed using logistic regression when expected values were ≥5 and Fisher’s exact test when expected values were <5.
Transmission lineage size distribution fitting
We fit the following distributions to the MPXV transmission cluster sizes using Python v3.10.10: exponential, binomial, negative binomial, Pareto, power law and Yule–Simon. We used a minimum cluster size of 2 and truncated the probability distribution to the minimum cluster size. Fit was determined using the Akaike information criterion. Because much of the probability distribution was typically truncated during the fitting, to further examine which distributions best fit the transmission cluster sizes, we (1) subsequently simulated 10,000 transmission cluster sizes using the fit distributions; (2) we truncated the simulated data to the minimum cluster size; (3) we normalized the inferred and simulated cluster sizes based on the total number of cluster sizes for each; and (4) we calculated the Kullback–Leibler divergence between these two sets of normalized cluster sizes. We repeated this process for the HIV transmission cluster sizes.
Epidemiology
To supplement the phylogenetic analyses (see above) and characterize the growth and decline of the MPXV epidemic in NYC using more traditional epidemiological approaches, we inferred the time-varying reproduction number (also known as the effective reproduction number, Re) and the growth rate using EpiNow2 v1.5.2 (ref. 80), assuming a gamma-distributed generation time with a mean of 12.6 days, a standard deviation of 5.7 days and a maximum of 25 days, and a gamma-distributed incubation period with a mean of 9.1 days, a standard deviation of 3 days and a maximum of 20 days. We ran four chains for 2,000 iterations each, with a warmup of 400 iterations, a maximum tree depth of 12, and an average proposal acceptance probability ('adapt_delta') of 0.999 for the No-U-Turn Sampler81 used by Stan82. Given that the generation time and incubation period can vary over the course of an epidemic33, we also calculated Re and growth rate using the approach from ref. 83, which uses only case counts from the epidemic. For this approach, we ran four chains for 5,000 iterations each, with a warmup of 1,000 iterations, a maximum tree depth of 20 and an average proposal acceptance probability of 0.999. Note that the package from ref. 81 needed to be manually adjusted to enable the changing of maximum tree depth and average proposal acceptance probability. The time periods in which the effective reproduction number and growth rate reached their inflection points shown in Fig. 4 were calculated using the two approaches described here.
Simulating epidemics
We simulated mpox epidemics using Julia v1.10.2 based on the approach in ref. 14. In brief, this approach uses a branching process model of transmission among an MSM population. The MSM sexual network was characterized by a Weibull distribution with parameters that were estimated by fitting a Weibull distribution to the empirical degree distribution from the Natsal dataset84,85,86, which is composed of MSM who had at least one sexual partner over the course of 1 year. The simulations assumed a single starting case and in each case the virus was transmitted to their sexual partners with a constant probability per partner (that is, secondary attack rate [SAR]) over the infectious period.
To determine the size of the MSM population in NYC in 2022 for the simulations, we multiplied the proportion of MSM per county in NYC based on ref. 87 by the expected number of adult men living in each county88. We used the reported proportions of MSM for each county in NYC, but we used the average New York State proportion of MSM for Richmond County (Staten Island) because it was not listed in ref. 87. This resulted in an MSM population of approximately 235,000.
We performed 1,000 simulations with this population size, SARs of 0.1 and 0.4, and infectious periods of 14 and 21 days. The effective reproduction number was calculated at each infected case as per ref. 14. High-connectivity individuals were defined as those with a degree above a certain percentile (for example, 95th percentile, or the top 5% of connected individuals). We then calculated the proportion of high-connectivity susceptible people remaining as each case was infected.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
MXPV genomes from the NYC DOHMH PHL that the authors generated have been deposited at NCBI (BioProject PRJNA949682). The accession IDs for all MPXV genomes used can be found in Supplementary Table 1 (all) and Supplementary Table 2 (GISAID). The genomes available at NCBI are public; those at GISAID require registration with GISAID. HIV surveillance activities are protected by New York State Redisclosure Law Articles 21 and 27-F, which prevents the submission of HIV-1 genetic sequences to public databases. Data were shared with investigators under data use agreements with UC San Diego. Data requests can be made by emailing ftaki@health.nyc.gov, with an expected time for response of 1 week. Time to provide the data depends on the data request and the potential need of a data use agreement.
Code availability
Code is available at https://github.com/pekarj/nyc_mpox_phylogeography.
References
O’Toole, Á. et al. APOBEC3 deaminase editing in mpox virus as evidence for sustained human transmission since at least 2016. Science 382, 595–600 (2023).
Gigante, C. M. et al. Multiple lineages of monkeypox virus detected in the United States, 2021–2022. Science 378, 560–565 (2022).
Isidro, J. et al. Phylogenomic characterization and signs of microevolution in the 2022 multi-country outbreak of monkeypox virus. Nat. Med. 28, 1569–1572 (2022).
Paredes, M. I. et al. Underdetected dispersal and extensive local transmission drove the 2022 mpox epidemic. Cell 187, 1374–1386 (2024).
Ndodo, N. et al. Distinct monkeypox virus lineages co-circulating in humans before 2022. Nat. Med. 29, 2317–2324 (2023).
Borges, V. et al. Viral genetic clustering and transmission dynamics of the 2022 mpox outbreak in Portugal. Nat. Med. 29, 2509–2517 (2023).
Mathieu, E., Spooner, F., Dattani, S., Ritchie, H. & Roser, M. Mpox (monkeypox). Our World in Data https://ourworldindata.org/mpox (2022).
Centers for Disease Control and Prevention. Mpox outbreak global map. Retrieved 2 April 2024 from https://www.cdc.gov/poxvirus/mpox/response/2022/us-map.html (2022).
New York City Department of Health and Mental Hygiene. Monkeypox data. GitHub https://github.com/nychealth/monkeypox-data (2024)
Otterman, S. A monkeypox vaccine is available for at-risk New Yorkers. The New York Times (2022).
Laurenson-Schafer, H. et al. Description of the first global outbreak of mpox: an analysis of global surveillance data. Lancet Glob. Health 11, e1012–e1023 (2023).
World Health Organization. Multi-country outbreak of mpox, External situation report 12–14 December 2022. https://www.who.int/publications/m/item/multi-country-outbreak-of-mpox--external-situation-report-12--14-december-2022
Mitjà, O. et al. Monkeypox. Lancet 401, 60–74 (2023).
Endo, A. et al. Heavy-tailed sexual contact networks and monkeypox epidemiology in the global outbreak, 2022. Science 378, 90–94 (2022).
Jung, S.-M. et al. Dynamic landscape of mpox importation risks driven by heavy-tailed sexual contact networks among men who have sex with men in 2022. J. Infect. Dis. 231, e234–e243 (2025).
Xiridou, M. et al. The fading of the mpox outbreak among men who have sex with men: a mathematical modelling study. J. Infect. Dis. 230, e121–e120 (2024).
Murayama, H. et al. Accumulation of immunity in heavy-tailed sexual contact networks shapes mpox outbreak sizes. J. Infect. Dis. 229, 59–63 (2024).
Wertheim, J. O. et al. Social and genetic networks of HIV-1 transmission in New York City. PLoS Pathog. 13, e1006000 (2017).
Wertheim, J. O. et al. The global transmission network of HIV-1. J. Infect. Dis. 209, 304–313 (2014).
Leigh Brown, A. J. et al. Transmission network parameters estimated from HIV sequences for a nationwide epidemic. J. Infect. Dis. 204, 1463–1469 (2011).
Hughes, G. J. et al. Molecular phylodynamics of the heterosexual HIV epidemic in the United Kingdom. PLoS Pathog. 5, e1000590 (2009).
Romano, C. M. et al. Social networks shape the transmission dynamics of hepatitis C virus. PLoS One 5, e11170 (2010).
Copen, C. E. et al. Factors associated with syphilis transmission and acquisition among men who have sex with men: protocol for a multisite egocentric network study. JMIR Res. Protoc. 11, e40095 (2022).
Schmidt, R., Carson, P. J. & Jansen, R. J. Resurgence of syphilis in the United States: an assessment of contributing factors. Infect. Dis. 12, 1178633719883282 (2019).
Beale, M. A. et al. Genomic epidemiology of syphilis in England: a population-based study. Lancet Microbe 4, e770–e780 (2023).
Ogaz, D. et al. Mpox diagnosis, behavioral risk modification, and vaccination uptake among gay, bisexual, and other men who have sex with men, United Kingdom, 2022. Emerg. Infect. Dis. 30, 916–925 (2024).
Wertheim, J. O. et al. Growth of HIV-1 molecular transmission clusters in New York City. J. Infect. Dis. 218, 1943–1953 (2018).
Collura, R., O’Grady, T., Swain, C.-A., Patterson, W. & Rajulu, D. T. Molecular HIV clustering among individuals with mpox and HIV co-morbidity in New York State, excluding New York City. AIDS Res. Hum. Retroviruses 39, 601–603 (2023).
Shu, Y. & McCauley, J. GISAID: global initiative on sharing all influenza data – from vision to reality. Euro Surveill. https://doi.org/10.2807/1560-7917.ES.2017.22.13.30494 (2017).
Wertheim, J. O., Steel, M. & Sanderson, M. J. Accuracy in near-perfect virus phylogenies. Syst. Biol. 71, 426–438 (2022).
Moreira, A. A., Paula, D. R., Costa Filho, R. N. & Andrade, J. S. Jr Competitive cluster growth in complex networks. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 73, 065101 (2006).
Oster, A. M. et al. Identifying clusters of recent and rapid HIV transmission through analysis of molecular surveillance data. J. Acquir. Immune Defic. Syndr. 79, 543–550 (2018).
Chen, D. et al. Inferring time-varying generation time, serial interval, and incubation period distributions for COVID-19. Nat. Commun. 13, 7727 (2022).
Parag, K. V., Thompson, R. N. & Donnelly, C. A. Are epidemic growth rates more informative than reproduction numbers? J. R. Stat. Soc. Ser. A Stat. Soc. https://doi.org/10.1111/rssa.12867 (2022).
Polk, C. et al. Evaluation of a health system’s implementation of a monkeypox care model under the RE-AIM framework. Ther. Adv. Infect. Dis. 10, 20499361231158463 (2023).
Tsui, J. L.-H. et al. Genomic assessment of invasion dynamics of SARS-CoV-2 Omicron BA.1. Science 381, 336–343 (2023).
Liljeros, F., Edling, C. R., Amaral, L. A., Stanley, H. E. & Aberg, Y. The web of human sexual contacts. Nature 411, 907–908 (2001).
Jones, J. H. & Handcock, M. S. Sexual contacts and epidemic thresholds. Nature 423, 605–606 (2003).
Broido, A. D. & Clauset, A. Scale-free networks are rare. Nat. Commun. 10, 1017 (2019).
Holme, P. Rare and everywhere: perspectives on scale-free networks. Nat. Commun. 10, 1016 (2019).
Hubach, R. D. & Owens, C. Findings on the monkeypox exposure mitigation strategies employed by men who have sex with men and transgender women in the United States. Arch. Sex. Behav. 51, 3653–3658 (2022).
MacGibbon, J. et al. Mpox (monkeypox) knowledge, concern, willingness to change behaviour, and seek vaccination: results of a national cross-sectional survey. Sex. Health 20, 403–410 (2023).
World Health Organization. WHO Director-General declares the ongoing monkeypox outbreak a Public Health Emergency of International Concern. https://www.who.int/europe/news/item/23-07-2022-who-director-general-declares-the-ongoing-monkeypox-outbreak-a-public-health-event-of-international-concern (2022).
Zhang, X.-S. et al. Transmission dynamics and effect of control measures on the 2022 outbreak of mpox among gay, bisexual, and other men who have sex with men in England: a mathematical modelling study. Lancet Infect. Dis. 24, 65–74 (2024).
Brand, S. P. C. et al. The role of vaccination and public awareness in forecasts of Mpox incidence in the United Kingdom. Nat. Commun. 14, 4100 (2023).
Guzzetta, G. et al. The decline of the 2022 Italian mpox epidemic: role of behavior changes and control strategies. Nat. Commun. 15, 2283 (2024).
Clay, P. A. et al. Modelling the impact of vaccination and sexual behaviour adaptations on mpox cases in the USA during the 2022 outbreak. Sex. Transm. Infect. 100, 70–76 (2024).
Rao, A. K. et al. Use of JYNNEOS (smallpox and monkeypox vaccine, live, nonreplicating) for preexposure vaccination of persons at risk for occupational exposure to orthopoxviruses: recommendations of the Advisory Committee on Immunization Practices – United States, 2022. MMWR Morb. Mortal. Wkly Rep. 71, 734–742 (2022).
Priyamvada, L. et al. Serological responses to the MVA-based JYNNEOS monkeypox vaccine in a cohort of participants from the Democratic Republic of Congo. Vaccine 40, 7321–7327 (2022).
Wolff Sagy, Y. et al. Real-world effectiveness of a single dose of mpox vaccine in males. Nat. Med. 29, 748–752 (2023).
Dalton, A. F. et al. Estimated effectiveness of JYNNEOS vaccine in preventing mpox: a multijurisdictional case-control study – United States, August 19, 2022–March 31, 2023. MMWR Morb. Mortal. Wkly Rep. 72, 553–558 (2023).
Rosenberg, E. S. et al. Effectiveness of JYNNEOS vaccine against diagnosed mpox infection – New York, 2022. MMWR Morb. Mortal. Wkly Rep. 72, 559–563 (2023).
Deputy, N. P. et al. Vaccine effectiveness of JYNNEOS against mpox disease in the United States. N. Engl. J. Med. 388, 2434–2443 (2023).
Miura, F. et al. Estimated incubation period for monkeypox cases confirmed in the Netherlands, May 2022. Euro Surveill. 27, 2200448 (2022).
Ward, T., Christie, R., Paton, R. S., Cumming, F. & Overton, C. E. Transmission dynamics of monkeypox in the United Kingdom: contact tracing study. BMJ 379, e073153 (2022).
Guzzetta, G. et al. Early estimates of monkeypox incubation period, generation time, and reproduction number, Italy, May–June 2022. Emerg. Infect. Dis. 28, 2078–2081 (2022).
Madewell, Z. J. et al. Serial interval and incubation period estimates of monkeypox virus infection in 12 jurisdictions, United States, May–August 2022. Emerg. Infect. Dis. 29, 818–821 (2023).
Hinch, R. et al. Quantification of the time-varying epidemic growth rate and of the delays between symptom onset and presenting to healthcare for the mpox epidemic in the UK in 2022. Sci. Rep. 14, 19755 (2024).
Vakaniaki, E. H. et al. Sustained human outbreak of a new MPXV clade I lineage in eastern Democratic Republic of the Congo. Nat. Med. 30, 2791–2795 (2024).
World Health Organization. Mpox – African Region. https://www.who.int/emergencies/disease-outbreak-news/item/2024-DON528 (2024).
Akther, S. et al. Genomic epidemiology of monkeypox virus during the 2022 outbreak in New York City. Preprint at bioRxiv https://doi.org/10.1101/2024.07.23.604793 (2024).
O’Toole, Á. Squirrel. GitHub https://github.com/aineniamh/squirrel (2022)
De Maio, N. et al. Maximum likelihood pandemic-scale phylogenetics. Nat. Genet. 55, 746–752 (2023).
Paradis, E. & Schliep, K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics 35, 526–528 (2019).
Ripley, B. D. The R project in statistical computing. MSOR Connect. 1, 23–25 (2001).
Suchard, M. A. et al. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 4, vey016 (2018).
McCrone, J. T. et al. Context-specific emergence and growth of the SARS-CoV-2 Delta variant. Nature 610, 154–160 (2022).
Gill, M. S. et al. Improving Bayesian population dynamics inference: a coalescent-based model for multiple loci. Mol. Biol. Evol. 30, 713–724 (2013).
Rambaut, A., Drummond, A. J., Xie, D., Baele, G. & Suchard, M. A. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. 67, 901–904 (2018).
Lemey, P., Rambaut, A., Drummond, A. J. & Suchard, M. A. Bayesian phylogeography finds its roots. PLoS Comput. Biol. 5, e1000520 (2009).
Minin, V. N. & Suchard, M. A. Counting labeled transitions in continuous-time Markov models of evolution. J. Math. Biol. 56, 391–412 (2008).
du Plessis, L. et al. Establishment and lineage dynamics of the SARS-CoV-2 epidemic in the UK. Science 371, 708–712 (2021).
Stadler, T., Kühnert, D., Bonhoeffer, S. & Drummond, A. J. Birth–death skyline plot reveals temporal changes of epidemic spread in HIV and hepatitis C virus (HCV). Proc. Natl Acad. Sci. USA 110, 228–233 (2013).
Ferreira, M. A. R. & Suchard, M. A. Bayesian analysis of elapsed times in continuous‐time Markov chains. Can. J. Stat. 36, 355–368 (2008).
Shao, Y., Magee, A. F., Vasylyeva, T. I. & Suchard, M. A. Scalable gradients enable Hamiltonian Monte Carlo sampling for phylodynamic inference under episodic birth–death-sampling models. PLoS Comput. Biol. 20, e1011640 (2024).
Moshiri, N. TreeSwift: a massively scalable Python tree package. SoftwareX 11, 100436 (2020).
Dudas, G. Baltic – backronymed adaptable lightweight tree import code for molecular phylogeny manipulation, analysis and visualisation. GitHub https://github.com/evogytis/baltic (2021).
Kosakovsky Pond, S. L., Weaver, S., Leigh Brown, A. J. & Wertheim, J. O. HIV-TRACE (TRAnsmission Cluster Engine): a tool for large scale molecular epidemiology of HIV-1 and other rapidly evolving pathogens. Mol. Biol. Evol. 35, 1812–1819 (2018).
Drobnik, A. et al. Matching HIV, tuberculosis, viral hepatitis, and sexually transmitted diseases surveillance data, 2000-2010: identification of infectious disease syndemics in New York City. J. Public Health Manag. Pract. 20, 506–512 (2014).
Abbott, S. et al. epiforecasts/EpiNow2: 1.4.0 release. Zenodo https://doi.org/10.5281/ZENODO.3957489 (2023).
Hoffman, M. & Gelman, A. The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. J. Mach. Learn. Res. 15.1, 1593–1623 (2014).
Carpenter, B. et al. Stan: a probabilistic programming language. J. Stat. Softw. 76, 1 (2017).
Eales, O. et al. Appropriately smoothing prevalence data to inform estimates of growth rate and reproduction number. Epidemics 40, 100604 (2022).
Johnson, A. et al. National survey of sexual attitudes and lifestyles, 2010–2012. [Data collection.] 2nd edition. UK Data Service https://doi.org/10.5255/UKDA-SN-7799-2 (2024).
Johnson, A., Fenton, K., Wellings, K., Erens, B. & Field, J. National survey of sexual attitudes and lifestyles, 2000–2001. [Data collection.] UK Data Service https://doi.org/10.5255/UKDA-SN-5223-1 (2024).
University of Glasgow, MRC/CSO Social and Public Health Sciences Unit, University College London & London School of Hygiene and Tropical Medicine. National survey of sexual attitudes and lifestyles COVID-19 study, 2020. [Data collection.] UK Data Service https://doi.org/10.5255/UKDA-SN-8865-1 (2021).
Grey, J. A. et al. Estimating the population sizes of men who have sex with men in US states and counties using data from the American Community Survey. JMIR Public Health Surveill. 2, e14 (2016).
US Census Bureau. (n.d.) QuickFacts: New York. Retrieved 9 March 2023, from https://web.archive.org/web/20230309103836/https://www.census.gov/quickfacts/fact/table/NY/PST045222
Acknowledgements
The authors thank A. Endo for assistance in using his software to simulate mpox epidemics; R. Collura for suggestions related to the comparison of HIV and MPXV networks; and M. Harrington, F. Lim, N. Moshiri and M. Paredes for insightful comments. The authors gratefully acknowledge the authors from the originating laboratories and the submitting laboratories, who generated and shared through GISAID the viral genomic sequences and metadata on which this research is based (Supplementary Table 2). The authors also acknowledge support from Advanced Micro Devices, Inc., with the donation of parallel computing resources used for this research. J.O.W. and J.E.P. were supported in part by an NIH-NIAID R01 (AI135992). J.E.P. was also supported in part by the University of California, San Diego Merkin Fellowship. T.I.V. was supported by the Branco Weiss Fellowship. Y.S., P.L. and M.A.S. were supported in part by an NIH-NIAID R01 (AI153044) and U19 (AI135995). HIV surveillance work at DOHMH was supported by CDC (PS21-2102). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript. The remaining authors received no specific funding for this work.
Author information
Authors and Affiliations
Contributions
J.E.P. and J.O.W. conceived the research. J.E.P., Y.W., J.C.W., L.A.F., Y.S., P.L., M.A.S. and J.O.W. analyzed the data. J.C.W., L.A.F., H.A., T.C., F.T. and L.V.T. generated and managed the data. J.E.P. performed the simulations. J.E.P. and J.O.W. wrote the first draft of the manuscript. All authors contributed to the interpretation of results and manuscript editing. J.O.W. supervised the work.
Corresponding authors
Ethics declarations
Competing interests
J.O.W. receives contracts through his institution from the Centers for Diseases Control and Prevention related to HIV public health surveillance. M.A.S. receives grants and contracts from the US Food & Drug Administration, the US Department of Veterans Affairs and Johnson & Johnson, all outside the scope of this work. The other authors declare no competing interests.
Peer review
Peer review information
Nature Medicine thanks Kathy Leung, Carla Mavian and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Anna Maria Ranzoni, in collaboration with the Nature Medicine team.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Importations and exportations of MPXV in the United States.
a, The estimated number of MPXV importations into and exportations out of the United States per week. Center line indicates median value, and shaded areas show 95% highest posterior densities (HPDs) of the estimates shaded areas show 95% highest posterior densities (HPDs) of the estimates. b, Cumulative number of MPXV importations into and exportations out of the United States. The violins represent the 95% HPDs and the black bars indicate the median values.
Extended Data Fig. 2 Posterior probability of importations.
The posterior probability of each importation cluster, excluding singletons, compared against its posterior probability of being imported into NYC. Note that each importation event must appear as an importation in at least 50% of the posterior samples. Points are colored by cluster size, and their shapes indicate the number of substitutions separating the import from its parent node in the mutation tree. Jitter has been applied at posterior probability values of 1 to make individual points more visible.
Extended Data Fig. 3 Mutation tree of MPXV.
Branches are colored based on locations inferred from the phylogeography.
Extended Data Fig. 4 Estimated importation times for NYC transmission clusters, including singletons.
The left limit of the solid line is the median time of the parent node of the MRCA of each transmission cluster and the right limit is the median tMRCA of the transmission cluster. The left limit of the dotted line is the lower bound of the 95% HPD of the time of the parent node and the right limit is the upper bound of the 95% HPD of the tMRCA. For singletons, the right limit of the solid line is the date of sampling, and there is no dotted line extending rightward.
Extended Data Fig. 5 HIV transmission cluster sizes.
Inset shows frequency of cluster sizes of at least 20.
Extended Data Fig. 6 Year of HIV diagnosis for individuals diagnosed with mpox and living with HIV.
The bars are colored by whether the individuals are members of mpox clusters.
Extended Data Fig. 7 Effective reproduction number and growth rate of MPXV.
Inferred growth rate of MPXV based on the case data whilst incorporating generation time and incubation period into the inference (EpiNow2, blue), solely on the case counts (REACT, yellow), or MPXV genomes in a phylodynamic approach using an episodic birth-death-sampling model (EBDS; red). Dashed line indicates inflection point (that is, when the epidemic is in decline; growth rate=0). Lines and shaded regions represent the median and 95% confidence interval, respectively. The EBDS model assumes week-long intervals with a uniform growth rate per interval.
Extended Data Fig. 8 Effective reproduction number and depletion of highly connected susceptibles of simulated MPXV epidemics.
The effective reproduction number (Re) after observing given cumulative infections with varying infectious periods and an SAR of (a) 0.1 and (b) 0.4. Lines and shaded regions represent the median and 95% quantile range, respectively.
Supplementary information
Supplementary Tables 1–2
Accession numbers of genomes used in this study.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Pekar, J.E., Wang, Y., Wang, J.C. et al. Transmission dynamics of the 2022 mpox epidemic in New York City. Nat Med 31, 1464–1473 (2025). https://doi.org/10.1038/s41591-025-03526-9
Received:
Accepted:
Published:
Issue date:
DOI: https://doi.org/10.1038/s41591-025-03526-9