Main

Arboviruses pose a substantial threat to public health, with the World Health Organization (WHO) reporting close to 5.5 billion people at risk globally1,2. ZIKV is an arbovirus of concern that is mainly transmitted via Aedes mosquitoes but also human-to-human via sexual transmission3. Initially discovered in Uganda in 1947, ZIKV was only detected on the African and Asian continents until the mid-2000s4. The first outbreak of ZIKV outside these regions occurred in the Federated States of Micronesia in 2007 (ref. 5), with the virus subsequently spreading to the Americas6,7. An increase in cases of ZIKV, microcephaly, congenital Zika syndrome (CZS) and Guillain–Barré syndrome in Brazil in 2015 led to the declaration of a Public Health Emergency of International Concern (PHEIC) by the WHO from February to November 2016 (refs. 8,9,10).

While many ZIKV infections are asymptomatic3,5,11, the serious complications in the fetus and negative pregnancy outcomes associated with ZIKV infection during pregnancy, including CZS and its association with Guillain–Barré syndrome and other neurological disorders, underscore why future ZIKV re-emergence remains a global public health concern10. To date, no licensed vaccines or therapeutics exist, and although ZIKV cases have declined globally since 2017, they continue to be detected worldwide12. ZIKV remains listed among the pathogens with pandemic potential and is prioritized for research and development by the WHO13,14 and the UK Health Security Agency (UKHSA)15.

Mathematical models play a critical role in outbreak response and risk assessment by providing insights into the potential location, timing and magnitude of disease transmission. These tools can help guide the selection of sites for vaccine or therapeutic trials and inform the prepositioning of protocols and other preparedness activities. Central to these activities is the need for quantitative estimates of key epidemiological parameters, such as the reproduction number, epidemiological delays and seroprevalence, together with their variation and a catalogue of existing mathematical models. Here, we conducted a systematic review to generate a comprehensive database to aid the global modelling and public health community in their response to future ZIKV outbreaks. This systematic review is part of a set of reviews of nine WHO priority pathogens16,17,18,19 and contributes to a wider central, up-to-date and accessible database of information critical to mathematical modelling for timely outbreak response.

Results

From the 27,491 studies identified through the database search, 11,845 studies were screened following de-duplication, of which 101 were identified through backwards citation screening of the systematic reviews. A total of 10,500 studies were excluded at the abstract/title screening stage, and we undertook full-text review of the remaining 1,343 studies, of which 574 met the criteria for inclusion (Fig. 1 and Supplementary Table A1). We extracted 159 outbreak records, 229 models and 1,334 parameters, of which 127 outbreak records, 154 models and 969 parameters belonged to publications that met the quality score ≥ 50% and are included in our main analysis. All extracted data and the full list of publications included in the main text of this review are available in the R package ‘epireview’ (version 1.4.5)20. The full analysis, including all publications, regardless of their quality assessment score, is provided in Supplementary Figs. B25B28). Information about the extracted models is provided in Table B7 and Fig. B4.

Fig. 1: Flow chart for study selection.
figure 1

Studies were screened according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (Supplementary Information). Reasons for exclusion at the abstract or title stage were not collected by the Covidence software.

Outbreaks

Forty studies reported 127 outbreak records between 2006 (Thailand) and 2021 (India) from 29 countries across the Americas (n = 97), Oceania (n = 13), Asia (n = 7), Africa (n = 4) and Europe (n = 2; Fig. 2 and Supplementary Table B8). The largest number of records was reported in Colombia across several subregions (n = 55), followed by French Polynesia (n = 12) and Suriname (n = 11). The largest ZIKV outbreaks were reported in Puerto Rico between 2015 and 2016 (with 39,717 suspected cases, of which 36,390 were laboratory-confirmed) and in Colombia between 2015 and 2017 (with 108,087 suspected cases, 62% of which were among women, and 9,802 of which were laboratory-confirmed).

Fig. 2: ZIKV outbreak and seroprevalence mapping.
figure 2

a, Countries with studies reporting ZIKV outbreak information, coloured by number of studies. Outbreaks reported in France and the Netherlands reflect outbreaks in overseas regions. Countries with black diagonal stripes indicate locations where ZIKV transmission has been reported by the WHO78. b, Geolocated areas or regions with ZIKV seroprevalence studies (using IgG assay, HAI/HI, MIA, NS1 BOB ELISA, IFA, capture ELISA and neutralization assays) conducted in the general population in the Americas (left), Africa (middle) and Asia (right). Each dot represents a location-specific estimate, while shaded areas indicate estimates at the administrative-unit level (region, province, district or entire country). Base maps adapted from GADM79. NA, not applicable.

Parameters

The 969 extracted parameter estimates came from 281 studies, comprising seroprevalence (n = 354), risk factors (n = 242), reproduction numbers (n = 117), CZS proportion (n = 53), epidemiological delays in humans (n = 66), attack rates (n = 52), extrinsic incubation period (n = 22), evolutionary rate (n = 15), substitution rate (n = 3), pregnancy loss proportion (n = 23), case fatality ratio (CFR; n = 6), proportion of symptomatic cases (n = 10), relative contributions to transmission (n = 2) and growth rates (n = 4) (Supplementary Fig. B3).

Seroprevalence

A total of 139 studies reported 354 ZIKV seroprevalence estimates from serosurveys conducted between 1980 (Nigeria21) and 2022 (El Salvador22) in 59 countries across the Americas (n = 143), Africa (n = 97), Asia (n = 62), Europe (n = 36) and Oceania (n = 16). All seroprevalence estimates extracted in this study are reported in Supplementary Table B10. Almost one-third of these estimates were from surveys of the general population (n = 107), while the rest were from pregnant women (n = 58), suspected ZIKV infection (n = 64), children (n = 35), mixed population groups (n = 35), blood donors (n = 10) and other at-risk populations (n = 43) (Supplementary Figs. B9 and B10). The timing of serosurveys coincided with the major epidemics (Supplementary Figs. B9 and B10). The majority of ZIKV seroprevalence estimates were derived from immunoglobulin G (IgG) assays (n = 139) and neutralization tests (n = 109; Supplementary Fig. B11).

Seroprevalence estimates varied widely from 100% in a hospital-based study of 29 individuals in Cúcuta, Colombia23, to 0% in local regions in Austria24, Iran25 and Taiwan26, and specific locations within Brazil27,28, Bolivia29, Kenya30, Indonesia31 and Thailand32. Within Brazil, ZIKV exposure in Salvador (Bahia) ranged between 7% (95% CI: 4–10%) in 2014 and 63% (95% CI: 60–65%) in 2015 (ref. 33), consistent with the timing of the major outbreak in the Americas. In contrast, ZIKV seroprevalence in Rio de Janeiro was reported to be 3% in 2018 (ref. 34). In Asia, a national serosurvey conducted in Thailand during 2017–2020 yielded an average seroprevalence of 2.8%35, but estimates were as high as 23.5% from a serosurvey conducted in 2011–2012 in Nakhon Sawan city36. In Africa, serosurveys conducted from 2013 to 2017 reported ZIKV seroprevalence less than 11.5% in several sites of Kenya30, 42.1% in Gabon37, 10.9% in Cabo Verde38 and ranging from 3.1% to 43.3% in Mali25.

Transmissibility

We extracted estimates of the basic reproduction number R0 (n = 77), human-to-human (sexual) \({R}_{0}^{\rm{h}}\) (n = 14) and vector-borne \({R}_{0}^{\rm{v}}\) (n = 9), effective reproduction number Re (n = 15) and sexual \({R}_{\rm{e}}^{\rm{h}}\) (n = 2) from 40 studies. The methods used to estimate R0 and Re varied across the different studies. Compartmental models were used most frequently (n = 45), and among these, the next-generation matrix method was used for 27 estimates. Branching processes were used for 16 estimates. Most estimates were from the 2013–2014 outbreaks in French Polynesia (n = 24), and the 2015–2016 epidemics in Brazil (n = 26) and Colombia (n = 15).

The majority (63/77) of the central estimates of R0 were between 1.5 and 4 (Fig. 3). Estimates of the other transmission type-specific reproduction numbers are shown in Supplementary Figs. B5 and B7 and Table B12).

Fig. 3: ZIKV basic reproduction number estimates.
figure 3

R0 estimates by geographic location, type of sample and population group. Sub-national estimates for Brazil, Colombia and French Polynesia are shown in separate panels because of the high number of estimates; all other estimates are shown in the respective continent panels. Overall country estimates for Brazil, Colombia and French Polynesia are shown in the corresponding continent panels. Points are central estimates reported in the published studies (defined as mean, median or unspecified central as estimated in the extracted paper), error bars are 95% confidence or credible intervals, and thicker shaded bars are ranges of central estimates over disaggregated groups. The grey vertical dashed line marks 1. When multiple estimates for the same location were available, the estimates were jittered. The sample sizes for each study shown in the figure are reported in Supplementary Table B12.

Epidemiological delays

We analysed 66 human epidemiological delays from 29 studies and 22 extrinsic incubation period estimates from 12 studies (Fig. 4). The 11 central intrinsic incubation period estimates ranged from 4 days (95% CI: 3.15–5.48) in Australes, French Polynesia in 2013–2014 (ref. 39) to 12.1 days in Brazil in 2016 (ref. 40), although roughly half of the reported estimates (n = 6/11) were between 5 and 7 days. The 15 estimates of the human infectious period were highly variable, ranging from central estimates of 3 days in Brazil in 2016 (ref. 40) to 50 days among asymptomatic cases in Florida, United States41 in July to September 2016. We extracted only two estimates from two studies of the serial interval: a mean of 7.4 days (95% CI: 4.59–10.2) in Singapore in August to November 2016 (ref. 42) and of 32.9 days in French Polynesia between October 2013 and October 2016 (ref. 43). The single extracted estimate for the generation time was estimated from data collected in the French West Indies (Guadeloupe, Martinique and Saint-Martin) in 2015–2017 and was described by a mean of 2.5 weeks (s.d. 0.7).

Fig. 4: Epidemiological delays in human and mosquitoes.
figure 4

ae, Estimates by location and type of sample of Zika intrinsic incubation period (a), infectious period (b), serial interval, symptom onset to admission and symptom onset to recovery (c) and admission to discharge or recovery (d). Estimates by location and study of ZIKV extrinsic incubation period in the mosquito (e). Points are the central estimates reported in the studies (defined as mean, median or unspecified central estimates as shown in the extracted paper), error bars are 95% confidence or credible intervals, and shaded bars are ranges of central estimates over disaggregated groups. When multiple estimates for the same location were available, the estimates were jittered. The sample sizes for each study shown in the figure are reported in Supplementary Table B9.

The 22 central estimates of the extrinsic incubation period ranged from 5.1 (ref. 44) to 24.2 days44,45. The central estimates of the time from symptom onset to admission to care ranged from 1 (interquartile range (IQR) 1–2) day among hospitalized patients in Colombia46 to 8 (IQR 4–21) days among travellers47. The central estimates of time from admission to recovery or discharge ranged from 4 (ref. 48) to 23 (IQR 9.5–36)23 days. All epidemiological delays extracted in this study are listed in Supplementary Table B9.

Severity and adverse birth outcomes

We extracted 53 estimates of CZS proportion given a ZIKV-infected mother from 36 studies. The CZS proportion was highly variable across studies (Fig. 5a), 92.5% (49/53) of estimates were from the Americas, and all estimates were obtained from data collected between January 2015 and February 2020.

Fig. 5: Proportion of CZS.
figure 5

a, Estimates of reported CZS probabilities by pregnancy stage of the infected mother of an infant with CZS. ‘Unspecified’ is used when no specific pregnancy stage was reported. Points are central estimates (defined as mean, median or unspecified central values as estimated in the extracted paper), solid lines are confidence or credible intervals, and shaded segments are ranges of central estimates across disaggregated groups. b, Meta-analysis of CZS risk stratified by pregnancy stage of the infected mother of an infant with CZS. ‘Unspecified’ is used when no specific pregnancy stage was reported. We used generalized linear mixed-effect model (GLMM) estimates of CZS proportion. Red squares represent the observed study effect sizes, and the solid black horizontal lines are CIs. Blue diamonds represent pooled common and random-effect estimates by trimester and overall. Common effect estimates assume that all aggregated data come from a single data-generating process with one common CZS proportion, and overall random-effect estimates allow the CZS proportion to vary by study and give different weights to each study in the overall estimate. The ‘Events’ column indicates the reported number of CZS cases. The sample sizes for each study shown in the figure are reported in Supplementary Table B16. Statistical analyses were performed using the chi-squared test for heterogeneity, with two-sided significance. Between-study variance was estimated using tau², and I² indicates the proportion of total variation due to heterogeneity rather than chance. Figures report the P value for each group, rounding to <0.0001 when values fall below this threshold in the meta-analysis.

Among the 53 estimates, 39 estimates from 24 studies were eligible for meta-analysis based on our stringent meta-analysis inclusion criteria (see ‘Data analysis’ in the Methods). The pooled CZS proportion, restricted to non-trimester-specific estimates, was 4.65% (95% CI: 3.38–6.37%) using the random-effects model (I2 = 77.10%; Supplementary Fig. B33). Stratifying the meta-analysis by trimester of maternal ZIKV exposure showed a higher pooled estimate of CZS proportion in the first trimester (6.50%, 95% CI: 1.31–26.51%, I2 = 86.10%) compared to the second (4.96%, 95% CI: 2.91–8.34%, I2 = 57.40%) or third (0.71%, 95% CI: 0.06–8.22%, I2 = 0.00%) trimester (Fig. 5b). Sensitivity analyses of the CZS proportion by continent showed that the estimate for the Americas (4.43%, 95% CI: 3.21–6.07%, I2 = 78.10%) was lower than that from travel-based studies in Europe (16.13%, 95% CI: 6.88–33.37%, I2 = 0.00%); Supplementary Fig. B31). The random-effects estimate for Brazil (5.80%, 95% CI: 3.74–8.91%, I2 = 85.20%) was marginally higher than the overall estimate (Supplementary Fig. B32).

We extracted 23 central estimates of the proportion of pregnancy loss associated with maternal ZIKV infection, which were derived from 17 studies and ranged from 0%49,50,51 to 25%52 (Supplementary Fig. B14). Of these, we included only 14 studies (20 estimates) in the meta-analysis because of our more stringent meta-analysis inclusion criteria (see ‘Data analysis’ in the Methods). The pooled overall estimates of the proportion of pregnancy loss were 2.48% (95% CI: 1.62–3.78%, I2 = 91.10%) using the random-effects model (Supplementary Fig. B33). When we stratified based on the time of pregnancy loss, the pooled estimate of miscarriage proportion (2.04%, 95% CI: 1.21–3.43%, I2 = 64.70%) was marginally lower than the pooled estimate for general pregnancy loss (3.16%, 95% CI: 1.76–5.61%, I2 = 95.40%); Supplementary Fig. B34).

The CFR estimates (Supplementary Fig. B15 and Table B14) extracted from five studies conducted between 2015 and 2017 in the Dominican Republic53, the West Indies54, Brazil55,56 and Colombia57 varied from 0.04% reported in a population-based study of 5,161 individuals in the Dominican Republic in 2016 (ref. 53), to 38% in a small hospital-based cohort of 16 persons under investigation in Brazil in 2016 (ref. 56). The proportion of symptomatic individuals (Supplementary Fig. B16 and Table B14) was reported in three studies, yielding ten estimates ranging from 23.2% in pregnant women in French Guiana58 to 71% (95% CI: 66–76%) in children in the Society Islands (French Polynesia)59. Of the three studies, two had sample size information and thus were included in our meta-analysis, which showed a pooled estimate of the proportion of symptomatic cases of 51.20% (95% CI: 38.00–64.23%, I2 = 96.70%; Supplementary Figs. B35 and B36).

Discussion

In this systematic review, we collated and analysed epidemiological parameters, transmission models and outbreak records of ZIKV to inform future modelling in response to ZIKV outbreaks. One of the clearest findings of this review is the concentration of Zika-related publications within a narrow time window—from the 2015 outbreak in Brazil to the onset of the coronavirus disease 2019 pandemic in 2020. The 2015–2016 ZIKV epidemic in Latin America marked a turning point in ZIKV research: before this, only 6 of the 574 studies included in this review were published, none of which involved modelling. With the emergence of ZIKV as a global health concern, research activity surged, particularly following the outbreak in Brazil in 2015 and the subsequent PHEIC declaration, resulting in a sharp increase in publications (often with a quality assessment score < 50%; Supplementary Fig. B2). Since 2020, however, the volume of new studies has declined despite continued ZIKV circulation, likely due to reduced data availability and shifts in funding priorities after the end of the PHEIC.

Among the 354 estimates characterizing ZIKV exposure through seroprevalence included in this review, there is a marked heterogeneity in geographic representation, with the Americas accounting for the majority of the published seroprevalence estimates. While these statistics reflect the attention that ZIKV received during the 2015–2016 epidemic in the Americas, they also underscore gaps in our understanding of the population-level immunity of populations outside these regions, which is critical to evaluate global outbreak risks now and in the future, and to design clinical trials for future vaccine and therapeutics. Seroprevalence estimates should also be interpreted with caution, as variations in assay sensitivity, specificity and cross-reactivity—especially with other flaviviruses like dengue—could bias the results. Our analysis has demonstrated that ZIKV is moderately to highly transmissible (R₀ typically 1.5–4, up to >7), with both vector-borne and sexual transmission contributing to infection. In combination with a pooled CZS proportion of 4.65% and a pooled pregnancy loss proportion of 2.48%, continued monitoring is essential to track outbreak potential and protect vulnerable populations. Moreover, our meta-analysis on the proportion of ZIKV-infected symptomatic cases suggests that up to half of ZIKV infections may go undetected in surveillance systems that rely on symptomatic reporting (51.20%, 95% CI: 38.00–64.23%, I2 = 96.70%).

Estimates of CZS proportion vary widely due to differences in study design (that is, recruitment based on confirmed infection versus retrospective serology), population characteristics, case definitions and methods used for the identification of ZIKV infections during pregnancy. Several studies from which we extracted CZS proportion were not specifically designed to assess the CZS proportion in recently infected mothers (that is, studies tested for past exposure by IgG assays rather than recent infections through IgM detection). Our meta-analysis produced an overall estimate of 4.65% (95% CI: 3.38–6.37%, I2 = 77.10%), which is slightly higher than other published estimates60,61,62,63, although the CIs overlap. We also found only one published review that conducted a trimester-specific meta-analysis of the proportion of CZS64 including only two studies65,66. Our meta-analysis of CZS proportion by trimester, based on seven studies, yielded higher estimates than those published in Gallo et al.64, with consistently higher estimates in earlier trimesters.

Although we extracted 969 parameters from the literature, it was not possible to perform meta-analyses across most parameters (other than CZS, pregnancy loss probabilities and proportion of symptomatic cases) due to low numbers of comparable estimates, unavailability of sample size information and substantial heterogeneity in study contexts and methodologies. For example, only one estimate of the generation time and two estimates of the serial interval have been published. We also noted substantial variation in the terminology and methods used for estimation in studies reporting attack rates and reproduction number estimates, where basic and effective reproduction numbers were occasionally defined interchangeably, and in studies reporting genomic information, where mutation rates (per generation) and evolutionary or substitution rates (per year) were often conflated.

This study has limitations. We included only peer-reviewed publications in English, which may have led to the exclusion of relevant studies published in other languages and in grey literature or outbreak reports. Studies reporting outbreak dynamics or seroprevalence estimates that did not meet the formal inclusion criteria were excluded67,68,69,70,71; hence, the outbreak data reported in this systematic review should not be considered a complete compendium, as several sources of ZIKV outbreaks exist outside peer-reviewed publications, specifically PAHO72 and national Ministries of Health. Moreover, while we attempted to extract disaggregated parameter estimates where available, for feasibility purposes, we prioritized summarizing data over fully disaggregated information (for example, by age or sex; Supplementary Information). All information is, however, included in our open-source database (also in Supplementary Tables B7B16) that is accessible via the R package ‘epireview’ to support future reuse. Finally, data extraction was conducted by a team of 16 reviewers using detailed, standardized guidelines73. Although double extractions were conducted at early stages, a residual risk of inconsistency and human error remains.

Despite these limitations, this study has several strengths as it summarizes the information published in 418 studies on 984 epidemiological parameters, 154 models and 127 outbreak records, which provides the most thorough quantitative overview of ZIKV epidemiology and disease risk available to date. As well as documenting key epidemiological parameters needed for outbreak response and preparedness planning, the richness of the data collated in this systematic review allows us to provide global, regional and country-level estimates of the proportion of adverse effects of ZIKV, including CZS, pregnancy loss and symptomatic cases.

In summary, this study presents a dynamic, open-source resource designed to support continuous updates by the research community to provide timely quantitative information on ZIKV epidemiology. By enabling the development of data-driven and evidence-based analytical, modelling and computational tools, this resource will aid mathematical modellers, public health officials and policymakers in outbreak prevention, preparedness, response and disease control planning.

Methods

We followed the PRISMA guidelines and registered our study protocol with PROSPERO (International Prospective Register of Systematic Reviews, CRD42023393345). The checklist is included in the Supplementary Information.

Search and screening

We searched PubMed and Web of Science for studies published from database inception to 31 October 2024. Results were imported into Covidence74 and de-duplicated. Titles, abstracts and then full texts were independently screened by two reviewers, and conflicts were resolved by consensus. The Cohen’s Kappa scores for screening and full-text review are reported in Supplementary Fig. A1. Non-peer-reviewed literature and non-English language studies were excluded. We also excluded papers with co-infection studies, comments, letters, case reports, case studies and those that reported estimates from other papers. Systematic reviews (n = 66) were excluded from extraction (Supplementary Information Section A), but peer-reviewed literature identified through backwards citation screening meeting the selection criteria was included. Further details on inclusion and exclusion criteria are provided in the Supplementary Information.

Data extraction

Of the full texts meeting the inclusion criteria, 16% (n = 91) were randomly selected for double extraction to ensure concordance between a team of 16 extractors. After consensus was reached, reviewers independently conducted single extractions on the remaining full texts.

Data were extracted using a custom-made Microsoft Access database (version 2305; Supplementary Tables A4A6). We collected information on publication details and three categories: transmission models, outbreaks and epidemiological parameters. In the transmission model category, we extracted information on the mathematical or statistical models of transmission used in the included papers. In the outbreak category, we reported outbreak data only when it was newly observed, rather than derived from another publication. Finally, the epidemiological parameters included were basic and effective reproduction numbers, epidemiological delays (the intrinsic incubation period, the extrinsic incubation period, the time between symptom onset to hospitalization or from symptom onset to outcome), CFRs, attack rates, growth rates, evolutionary, mutation and substitution rates, the proportion of symptomatic cases, overdispersion (quantifying heterogeneity in transmission), seroprevalence, proportion of CZS/microcephaly in newborns born to infected mothers, proportion of pregnancy loss in infected pregnant women, relative contributions to transmission from different routes and risk factors related to Zika or CZS. Pregnancy loss proportion was extracted as either the proportion of miscarriage if the study explicitly reported this, or general pregnancy loss, which included both the sum of miscarriage and stillbirth and pregnancy loss with no specified information on the trimester of loss.

We extracted estimates of the basic and effective reproduction numbers (\({R}_{0}\) and \({R}_{e}\), representing the average number of secondary infections generated by a case in a fully susceptible and partially susceptible population, respectively), the basic vector-borne reproduction number (\({R}_{0}^{\rm{v}}\), representing the average number of new infections generated via vector-borne transmission in a fully susceptible population) and the basic and effective sexual reproduction numbers (\({R}_{0}^{\rm{h}}\) and \({R}_{e}^{\rm{h}}\), representing the average number of new cases generated via sexual transmission in a fully susceptible and partially susceptible population, respectively).

We used a custom quality assessment tool to assess the quality of each study (Supplementary Table A2). Full details on the data extraction process and extracted data are provided in the Supplementary Information.

Data analysis

The analysis was conducted in R software using the ‘orderly2’ workflow package (version 1.99.14)75. The quality score of each study was calculated as the proportion of ‘yes’ answers to the total number of applicable questions in the quality assessment tool for that study. Only parameters with a quality score ≥ 50% were included in the analyses presented in the main text. Sensitivity analyses including all studies, regardless of the quality assessment score, are presented in Supplementary Information Section B4.

We conducted meta-analyses for the proportion of symptomatic cases, proportion of CZS and proportion of pregnancy loss among confirmed ZIKV-infected mothers. For the latter two, we applied more meta-analysis inclusion criteria by only including estimates with at least ten pregnant women with confirmed ZIKV infection and with a study design that did not select for the outcome (CZS or miscarriage), using the ‘meta’ R package (version 8.2-1)76. We used random-effect and fixed-effect models using the logit link function to calculate pooled estimates for both parameters, with 95% CIs and I2 heterogeneity estimates. We report the random-effects estimates in the paper, and the fixed-effect estimate is reported in meta-analysis plots (Fig. 5 and Supplementary Information Section B5). We estimated the pooled CZS and pregnancy loss probabilities by country, continent and sample population type using maximum likelihood to estimate the between-studies variance. Subgroup analyses were considered to assess differences in CZS and pregnancy loss probabilities. Funnel plots assessing publication bias are shown in Supplementary Figs. B29 and B30.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.