Abstract
Background
The spread of SARS-CoV-2 has been influenced by multiple factors, from the inherent transmission capabilities of the different variants to the control measurements put in place. Understanding how new variants enter a country is essential for managing future outbreaks. This study investigates how three major variants—Alpha, Delta, and Omicron (BA.1)—entered Spain and how different restrictions potentially affected their introduction.
Methods
We collected Spanish and international SARS-CoV-2 genomes from the GISAID database. Leveraging connectivity data from different countries with Spain, we performed a phylodynamic Bayesian analysis of the SARS-CoV-2 introductions into Spain.
Results
Most introductions of the Alpha variant originated from France. As travel restrictions eased, the number of introductions from different countries increased. During the Delta and Omicron waves, the United Kingdom and Germany became important sources of the virus. The highest number of introductions occurred during the Delta wave, coinciding with fewer travel restrictions and the summer season, when Spain receives a considerable number of tourists.
Conclusions
Our findings highlight the role of international travel in the spread of new variants. They underscore the importance of monitoring travel patterns and implementing targeted public health measures to manage the spread of SARS-CoV-2.
Plain language summary
The spread of SARS-CoV-2, the causal agent of COVID-19, has been influenced by various factors. In this study, we examined how three major variants (Alpha, Delta, and Omicron-BA.1) entered Spain and how different restrictions potentially affected their arrival. We combined SARS-CoV-2 genetic data with travel information to trace how the different variants were introduced into Spain. We found that most Alpha cases came from France. As travel restrictions eased, more cases arrived from the United Kingdom and Germany. The highest number of introductions occurred during the Delta wave, when travel was least restricted and tourism was high. These findings suggest that international travel plays a key role in the spread of SARS-CoV-2. Monitoring travel patterns and adjusting public health measures should help control future outbreaks.
Similar content being viewed by others
Introduction
The COVID-19 pandemic caused by SARS-CoV-2 resulted in governments worldwide imposing control measures of different severity. The extent of these restrictions varied with time and country and, in some cases, by region1,2,3. Such was the case in Spain, with its decentralized healthcare system that allowed each of its 17 regions or autonomous communities to impose their own limitations. Thus, depending on the period and region, the COVID-19 measures in Spain were more or less restrictive4. Apart from those regional measures, the Spanish central government also imposed several nationwide restrictions, such as a general lockdown and mandatory mask-wearing, and coordinated the vaccination campaigns in the different regions. SARS-CoV-2 sequencing efforts in Spain (Fig. 1a) also varied with time and region, leading to regional differences in the number of sequences available for each period.
a Number of sequences from Spain available in the GISAID database until June 11, 2022. Bar colors represent different lineages or variants, as explained in the legend. Shaded areas represent the periods studied for each wave. b Number of diagnosed cases (left-axis; black line) and deaths (right-axis; red line) of COVID-19 in Spain. Data taken from OurWorldInData50. c Spain’s Oxford Stringency Index51, representing how strict the measures were in Spain at any given moment (the index represents the response level of the strictest region within Spain). Data taken from OurWorldInData (https://ourworldindata.org/) which adapted the original data collected by the Oxford Covid-19 Government Response Tracker (OxCGRT). Shaded areas represent periods in which masks were mandatory outside.
From the start of the pandemic up to the beginning of 2022, Spain suffered six epidemic waves of COVID-19 (Fig. 1b). The first wave, which started around March 2020, following the first introductions of SARS-CoV-2 into the country, resulted in a national lockdown imposed to limit viral transmission in order to avoid overwhelming the healthcare system. Despite not being the most significant wave in terms of the number of cases, it was the largest in terms of deaths. Several factors contributed to these statistics, such as the novelty of the disease, for which there was no vaccine or previous immunity, an aging population, and the virus’ early entrance into nursing homes.
After the lifting of the lockdown on June 21, 2020, tied to the end of the first epidemic wave, the loosened restrictions (Fig. 1c) favored the start of a new wave of infections towards the end of the summer (Fig. 1b). This second wave was associated with the spread of the B.1.177 lineage from Spain and across Europe5,6 and lasted until the end of 2020. Around this time, the Alpha variant of concern (VOC) was introduced in Spain, spreading with the rest of the preexisting variants and dominating the third wave. Despite the containment measures, increased social connections during the Christmas holidays likely sustained this spread, which resulted in the highest peak of cases until that point7. The pre-existing immunity and the stringent containment measures put in place, probably aided in limiting the extension of the fourth wave towards April 2021, also dominated by Alpha (Fig. 1a, b). While the vaccination campaign started around the same time as the first Alpha wave, it is not clear whether it was sufficiently advanced to have an effect on the posterior spread of Alpha. During the two Alpha epidemic waves, there were also some anecdotal detections of other VOCs (namely Beta and Gamma) or variants of interest (VOIs; e.g., Eta and Zeta)7.
The fifth epidemic wave, linked to the introduction of the Delta VOC, occurred during the summer of 2021. It was potentially facilitated by the lifting of certain control measures, such as the outdoor mask-wearing mandates8, and resulted in a new peak in the number of cases, although lower than in the third wave. Subsequently, the end of 2021 encompassed the last moments of Delta and the start of the sixth wave, characterized by the introduction of a diversity of Omicron VOC lineages, starting with BA.1 and its sublineages (Fig. 1a). This wave presented the highest number of detected cases of the pandemic in Spain and a notable number of deaths, although much less than in the first three epidemic waves.
Here, we focus on the study of waves three to six of the pandemic in Spain, considering the predominant VOCs during those periods: Alpha, Delta, and Omicron-BA.1. The first year of the pandemic in Spain has already been studied4,5,7,9, but the origin and importance of virus importations after 2020 have yet to be described. Hence, we here shed light on the events of the following waves (until the start of Spring 2022) that we subsequently discuss in the context of the control measures in place during those periods.
France is the country that shows the highest number of introductions into Spain during the pandemic. Remarkably, the summer season, together with lower levels of restrictions, is the period that leads to the highest number of total introductions, with the origins of the virus diversifying as the epidemic waves progress and control measures are lifted.
Methods
Number of sequences sampled per country
To study the history of Alpha, Delta, and Omicron-BA.1 introductions in Spain, we selected worldwide SARS-CoV-2 whole-genome sequences from GISAID based on the prior probability of their sampling location resulting in introductions to Spain. To calculate this probability, we used aggregated Facebook mobility data10 from January 2021 to March 2022 (Supplementary Fig. 1), which reflects the number of Facebook users transiting from one country to another.
First, we computed the connectivity from all available countries in the Facebook data to Spain for each of the three studied periods (i.e., Alpha, Delta, and Omicron-BA.1) and ranked them from most to least connected to Spain. Then, we selected those most-connected countries that encompassed 95% of the total mobility to Spain. For each country, we calculated the number of sequences proportionately to the cumulative number of cases times the fraction of connectivity to Spain. We set an arbitrary threshold of 6.5 sequences per 10,000 cases, with a minimum number of sequences per country of 100 (see Truong Nguyen et al.)11. For Omicron-BA.1, we set a maximum number of 500 sequences for each location for the analysis to remain computationally feasible, as the peak of cases in most countries increased the number of selected sequences.
We also included Delta sequences from India to account for the geographical origin of the Delta VOC12. We already had selected a number of Alpha sequences from the United Kingdom (UK)13 for our dataset, and for Omicron-BA.1, although the first cases were detected in South Africa, there is no clear country of origin14. We could not obtain mobility data from India to Spain, so to compute the number of Indian sequences to be sampled, we used public information about the number of flights between Spain and India during the Delta period15.
In addition, we selected 1000 sequences from multiple Spanish regions, with the number of sequences per region chosen proportionately to the number of detected positive cases. As in previous studies11, we set these thresholds arbitrarily to obtain a manageable dataset that could be studied under a Bayesian framework within a reasonable computational time.
Subsampling of SARS-CoV-2 sequences from GISAID
We downloaded the metadata for all the Alpha, Delta and Omicron-BA.1 sequences available in GISAID16 that met the following GISAID criteria: were “complete” (i.e., >29,000 nt); came from a human host; were associated with a complete collection date (year-month-day); and, for Alpha and Delta, were “high coverage” (<1% of missing bases and <0.05% of unique amino acid mutations). For Omicron-BA.1, we only excluded those with “low coverage” (>5% missing bases). However, we did not enforce the “high coverage” filter due to the substantial number of sequences that would have been discarded. We downloaded the Alpha and Omicron-BA.1 data on 2022-06-11 and the Delta data on 2022-04-26. The sampled periods were between weeks 52 of 2020 and 32 of 2021 (2020-12-20 to 2021-08-14) for Alpha, between weeks 25 and 50 of 2021 (2021-06-20 to 2021-12-18) for Delta, and between weeks 49 of 2021 and 13 of 2022 (2021-12-05 to 2022-04-03) for Omicron-BA.1.
Based on the metadata and the number of sequences determined from each country or Spanish region, we downloaded an even number of weekly sequences from GISAID. We only considered sequences that passed Nextclade quality tests and presented a “good” overall status, and which were assigned to the corresponding Nextclade clade and PANGO lineage (clade 20I/lineage B.1.1.7 for Alpha, clades 21A, 21I & 21J/lineage B.1.617.2 and sublineages (AY) for Delta, and clade 21K, lineage BA.1 and sublineages for Omicron). For Omicron, we limited the studied lineages to BA.1 and descendants because their pandemic wave was still ongoing when we initiated this study.
We used seqkit v2.1.017 to remove identical sequences, aligned the sequences with Nextalign v.1.11.018, and estimated a maximum likelihood phylogenetic tree with IQ-TREE v.2.1.319, using 2,000 ultrafast bootstraps20 under the best-fit substitution model selected by ModelFinder21. We then used TreeTime v.0.8.122 with the option “clock” to detect and subsequently remove “temporal outliers”, which involves sequences with divergences that are unexpected given their sampling time. We repeated this subsampling when possible until all sequences passed the filters. The final number of analyzed sequences was 3024, 3390, and 4271 for Alpha, Delta, and Omicron-BA.1, respectively.
Bayesian analysis of introductions
We inferred viral introductions in Spain using BEAST v1.10.5 (prerelease #23570d1)23. For these analyses, we followed the same strategy as Gallego-García et al.24 and employed a phylogeographic generalized linear model (GLM) in which we incorporated the country of origin of the sequences as a discrete trait and the aggregated connectivity matrix between those countries as a covariate6,25. The connectivity matrix was computed based on the Facebook international mobility data previously described and was log-transformed and standardized before being included in the analysis.
We performed Markov chain Monte Carlo (MCMC) analyses in BEAST using a timed tree estimated by TreeTime (with options --max-iter 10 and --reroot least-squares) as the starting tree, HKY as the substitution model26 with empirical base frequencies and rate variation among sites following a discretized gamma distribution with four rate categories27. We fixed the evolutionary rate at 7.5 × 10-4 nucleotide substitutions/site/year28 and used a Bayesian non-parametric coalescent (skygrid) model as tree prior29. Missing covariate values in the connectivity matrices were integrated out using efficient gradient approximations30 to the trait data likelihood31. All the BEAST analyses were run in the Galician Supercomputing Center (CESGA) in combination with the BEAGLE (v.4)32 high-performance computational library to accelerate the computations.
We simulated a minimum of seven independent MCMC chains per dataset, discarded at least 10 percent of each chain as burn-in, and combined the results with LogCombiner23. We ran the chains until all continuous parameters presented effective sample sizes (ESSs) > 200 for Alpha and ESSs > 100 for Delta and Omicron-BA.1. After combining the chains, they totaled 638 M generations for Alpha, 794 M for Delta, and 1388 M for Omicron-BA.1. The transition histories and their timings were summarized from the posterior tree distributions using the tool TreeMarkovJumpHistoryAnalyzer provided within the BEAST codebase6,33.
Statistics and reproducibility
All Bayesian analyses were computed using BEAST v1.10.5 (prerelease #23570d1), as described in the previous section. All downstream analyses and plotting were performed using the R software (v. 4.1.3)34. The 95% highest posterior density (HPD) intervals represented in plots were computed using the HDInterval package35.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Results
Alpha
The analyses revealed that during the Alpha wave period, there were at least 99 independent introduction events (95% highest posterior density [HPD]: 83–113) from other countries to Spain (Fig. 2a, b). By far the most introductions to Spain during the Alpha period came from France (83, 95% HPD: 70–97). However, most introductions produced fewer than ten sampled descendants (84, 95% HPD: 68–96). Of those, around half were singletons, meaning there was only one sampled descendant (48, 95% HPD: 39–56), and thus most introductions did not produce substantial local transmission. On the other hand, 15 introductions (95% HPD: 11–19) resulted in a sizeable number of descendants (i.e. >10 sampled tip nodes) and these introductions originated almost exclusively in France.
Circular migration flow plots based on the Markov jumps between different countries and Spain (a: Alpha; c: Delta; e: Omicron-BA.1). Mean and number of transitions from those countries to Spain, computed from all the trees from the posterior distribution. The error bars represent the 95% highest posterior density (HPD). The width of the violin plots represents the density of observations, derived from 638 M, 794 M, and 1388 M generations for Alpha, Delta, and Omicron-BA.1, respectively (b: Alpha; d: Delta; f: Omicron-BA.1). Codes ISO 3166-2 AD, IN, UK, and US correspond with Andorra, India, the United Kingdom and the United States of America, respectively.
Delta
In comparison with Alpha, during the Delta period there was an increase in the number of introductions and the number of originating countries (Fig. 2c, d). We detected 202 (95% HPD: 186-221) introductions, with France again being the most important source of viral introductions (117, 95% HPD: 99–133). The number of introductions from the UK increased considerably (38, 95% HPD: 31–44), followed by Germany (14, 95% HPD: 7–20) and the Netherlands (11, 95% HPD: 5–15). There were also introductions from the USA, Italy, Portugal, and Belgium. Again, most introductions did not lead to substantial transmission clusters (187 tip nodes, 95% HPD: 170–205), with most being singletons (127, 95% HPD: 112–142). However, we did not detect introductions from India to Spain (or vice versa), despite it being the geographical origin of the Delta variant (Fig. 2d).
Omicron-BA.1
In the case of the first wave of Omicron (BA.1 and its sublineages) (Fig. 2e, f), there was a decrease in the number of identified introduction events (114, 95% HPD: 100–128) compared to the Delta period, although in both periods many introductions were small (104, 95% HPD: 90–118), primarily singletons (76, 95% HPD: 62–88). France was again the primary source of introductions, with around 27 (95% HPD: 18–35) estimated introductions, but the number of detected introductions from the UK and Germany was quite similar, with 24 (95% HPD: 17–31) and 22 (95% HPD: 14–28) respectively. Italy and the USA were also important sources of Omicron-BA.1 introductions to Spain. In this case, we also detected at least five introductions (95% HPD: 2–9) of SARS-CoV-2 from Portugal to Spain.
Discussion
The introductions of SARS-CoV-2 to Spain for the Alpha, Delta, and Omicron-BA.1 VOCs (epidemic waves three to six) likely reflects the changing landscape of COVID-19 restrictions and containment measures (Fig. 1c), as well as the mobility peaks linked to vacation periods, notably around August (Supplementary Fig. 1).
Contrary to the previous SARS-CoV-2 waves, most of the Alpha, Delta, and Omicron-BA.1 introductions came from France. For example, for the first wave, most of the detected introductions came from Italy, the Netherlands, England, and Austria9. This large contribution from France could have been shaped by the combination of a high number of cases and a high connectivity to Spain, making it the country with the largest number of representative sequences in this study, after Spain. However, we also have to take into account that, together with Portugal, which is much smaller and is connected primarily by terrestrial ways to Spain, France was the primary way of entrance to Spain from the rest of Europe, especially in a context where air connections were much more restricted. Peninsular Spain also shares borders with Andorra and the UK (via Gibraltar), but their much smaller population and territory renders their importance at a national level negligible.
This predominance of French introductions was especially noticeable during the Alpha period (third and fourth waves), when, despite the movement restrictions put in place by Spain and France, almost all of the introductions into Spain (>80%) came from France. At the same time, we detected markedly fewer introductions from other countries with high connectivity with Spain. Towards the Alpha period, Spain and France were under stringent containment measures, including curfews and perimeter closures that restricted mobility around their territories. Apart from this, movements across the border were restricted to essential ones, such as cross-border workers and freight transports, or required the presentation of a negative PCR test (or a vaccination certificate, when these became more widely available). Despite these aspects and the recommendations to reduce traveling, the Spanish-French border was not outright closed, so there could still be some mobility, allowing a steady exchange of travelers and, therefore, of viruses. The influx of viruses from France was especially prominent towards the beginning of the Alpha period and declined as time passed, without peaks that could be linked to holiday periods such as the Holy Week. On the other hand, the border with Portugal was closed from the end of January to the beginning of May, except for exceptional reasons, which could help explain why we did not detect almost any introductions from Portugal to Spain. We must also consider that Portugal is much less populated than Spain and France, which also relates to fewer infected people, and its geographical location, not bordering any other country. All these factors combined could help to explain Portugal’s limited role as a source of introductions, at least towards Spain.
Additionally, in the case of the UK, there were several extra restrictions from both sides to travel between Spain and the UK due to the latter being the origin of the Alpha VOC13,36. While these measures aimed to limit SARS-CoV-2 transmission and the number of introductions, they differed for both countries. While Spain banned UK travelers from entering the country unless they were Spanish nationals or residents, the UK imposed a self-quarantine period upon return, in both cases combined with other restrictions in the national territory. According to our analyses, these measures could have helped limit direct introductions from the UK to Spain.
The peak of Alpha introductions took place towards the beginning of the study period, while during the last weeks, i.e., in summer, as mobility increased (Supplementary Fig. 1A) and restrictions eased, it was already being replaced by Delta. Meanwhile, the sharp increase in introductions to Spain during the Delta wave could be linked to the fact that it started during the summer. This period was characterized by increased international tourism (Supplementary Fig. 1B) and higher national mobility (Supplementary Fig. 2), while the level of restrictions was the lowest until that point. Lifting the outdoor mask mandates and appropriate social distancing, combined with the relaxation derived from the vaccination rollout, may have helped to propel this peak in introductions as more people decided to travel. Also, at the end of September, towards the middle of the Delta period, with around 80% of the Spanish population already vaccinated (Supplementary Fig. 3), restrictions were further relaxed, and several regions entered the so-called “new normality” without specific restrictions.
The increase in tourism and lowered restrictions is likely reflected in the fact that, although France still ranked first in the number of introductions, it only represented around 58% of all introductions, a sharp decrease compared to the Alpha period. On the other hand, lifting the travel restrictions from and to the UK could explain its second position as a virus source. The sources of introductions during this period were much more diverse than during the Alpha wave, which could also be related to the lower level of restrictions across Europe. The increased transmissibility of Delta37,38, as well as the higher risk of infection by this variant after vaccination39, could also help explain the rapid variant replacement. Despite this, the peak in cases was slightly lower for Delta, the number of deaths was dramatically reduced, and an even higher percentage of the introductions were non-expansive, which means that pre-existing immunity in the population (by previous infections or vaccination), as well as some of the measures in place, may have played a role in restricting Delta spread.
Lastly, the Omicron-BA.1 period or sixth wave, which was associated with lower national mobility than for the Delta period, albeit higher than for Alpha (Supplementary Fig. 2) and not linked either with a significant peak in international tourism (Supplementary Fig. 1C), showed fewer introductions than the Delta wave. Nonetheless, with the lowest stringency of measures in place, despite the mandatory use of the mask outdoors again, the number of detected introductions was still higher than for Alpha. The fact that most of the introductions were non-expansive appears to contrast with the number of cases of this wave, which was the highest of the whole pandemic in Spain. This higher number of cases, on the other hand, might be explained by the increased transmissibility of Omicron-BA.1 compared to the previous VOCs40, to a reduced sense of risk from the population, when everyone who wanted had already been vaccinated, and to the so-called “pandemic fatigue”41,42.
The reasons for the limited number of Omicron-BA.1 introductions in Spain could be multiple. One possible explanation is that the Omicron period, taking only BA.1 into account, is much shorter than the Alpha and Delta period (35, 26, and 16 weeks for Alpha, Delta, and Omicron-BA.1, respectively). Additionally, the proportion of sequenced cases respective to the total cases was the lowest of the three waves (Supplementary Fig. 4), as the sequencing reached peaks similar to Delta while the number of cases skyrocketed. This low proportion of sequenced cases could potentially limit the detection of introductions in the country, as many transmission chains could be undetected or underrepresented.
The sources of Omicron-BA.1 introductions were considerably more diverse than for previous VOCs. Although still in first place, France was only accountable for 24% of the introductions, and a very similar number of introductions was detected from the UK and Germany, in line with a lower level of preventive measures and travel limitations. We should note that even though the connectivity from France to Spain was much higher than that of the rest of the countries, the same number of Omicron-BA.1 sequences from France, Germany, Italy, and the UK were present in the study, as they reached the limit of 500 sequences per country of origin (Supplementary Table 1). The much larger number of initially selected sequences for this variant is related to the peak of cases that Omicron caused in the connected countries, as well as in Spain, and limiting those numbers was necessary to maintain the analysis computationally feasible. Even so, we expect the number of sequences from each country in the study to have a limited influence over the number of introductions as, for example, for Delta we included more UK sequences than French ones. Still, we detected three times as many introductions from France as from the UK. We should then consider that the proportion of German and British travelers that caused an introduction of SARS-CoV-2 in Spain had to be substantially higher and could be related to the different anti-COVID-19 measures put in place by each region, as the regions that receive most of those travelers are not the same. In the case of France, the mobility was higher towards Catalonia (Supplementary Fig. 5A), in the northeast of Spain and bordering France, which imposed strict control measures during the Christmas period and January, including nocturnal curfews, the closure of nightlife and the mandatory presentation of the COVID-19 certificate. On the other hand, the highest mobility from Germany and the UK occurred towards the Canary Islands (Supplementary Fig. 5B, C), which only required presenting the COVID-19 certificate. These differences in mobility trends, coupled with the asymmetric containment measures within Spain, could explain the lower proportion of French introductions compared to the previous waves and possibly highlight the importance and impact of those regional measures.
One trend common to the three VOCs studied was that most introductions produced only a handful of descendants. This could be due to the various prevention and containment measures, which greatly limited the interaction between people without masks for long periods, possibly reducing the odds of a massive transmission event. Apart from this, quarantines after testing positive were mandatory at the beginning and then at least recommended, as well as maintaining social distancing and mask-wearing, which could also help deter local transmission. This situation is also typical of SARS-CoV-2 transmission, as there is significant variability in the number of descendants produced, which means most of the introductions into a territory tend to make a low number of descendants, while some others are responsible for most of the cases43,44,45. It is also important to highlight that most of the detected introductions happened in the first stages of the studied periods or even a bit before them, so these variants may have already been circulating in the country before they were detected.
The number of introductions into Spain per variant also highlights the different impacts of external introductions during the pandemic, depending on the country and the specific VOC. In the case of Delta, having detected 202 individual introductions after using 1000 Spanish sequences, it nearly doubles the capacity of Alpha and Omicron-BA.1 of establishing local transmission chains and is noticeably higher than the one observed in the UK for the same variant (1458 detected introductions for 52,992 sampled sequences46). Conversely, the contribution of the Omicron-BA.1 introductions was only slightly higher relative to the UK (6455 detected introductions for 81,039 sampled sequences44) but lower than in Mexico (160 detected introductions for 641 sequences47).
Lastly, we acknowledge that the mobility data we used present limitations as they depend on the number of users sharing their location with the application and, therefore, may only be representative of some of the population. Nonetheless, this type of data has already been used to study SARS-CoV-2 transmission linked to human mobility6,11,48, and can still be considered a good approximation that should reduce the sampling bias in our analysis. Indeed, other types of sampling bias still exist, such as the difference in sequencing effort and strategy between countries and between Spanish regions, which can be substantial (Supplementary Fig. 6). Despite this, in most cases, we believe the number of sequences available relative to the number of sequences used was large enough to ensure an adequate distribution of sampling dates and locations, mitigating this source of bias.
Another caveat is that it is not easy to back up some of the conclusions about the impact of the control measures on the number of introductions and their local spread using more quantitative analyses. This is because control policies were not always applied consistently or in the same period across Spanish territories and cities. Besides, the time series of the introductions is very sparse, diminishing statistical power. All in all, our analysis of the SARS-CoV-2 introductions in Spain suggests that the control measures might have reduced further importations of the virus into the country. Moreover, given that the number of cases does not correlate with the number of introductions and the relatively small number of descendants detected for most introductions, it seems that foreign importations had a limited impact on the progression of the pandemic in Spain.
Data availability
All the SARS-CoV-2 genomic sequences and associated metadata used in this study are available in the GISAID’s EpiCoV database (https://gisaid.org/) (EPI_SET_ID: EPI_SET_240610da). Source data for the main figures can be found in Supplementary Data 1. All other data are available from the corresponding author upon request.
Code availability
The following software was used to perform the analysis. For the sequences quality control and lineage assignment: Nextclade (v.2.3.1), Pangolin (v.4.1.3). For the nucleotide alignment: Nextalign (v.1.11.0). For phylogenetic analyses: IQ-TREE (v.2.1.3), TreeTime (v.0.8.1), BEAST (v.1.10.5, prerelease #23570d1), BEAGLE library (v.4). And for general data processing R version 4.1.3 (2022-03-10), using standard libraries. Custom code used for this paper has been made available through Zenodo49 to ensure long-term accessibility.
References
European Union Agency for Fundamental Rights. Coronavirus Pandemic in the EU—Fundamental Rights Implications. Bulletin #1, 1 February–20 March 2020. https://data.europa.eu/doi/10.2811/009602 (2020).
Bergquist, S., Otten, T. & Sarich, N. COVID-19 pandemic in the United States. Health Policy Technol.9, 623 (2020).
Baniasad, M., Golrokh Mofrad, M., Bahmanabadi, B. & Jamshidi, S. COVID-19 in Asia: Transmission factors, re-opening policies, and vaccination simulation. Environ. Res. 202, 111657 (2021).
García-García, D. et al. Assessing the effect of non-pharmaceutical interventions on COVID-19 transmission in Spain, 30 August 2020 to 31 January 2021. Eurosurveillance 27, 2100869 (2022).
Hodcroft, E. B. et al. Spread of a SARS-CoV-2 variant through Europe in the summer of 2020. Nature 595, 707–712 (2021).
Lemey, P. et al. Untangling introductions and persistence in COVID-19 resurgence in Europe. Nature 595, 713–717 (2021).
Iftimie, S. et al. Characteristics of hospitalized patients with SARS-CoV-2 infection during successive waves of the COVID-19 pandemic in a reference hospital in Spain. Sci. Rep. 12, 1–8 (2022).
Real Decreto-ley 13/2021. 76290–76294 https://www.boe.es/eli/es/rdl/2021/06/24/13 (2021).
López, M. G. et al. The first wave of the COVID-19 epidemic in Spain was associated with early introductions and fast spread of a dominating genetic variant. Nat. Genet. 53, 1405–1414 (2021).
Maas, P. Facebook Disaster Maps: Aggregate Insights for Crisis Response & Recovery. In: Proc. 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD ’19). https://doi.org/10.1145/3292500.3340412 (Association for Computing Machinery, 2019).
Truong Nguyen, P. et al. The phylodynamics of SARS-CoV-2 during 2020 in Finland. Commun. Med. 2, 65 (2022).
Vaidyanathan, G. Coronavirus variants are spreading in India - what scientists know so far. Nature 593, 321–322 (2021).
Kraemer, M. U. G. et al. Spatiotemporal invasion dynamics of SARS-CoV-2 lineage B.1.1.7 emergence. Science 373, 889–895 (2021).
WHO. Updated Working Definitions and Primary Actions for SARS-CoV-2 Variants. https://www.who.int/publications/m/item/historical-working-definitions-and-primary-actions-for-sars-cov-2-variants (2023).
Strohmeier, M., Olive, X., Lübbe, J., Schäfer, M. & Lenders, V. Crowdsourced air traffic data from the OpenSky Network 2019–2020. Earth Syst. Sci. Data 13, 357–366 (2021).
Elbe, S. & Buckland-Merrett, G. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Glob Chall 1, 33–46 (2017).
Shen, W., Le, S., Li, Y. & Hu, F. SeqKit: a cross-platform and ultrafast toolkit for FASTA/Q file manipulation. PLoS ONE11, e0163962 (2016).
Hadfield, J. et al. Nextstrain: real-time tracking of pathogen evolution. Bioinformatics 34, 4121–4123 (2018).
Minh, B. Q. et al. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530–1534 (2020).
Hoang, D. T., Chernomor, O., von Haeseler, A., Minh, B. Q. & Vinh, L. S. UFBoot2: improving the ultrafast bootstrap approximation. Mol. Biol. Evol. 35, 518–522 (2018).
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017).
Sagulenko, P., Puller, V. & Neher, R. A. TreeTime: maximum-likelihood phylodynamic analysis. Virus Evol. 4, vex042 (2018).
Suchard, M. A. et al. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 4, vey016 (2018).
Gallego-García, P. et al. Dispersal history of SARS-CoV-2 in Galicia, Spain. JMV 96, e29773 (2024).
Gill, M. S., Lemey, P., Bennett, S. N., Biek, R. & Suchard, M. A. Understanding past population dynamics: bayesian coalescent-based modeling with covariates. Syst. Biol. 65, 1041–1056 (2016).
Hasegawa, M., Kishino, H. & Yano, T. Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. J. Mol. Evol. 22, 160–174 (1985).
Yang, Z. Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods. J. Mol. Evol 39, 306–314 (1994).
du Plessis, L. et al. Establishment and lineage dynamics of the SARS-CoV-2 epidemic in the UK. Science 371, 708–712 (2021).
Gill, M. S. et al. Improving Bayesian population dynamics inference: a coalescent-based model for multiple loci. Mol. Biol. Evol. 30, 713–724 (2013).
Didier, G., Glatt-Holtz, N. E., Holbrook, A. J., Magee, A. F. & Suchard, M. A. On the surprising effectiveness of a simple matrix exponential derivative approximation, with application to global SARS-CoV-2. Proc. Natl. Acad. Sci. USA. 121, e2318989121 (2024).
Ji, X. et al. Gradients do grow on trees: a linear-time O(N)-dimensional gradient for statistical phylogenetics. Mol. Biol. Evol. 37, 3047–3060 (2020).
Ayres, D. L. et al. BEAGLE 3: improved performance, scaling, and usability for a high-performance computing library for statistical phylogenetics. Syst. Biol. 68, 1052–1061 (2019).
Minin, V. N. & Suchard, M. A. Counting labeled transitions in continuous-time Markov models of evolution. J. Math. Biol 56, 391–412 (2008).
R. Core Team. R: A Language and Environment for Statistical Computing. (R Foundation for Statistical Computing, Vienna, Austria, 2022).
Meredith M. K. J. Highest (Posterior) Density Intervals [R Package HDInterval Version 0.2.4]. (Comprehensive R Archive Network (CRAN), 2022).
Orden PCM/1237/2020. 118673–118677 https://www.boe.es/eli/es/o/2020/12/22/pcm1237/con (2020).
Campbell, F. et al. Increased transmissibility and global spread of SARS-CoV-2 variants of concern as at June 2021. Eurosurveillance 26, 2100509 (2021).
Earnest, R. et al. Comparative transmissibility of SARS-CoV-2 variants Delta and Alpha in New England, USA. Cron. Med. 3 (2022).
Andeweg, S. P. et al. Elevated risk of infection with SARS-CoV-2 Beta, Gamma, and Delta variants compared with Alpha variant in vaccinated individuals. Sci. Transl. Med. 15, eabn4338 (2023).
Elliott, P. et al. Rapid increase in Omicron infections in England during December 2021: REACT-1 study. Science 375, 1406–1411 (2022).
Du, Z. et al. Pandemic fatigue impedes mitigation of COVID-19 in Hong Kong. Proc. Natl Acad. Sci.119, e2213313119 (2022).
World Health Organization. Regional Office for Europe. Pandemic Fatigue – Reinvigorating the Public to Prevent COVID-19: Policy Framework for Supporting Pandemic Prevention and Management. https://iris.who.int/handle/10665/335820 (2020).
Borges, V. et al. SARS-CoV-2 introductions and early dynamics of the epidemic in Portugal. Commun. Med.2, 1–11 (2022).
Tsui, J. L.-H. et al. Genomic assessment of invasion dynamics of SARS-CoV-2 Omicron BA.1. Science 381, 336–343 (2023).
Dellicour, S. et al. Variant-specific introduction and dispersal dynamics of SARS-CoV-2 in New York City - from Alpha to Omicron. PLoS Pathog. 19, e1011348 (2023).
McCrone, J. T. et al. Context-specific emergence and growth of the SARS-CoV-2 Delta variant. Nature 610, 154–160 (2022).
Castelán-Sánchez, H. G., Martínez-Castilla, L. P., Sganzerla-Martínez, G., Torres-Flores, J. & López-Leal, G. Genome evolution and early introductions of the SARS-CoV-2 Omicron variant in Mexico. Virus Evol. 8, veac109 (2022).
Kraemer, M. U. G. et al. Mapping global variation in human mobility. Nat. Hum. Behav.4, 800–810 (2020).
Gallego et al. SARS-CoV-2 in Spain. https://doi.org/10.5281/zenodo.15350090.
Mathieu, E. et al. Coronavirus Pandemic (COVID-19). OurWorldInData.org https://ourworldindata.org/coronavirus (2020).
Hale, T. et al. A global panel database of pandemic policies (Oxford COVID-19 Government Response Tracker). Nat. Hum. Behav. 5, 529–538 (2021).
Acknowledgements
The authors gratefully acknowledge all data contributors, i.e., the authors and their originating laboratories responsible for obtaining the specimens and their Submitting laboratories for generating the genetic sequence and metadata and sharing via the GISAID Initiative. PGG was supported by grant ED481A-2021/345 from the Consellería de Cultura, Educación e Universidade Xunta de Galicia. SD acknowledges support from the Fonds National de la Recherche Scientifique (F.R.S.-FNRS, Belgium; grant no. F.4515.22). SD and GB acknowledge support from the Research Foundation—Flanders (Fonds voor Wetenschappelijk Onderzoek—Vlaanderen, FWO, Belgium; grant no. G098321N) and from the European Union Horizon RIA 2023 project LEAPS (grant no. 101094685). GB acknowledges support from the Internal Funds KU Leuven (Grant No. C14/18/094), from the Research Foundation—Flanders (Fonds voor Wetenschappelijk Onderzoek—Vlaanderen, FWO, Belgium; grant no. G0E1420N) and from the DURABLE EU4Health project 02/2023-01/2027, which is co-funded by the European Union (call EU4H-2021-PJ4; grant no. 101102733). SD and PL acknowledge support from the European Union Horizon 2020 project MOOD (grant agreement no. 874850). PL and MAS acknowledge support from the European Union’s Horizon 2020 research and innovation programme (grant agreement no. 725422—ReservoirDOCS), from the Wellcome Trust through project 206298/Z/17/Z and from the National Institutes of Health grants R01 AI153044, R01 AI162611 and U19 AI135995. PL also acknowledges support from the Research Foundation—Flanders (Fonds voor Wetenschappelijk Onderzoek—Vlaanderen, G0D5117N, and G051322N).
Author information
Authors and Affiliations
Contributions
D.P. conceived the work; P.G.G., S.L.H., N.B., S.D., G.B., M.A.S. and P.L. contributed to data analysis; P.G.G. and D.P. drafted the manuscript. All authors contributed to and approved the final version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Gallego-García, P., Hong, S.L., Bollen, N. et al. International importance and spread of SARS-CoV-2 variants Alpha, Delta, and Omicron BA.1 into Spain. Commun Med 5, 209 (2025). https://doi.org/10.1038/s43856-025-00912-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s43856-025-00912-9