Introduction

Preterm birth (PTB) occurs in 10% of pregnancies, affecting 13.4 million babies. It is the leading cause of mortality in newborns and children under the age of five1,2. Spontaneous preterm birth (sPTB), is preceded by the onset of premature contractions, cervical shortening and foetal membrane rupture, and accounts for over half of PTBs3, with the rest classified as iatrogenic (iPTB) being medically indicated for either maternal or foetal reasons. The causes of PTB are multifactorial, with known risk factors including previous premature birth and previous cervical excisional treatment (LLETZ) for pre-cancerous cells. Cervical mechanical weakness is a contributing factor in women who have had LLETZ. In other cases, the most common causal mechanisms are linked to adverse vaginal microbiota and/or activation of inflammation. This is particularly the case for early sPTB <34 weeks, where highest rates of neonatal morbidity and mortality are seen3,4,5,6. Despite this, common current treatments to prevent sPTB, progesterone and cervical cerclage, have not been shown to improve the vaginal microbiome or have a beneficial effect upon the immune milieu7,8,9.

During pregnancy, high oestrogen levels facilitate a shift towards a stable vaginal microbial composition dominated by Lactobacillus spp10. This creates an acidic environment, which limits polymicrobial diversity. A vaginal microbial composition dominated by Lactobacillus crispatus, often referred to as Community State Type I (CST I)11, is protective against sPTB7,12,13,14, whereas women with L. iners dominance (CST III), or a depletion of Lactobacillus species and polymicrobial composition (CST IV) are at a higher risk7,8,15. The cervicovaginal immune milieu is a key factor driving the process of microbial-associated sPTB. CST III and IV both associate with higher concentrations of cervicovaginal fluid (CVF) cytokine and complement proteins when compared to CST I. Further increases are seen in women who deliver preterm compared to term8,9,15. Despite a clear association between vaginal microbial composition and the host immune response, the factors influencing bacterial composition and individual variations in immune responses and pregnancy outcome are poorly understood.

The histo-blood groups (ABO) are the most widely known glycan structures, with four main types—A, B, AB, and O16. The A, B, and O (H) blood group antigens are presented on Type-1 and Type-2 antennae of O- or N-glycans and Type-3 backbones displayed on O-glycans. The presence of ABO blood group phenotypes is reliant on the synthesis of ABO(H) antigens under the regulation of two genes: FUT1 and ABO. The precursor of the A and B antigens is the H antigen, the production of which is controlled by the FUT1 gene17. FUT1 encodes a specific fucosyltransferase (α1-2FucT) which adds fucose (Fuc) in an α1-2 linkage to the terminal galactose of the precursor. The ABO genes encode glycosyltransferases enzyme that determine the modification of the H antigen and ultimately the production of A and B antigen. The α1-3GalNAcT (or Enzyme A) generates the A epitope by adding a N-acetylgalactosamine (GalNAc) residue to the H antigen; α1-3GalT (or Enzyme B) forms the B blood group by adding a galactose (Gal) residue to the H antigen. Unmodified H antigens correspond to the O blood group phenotype. As well as being produced on red blood cells, ABO(H) antigens are also synthesised by epithelial cells, including those within the vagina and cervix18,19. This starts with the action of the glycosyltransferases (FUT2), which adds an α1-2-linked Fuc to Gal to form the blood group H antigen on Type-1 and Type-2 chains on N- and O-glycans. The glycan structures of the ABH antigens are shown in Supplementary Fig. 1.

ABO(H) antigens are secreted into the bodily fluids of individuals at varying amounts depending on blood group and secretor status20,21. ABO blood group phenotype associates with susceptibility to pathogen-driven disease, since the ABO(H) antigens can act as adhesion molecules and a source of nutrients for bacteria, whereas the presence of anti-A, B and H antibodies can offer protection or immunogenicity22. For example, blood group O individuals are more susceptible to infection by Helicobacter pylori, whereas blood group B individuals are more susceptible to Neisseria gonorrhoea, Escherichia coli, and Group B Streptococcus infection22,23,24, the latter known to be associated with an increased risk of sPTB25,26. The ABO blood group phenotype has also shown differential associations with the gut microbiome structure27,28,29, with preferential binding of certain Lactobacilli strains to the A antigen and gut epithelial surfaces of blood group A individuals28,29.

We hypothesised that blood group A individuals are more likely to have L. crispatus colonisation, conferring protection from sPTB, whereas blood group O and B individuals are at increased risk of vaginal microbial diversity and thus early sPTB. This study reveals an association between ABO blood group phenotype, microbial composition, immune milieu and sPTB risk. These findings provide mechanistic insights with the potential to lead to glycan-directed therapeutics, live biotherapeutics and immune modulators in a targeted individualised approach for preterm birth prevention.

Results

Study population

A total of 85,044 maternities from an inner London population were identified from electronic patient records (iCARE Secure Data Environment) (Fig. 1a). The PTB rate was 8.35%, 58.43% were spontaneous (sPTB) and 41.57% were iatrogenic (iPTB) (Fig. 1b). A second population of 2084 women at increased risk of sPTB were prospectively recruited from Preterm Birth Prevention Clinics (Fig. 1a) and had a PTB rate of 20.68%, 75.87% were spontaneous and 24.13% were iatrogenic, reflecting a population enriched with women at high-risk of sPTB (Fig. 1c). Demographic data according to ABO blood type is available in Supplementary Tables 1 and 2.

Fig. 1: Summary of study design and study population.
figure 1

a Schematic representing study design: The iCARE study population included maternities identified via electronic healthcare records representing general population and the Prematurity Research study population included maternities with a pre-defined risk factor for spontaneous preterm birth (sPTB), a proportion of whom provided samples for vaginal microbial composition, cervicovaginal (CVF) immune mediators and CVF glycan profiles. b Pregnancy outcomes and ABO blood group status were analysed from a general obstetric population (iCARE population) across three inner London Hospitals between April 2014 and x 2023 (85,044 maternities). c Pregnancy outcomes and ABO blood group status were analysed from the Prematurity Research cohort of women defined as at risk of sPTB and/or who participated in the multi-centre VMET-2 study at four inner London hospitals, and Edinburgh (n = 2084).

ABO blood group phenotype and risk of PTB

There was a significant increase in the rate of sPTBs in women of blood group B phenotype compared with blood group A, 5.8% vs 4.9% (p = 0.001); 1.9% vs 1.6% (p = 0.009); 0.8% vs 0.6% (p = 0.006) at the thresholds of gestational age <37, <34 and <28 weeks, respectively, in the general maternity population (Fig. 2a–c). An increase in the need for cervical cerclage intervention was observed in blood group B women compared with blood group A women, 1.4% vs 0.9% (p < 0.001) (Supplementary Table 3). There were no differences in iatrogenic PTB rates across ABO blood groups (Fig. 2d–f).

Fig. 2: Pregnancy outcomes according to ABO blood group.
figure 2

In the iCARE cohort of 74,913 maternities with pregnancy outcomes, the rates of spontaneous preterm birth by ABO group are shown for (a) sPTB < 37 weeks (b) <34 weeks and (c) <28 weeks. No relationship between ABO blood group and iatrogenic preterm birth is seen (d) <37 weeks (e) < 34 weeks and (f) <28 weeks. g Adjusted odds model for sPTB < 37 weeks generated from logistic regression modelling (n = 74,913). Outcome was adjusted for blood group, ethnicity, past pregnancy history and history of previous cervical treatment. There was no improvement to the model with the addition of interaction terms between co-variates. h In women with a pre-defined risk factor, sPTB rates are increased in women with blood group B and O phenotype (n = 1886. all-risk factors), (i) and in blood group B with previous cervical treatment n = 724), j but in women with a history of previous sPTB and MTL an increased risk is seen in women of blood group O phenotype (n = 798). N.B. Previous PTB/MTL population excludes those with concurrent uterine anomaly or previous cervical treatment. Comparisons made using Chi-squared test and adjusted for multiple comparisons. P value significance: *<0.05, **<0.01, ***<0.001, ****<0.0001. Adjusted OR ± 95% CI and p value.

ABO blood group phenotype and established risk factors for sPTB were tested in an adjusted odds model for sPTB < 37 weeks using logistic regression modelling (Fig. 2g). The adjusted odds ratios (OR) were as follows; B blood group phenotype 1.14 (p = 0.006), Asian and Black ethnicity 1.22 (p < 0.001) and 1.27 (p < 0.001) respectively, a history of previous cervical treatment 1.45 (p < 0.001), sPTB 3.39 (p < 0.001), and mid-trimester loss (MTL) 2.17 (p < 0.001). Whilst there were significant differences between the distribution of blood groups and ethnicities, an interaction between these co-variates did not significantly affect the overall model. In the high-risk population, compared to blood group A, women of blood group B had an increase in the risk of sPTB <34 weeks 10.6% vs 6.8% (p = 0.016) (Fig. 2h), sPTB <28 weeks (p = 0.01) and cervical shortening (p = 0.02) (Supplementary Table 4), OR 1.63, 2.01 and 1.35 respectively. In addition, there was a significant increase in the rates of sPTB and MTL in women of blood group O compared to blood group A, 9.3% vs 6.8% (p = 0.037), OR 1.40 and 1.17, respectively (Fig. 2h, Supplementary Table 4).

In women with a history of cervical treatment, higher rates of sPTB (7.4% vs 2.6%, p = 0.02) (Fig. 2i) and cervical shortening (22.1% vs 13.6%, p = 0.02) were seen in blood group B compared to blood group A, however this was not seen in women with a history of sPTB. In women with a history of sPTB, an increase in sPTB rate was seen only in women with blood group O phenotype (13.6% blood group O vs 9.1% in blood group A) (p = 0.04) (Fig. 2j). Collectively this implies that the different ABO blood group phenotypes may contribute to sPTB aetiology by different mechanisms and suggests that blood group A appears to be protective of cervical shortening, the need for intervention, and PTB.

ABO(H) glycans at the cervicovaginal interface

MALDI-TOF MS based glycomic analyses30,31 of CVF from pregnant women confirmed the presence of the H antigen (Fuc-Gal-) in O blood group individuals, the A antigen (GalNAc-(Fuc-)Gal) in blood group A individuals and the B antigen (Gal-(Fuc-)Gal) in blood group B individuals (Fig. 3a). ABO(H) epitopes were mainly expressed on O-glycans and were detected at varying levels on N and O-glycans of different individuals (Fig. 3b). The A antigen showed highest levels of expression, presented on 10-63% of all O-glycans in blood group A donors, while blood group B antigen was only expressed on 0–32% of all O-glycans in blood group B donors (Fig. 3b). It was not possible to determine the relative abundance of the H antigen on the N or O-linked glycans, because of the possibility of isomeric Lewis X/Lewis A antigen structures.

Fig. 3: ABO blood group glycans at the cervicovaginal interface and interaction with common vaginal microbiota.
figure 3

a Representative MALDI-TOF mass spectrum of O-glycans released from CVF of an A blood group and B blood group donors. Assignments are based on composition, tandem MS and knowledge of biosynthetic pathways. All molecular ions are [M+Na]+. b Semi-quantitative analyses of whole spectrum show higher percentages of A and B blood group epitopes expressed on O-glycans than on N-glycans. n = 64. c The highest abundance of the A blood group antigen on glycans isolated from CVF is detected in association with CST-I and term delivery, whereas the presence of the B antigen, which is lower in abundance, is seen across all CSTs, and is predominantly seen in women who have a preterm delivery. Mass range 3000-4000 for N-glycan analysis, whole spectrum for O-glycan analysis. n = 46. d A schema of glycan microarrays constructed to detect glycan binding with fluorescently labelled in-house cultured bacterial isolates of species and strains. e Heatmap showing the relative binding intensities to blood group glycan probes at pH4 and pH7 of fluorescently labelled bacteria. Differential binding of commensals and pathobionts is seen depending on ABH backbone and pH. The average rank of mean fluorescence intensities of quadruplicate spots from three binding assays was calculated and the resulting values coloured as follows: Dark blue (0-10%); Light blue (10-25%); Yellow (25-50%); Orange (50-70%); Red (70-100%). 100%, the maximum binding score observed across pH4 and pH7 for a given strain.

ABO and the microbiome

Differential abundances of A and B glycan expression were seen depending on CST and pregnancy outcome (Fig. 3c). The highest abundance was type A on the O-linked glycans, and this was associated with L. crispatus dominance, and women delivering at term. In contrast, lower abundances of the type B were seen, and most women expressing the B antigen delivered preterm, regardless of CST. Bacterial binding of key bacterial species known to be present in the vagina (Supplementary Table 5) to a set of blood group glycan probes (Supplementary Table 6) was performed using glycan microarrays (Fig. 3d, e). Differential bacterial binding signals to blood group A type 2, B type 1, B type 2, H type 1 and H type 2 glycans were seen for L. crispatus 1398, L. iners 13335, S. agalactiae 776, and G. vaginalis 540 strains at pH4 (Fig. 3e). Reduced binding was observed at pH7, except for the pathogenic species L. iners, S. agalactiae, and G. vaginalis to the H type 1 glycan. However, binding to blood group A on the type 1 backbone was consistently poor across all bacteria from the lower reproductive tract tested under various conditions. Our data show that bacteria from the vaginal microbiota can bind to blood group A, B, H glycans presented on glycan microarrays. These interactions could be part of the mechanism modulating bacterial composition in the female lower reproductive tract.

Vaginal microbial composition was analysed from women at risk of preterm birth in early (timepoint 1) and mid-pregnancy (timepoint 2). Higher proportions of CST I were seen in blood group A and O women compared to blood group B (timepoint 1: vs A p < 0.01, vs. O p < 0.01; timepoint 2: vs A p = 0.03, vs. O p = 0.03), with enrichment for CSTs III and IV seen in blood group B (Fig. 4a, b). When CSTs were compared according to risk factors for sPTB, there was no significant difference between women with cervical treatment and those with a previous PTB or MTL in blood groups A and B at either time point (Fig. 4c, d). In women with blood group O and a previous sPTB or MTL, the proportion of CST I was lower than those with a history of cervical treatment (timepoint 1: 31% vs 68% p < 0.0001, timepoint 2: 28% vs 67% p < 0.0001) (Fig. 4c, d). Additionally, there were higher proportions of CST III and IV in women with previous PTB or MTL compared to previous cervical treatment in women of blood group O (Fig. 4c, d). We concluded that regardless of risk factor, in early pregnancy, women of blood group B are more likely to have adverse vaginal microbial communities (CST III/IV), and women with blood group A are more likely to show L. crispatus dominance. In contrast, women of blood group O maintain a favourable microbial environment in the context of previous cervical treatment, but in those who have experienced a previous PTB or MTL, there is a high chance of L. crispatus depletion.

Fig. 4: Vaginal Microbiome Community State Types (CSTs) according to blood group in women at risk of preterm birth.
figure 4

a Proportions of CST I-V in women defined as being at risk of preterm birth by ABO blood group phenotype at timepoint 1 (12+0–15+6), b and at timepoint 2 (20+0–23+6). c Proportions of CST1-V in women categorised by risk factors of previous cervical treatment (CT) or previous preterm birth of mid-trimester loss (pPTB/ MTL) and by ABO blood group phenotype at timepoint 1 (12+0–15+6) and at (d) timepoint 2 (20+0-23+6). Proportions of CST I-V at timepoint 1 (12 + 0-15 + 6) in women categorised by outcomes of cervical length and pregnancy outcome in women with blood group (e) A, (f) B and (g) O. Comparisons made using Chi-squared and adjusted for multiple comparisons. P value significance: *<0.05, **<0.01, ***<0.001, ****<0.0001.

The association between vaginal microbial composition, blood group and outcomes in the index pregnancy was assessed. In blood group A, there were no differences in microbial composition in women who experienced cervical shortening or sPTB <34 weeks compared to those who did not (Fig. 4e), implying that microbiome state is less likely aetiological factor in these women. In blood group B, there were high proportions of CST III and IV, low proportions of CST I, and no differences based on pregnancy outcome (Fig. 4f), and we concluded that adverse microbial composition alone is not sufficient to cause sPTB in these women. Women of blood group O who experienced cervical shortening and/or sPTB were significantly less likely to be CST I (p < 0.01 and p < 0.05) and had higher proportions of CST IV (Fig. 4g), indicating that adverse host responses to the microbial composition are a key aetiological factor.

ABO and inflammation at the cervicovaginal interface and PTB

IL-1β concentrations were higher in women with CSTs not dominated by L. crispatus in all blood groups (A: p < 0.001, B: p < 0.01, O: p < 0.01) (Fig. 5a). IL-8 concentrations were significantly elevated in non-L. crispatus CSTs in women of blood groups B and O, but not A (B: p < 0.05, O: p < 0.05), demonstrating stronger inflammatory phenotypes in these blood groups, and a differential effect of the A phenotype on IL-1β production compared to IL-8 (Fig. 5b). At timepoint 1, the same responses were seen for IL-1β but to a lesser degree, and there were no significant relationships for IL-8 by CST, supporting an increase in inflammatory response to microbial composition across gestational age (Supplementary Fig. 2a, b). Comparison of cytokine concentrations according to risk factor demonstrated that in women of blood group O, IL-1β and IL-8 were significantly higher in those with a history of previous sPTB or MTL compared with previous cervical treatment (p < 0.001 and p < 0.01 respectively), a relationship not seen in the other blood groups (Fig. 5c, d). This is likely to be reflective of the reduced prevalence of CST I seen in blood O women with previous sPTB or MTL. The same relationships were seen in early pregnancy at timepoint 1 (Supplementary Fig. 2c, d), demonstrating a pro-inflammatory phenotype in blood group O women with previous sPTB or MTL.

Fig. 5: Cervicovaginal Fluid (CVF) cytokine concentrations by ABO blood group phenotype at timepoint 2 (20 + 0-23 + 6).
figure 5

a CVF IL-1\(\beta\) (pg/ml) and community state type (CST) (n = 219). b CVF IL-8 (pg/ml) and CST (n = 219). c CVF IL-8 (pg/ml) and risk factor (n = 180). d CVF IL-8 (pg/ml) and risk factor (n = 180). e CVF IL-1\(\beta\) (pg/ml) and sPTB (iPTB excluded) (n = 202). f CVF IL-8(pg/ml) and sPTB (iPTB excluded) (n = 202). Wilcoxon Signed-Rank Test adjusted for multiple comparisons (p < 0.05*, p < 0.01**, p < 0.001***, p < 0.0001****).

In blood group A, there was no association between IL-1β or IL-8 concentrations and pregnancy outcome, whereas in blood group O, concentrations are significantly higher in those who deliver preterm compared to term (IL-1β p = 0.05, IL-8 p = 0.01) at timepoint 2 (Fig. 5e, f). In blood group B women, there was a higher median concentration of cytokines in women who delivered preterm at timepoint 2, but this was not statistically significant. Only IL-1β in blood group O was significantly higher in women who delivered preterm in early pregnancy at timepoint 1 (p = 0.04) (Supplementary Fig. 2e, f). These data are consistent with local inflammation driving preterm birth, particularly in blood group O women.

In women of blood group A, the rate of sPTB <34 weeks was similar for women regardless of IL-8 and L. crispatus proportion (Fig. 6a). In women of blood group B, sPTB <34 weeks was lowest in those with normal IL-8 who were dominant (13%) or depleted (17%) for L. crispatus, and highest in those with high IL-8 who were dominant for L. crispatus (67%). In blood group O women, the highest rate of sPTB <34 weeks was seen in those who were both high in IL-8 and deplete of L. crispatus (31%), with the number of preterm births in this quadrant being 4 times that of any of the others (Fig. 6a). In blood group O women there was a significant correlation between gestational age at delivery, proportion of L. crispatus and IL-8 and IL-1β (Fig. 6b). In blood group B women, there were significant correlations between gestational age at delivery and IL-8 and IL-1β, but not proportion of L. crispatus. In blood group A women, whilst there were significant correlations between the proportion of L. crispatus and IL-8 and IL-1β, none of these were correlated with gestational age at delivery (Fig. 6b).

Fig. 6: Associations between cervicovaginal immune mediators, vaginal microbiota and ABO blood group at timepoint 2 (20 + 0-23 + 6).
figure 6

a CVF IL-8 – classified as normal or high (according to tertile) by ABO blood group and Lactobacillus Crispatus dominance (defined as 80% relative abundance) with the shading in each quadrant indicating the percentage of sPTB <34 weeks gestational age at is shown in. b Correlation plots for gestational age at delivery, Lactobacillus crispatus proportion, CVF IL-8 and IL-1β concentrations for blood groups A (yellow), B (red) and O (blue) are shown in. (c-e), Canonical Correlation Analysis between 20 dominant bacterial taxa across samples (shown in blue) and CVF immune analytes (shown in red). Objects that are close together are correlated with each other. Distance from the centre represents strength of association. c Blood group A (n = 82). d blood group B (n = 33). e Blood Group O (n = 98). List of 20 dominant taxa: Lactobacillus crispatus, Lactobacillus iners, Gardnerella vaginalis, Lactobacillus gasseri, Lactobacillus jensenii, Bifidobacterium, Atopobium vaginae, Streptococcus, Lactobacillus spp., Prevotella timonensis, Aerococcus, Lachnospiraceae, Prevotella bivia, Prevotella amnii, Anaerococcus, Rothia, Finegoldia, Peptoniphilus, Atopobium rimae, Fusobacterium.

Canonical correlation analysis of the 20 dominant taxa across samples and CVF immune analytes according to blood group showed association between L. crispatus and the anti-inflammatory cytokine IL-10 in blood group A (Fig. 6c). No correlation between diverse bacterial taxa and pro-inflammatory immune analytes was seen in women of blood group B (Fig. 6d), suggesting an alternative driver of this inflammatory response. In blood group O, correlations between diverse taxa pro-inflammatory analytes were observed (Fig. 6e). Women with blood group O delivering preterm are characterised by an adverse vaginal microbiota and high levels of inflammation. In blood group A, the same microbiota does not evoke a strong inflammatory response and thus does not result in early delivery. In blood group B, there is an inflammatory phenotype associated with earlier delivery, but this is not driven by an adverse vaginal microbiota. These results may reflect differing aetiological pathways in women delivering preterm on the basis of blood group.

Discussion

Despite microbial—immune driven sPTB being considered to be the most common causal factor for sPTB, there is a lack of the mechanistic understanding required for the development of new and individualised preventative therapies. This study provides evidence for a role for cervicovaginal ABO(H) glycans and the ABO blood group phenotype in influencing cervicovaginal microbial composition, immune milieu and risk of sPTB. Although ABO glycan binding of microbiota in the context of the vaginal microbiome has not previously been described, reports exist of lactobacilli species, including Lactobacillus crispatus binding strongly to type A glycans, and weakly to type B in the context of gut microbiome health29,32. Our glycan array data showed that L. crispatus and L.iners bind to A type 2 at pH4; however, L. crispatus was the only species able to bind to the A glycan at pH7. In contrast, only the pathogenic species L. iners, S. agalactiae and G.vaginalis showed binding at pH7 and this was to the H type 2 glycan. This difference in A and O(H) glycan-microbe interaction may explain why women with the A blood group phenotype are more likely to have L. crispatus dominance, have lower concentrations of pro-inflammatory cytokines, and are more likely to deliver at term compared to women of the O phenotype. Preterm birth that occurs in blood group A phenotype is therefore more likely to be due to a mechanism unrelated to microbe-induced local inflammation compared to other blood group phenotypes.

This study has demonstrated an increased risk of sPTB in women of blood group B both in a general maternity population, and in a population at-risk of preterm birth. We also saw a clear association with a vaginal composition depleted in L. crispatus, and a vaginal microbiome with high bacterial diversity. We demonstrated B glycan expression on N and O-linked glycans predominantly in women who delivered preterm. Our glycan array data showed the strongest binding of S. agalactiae with a type 1 B glycan probe which may explain why women of blood group B are more likely to be colonised with Group B Streptococcus than women of blood group O and A24. The most striking increase in risk of sPTB in the blood group B phenotype was seen in women who had a history of previous cervical treatment. Cervical treatment is performed for women with cervical intraepithelial neoplasia (CIN). CIN and cervical cancer are caused by oncogenic Human Papilloma Virus (HPV) strains, mainly HPV-16, 18, 31, 33 and 4526. The presence and persistence of HPV, and the development of high-grade CIN/cancer is associated with high diversity and the deviation away from Lactobacillus dominance33,34. The possible co-existence of HPV and an adverse microbiome in women of blood group B may explain the increased risk of cervical shortening and sPTB in women with a history of cervical treatment either through inflammation secondary to microbial composition or mechanical damage following treatment. Local IL-1β and IL-8 concentrations are elevated in CIN, and this inflammation persists at 6-months post-excisional treatment despite disease and HPV clearance35. This may explain why there was a pro-inflammatory response in women of blood group B who delivered preterm, yet no clear association between bacterial taxa and pro-inflammatory mediators. Alternatively, HPV carriage may be driving this inflammation, as between 15-58% of women of childbearing age are HPV positive36. Although we do not have information on HPV carriage in this study population, this will be a focus of our future work.

A and B glycans have been shown to be expressed in the squamous epithelium of the cervix, whereas H antigens are more highly expressed in endocervical epithelium37. There is a loss of the A and H antigen in premalignant and malignant lesions of the cervix38. This differential expression of ABO(H) glycans to favour B glycan expression over A or H, or changes in concentrations of anti-A antibodies may also explain the predominance of adverse pregnancy outcomes in women of blood group B who have had previous cervical treatment for CIN. Our future work will aim to develop an understanding between cervical stroma, ABO blood group secretion and the various sPTB phenotypes, like those who develop cervical shortening in the context of previous excisional treatment to the cervix in women with premalignant lesions.

Our study demonstrates that women with a history of sPTB of blood group O had higher rates of early sPTB compared to women of blood group A and shows a correlation between Lactobacillus spp. depletion and a pro-inflammatory milieu. It is plausible that anti-A and anti-B natural antibodies play a role, as they can be detected in cervical mucus of blood group O, even when absent in serum39. Natural antibodies to A and B are mainly IgM or less frequently IgG and IgA40, and are generally not antigen specific41. They do offer polyreactivity on a wide range of bacteria and viruses42,43, and can be potent inducers of complement activation40,44. In contrast, anti-O(H) is thought not to be influenced by microbial stimulation41. This may explain why women who delivered preterm in blood group O had a clearer association with microbial composition and a pro-inflammatory immune milieu, than women of blood group A and B. Our future work will explore the presence and role of ABO antibodies in CVF.

The effect size of the influence of ABO blood phenotype on sPTB risk is not as strong as other factors such as race or risk factors cervical treatment and prior preterm birth. This implies that the interaction between ABO, microbiome and inflammation is only one element of a sequence of events that lead to preterm birth. One limitation of this study is that we did not have known secretor status for the study population which is influenced by race, and we did not have known concentrations of secreted blood group glycans and their antibodies, which may also impact on the relative risk. Nevertheless, we believe this study provides data to support the potential use of glycan therapies to target host–microbe interactions for preterm birth prevention, especially with the added risk stratification of ABO status. Strategies are in development targeting host–microbe interactions relevant to the reproductive tract. Examples include host glycan mimicry using high-affinity synthetic glycans resembling host glycans recognised by N. gonorrhoeae, and targeting bacterial glycans like E.coli and Group B Streptococcus with monoclonal antibodies and vaccines45.

Our data provide a novel mechanism of a genetically determined predisposition to, and protection against, sPTB in a large UK multi-ethnic population. Collectively, our findings indicate a role of ABO(H) glycans and the ABO phenotype in influencing microbial composition, immune milieu and pregnancy outcome, and serve as a novel predictive and therapeutic target for preterm birth prevention.

Methods

Study design and population

Observational data from 85,044 anonymised maternities from two inner London hospitals in the UK, Queen Charlottes and Chelsea Hospital and St Mary’s Hospital, Imperial College Healthcare NHS Trust, between 2014 and 2023. These women were identified using electronic healthcare records and anonymised data was analysed using electronic records via Imperial Clinical Analytics Research and Evaluation (iCARE) environment (Fig. 1a, b).

Data on women who were defined as being ‘at-risk’ of preterm birth was collected from women participating in the Vaginal Microbiome and Metabonome in Pregnancy (VMET-2) Research study. A total of 2084 women were prospectively recruited from Preterm Birth Prevention Clinics across five participating sites (Queen Charlottes and Chelsea Hospital, St Mary’s Hospital, Chelsea and Westminster Hospital, and University College Hospital (all in London), and the Royal Infirmary Edinburgh Hospital) between 2014 and 2023 (Fig. 1a–c). Complete metadata was collected from MedSciNet© (CSAM, Oslo, Norway) and electronic patient records (Cerner, EPR Solutions, North Kansas City, MO, USA). Women were defined as being at risk of preterm birth if they had a history of cervical excisional treatment for pre-cancerous cells, a previous preterm birth or mid-trimester loss, or other risk factors such as previous caesarean section at full cervical dilation or congenital uterine anomaly. The most common risk factors of women seen in preterm birth prevention clinics are previous excisional cervical treatment, previous spontaneous preterm birth < 34 weeks, including mid-trimester loss < 24 weeks of pregnancy. Of this high-risk population, vaginal microbiome sampling was performed in 596 women, and both microbiome and immune mediator analyses were performed in 314 women. A subset of 42 of women donated Softcup® samples that were used for glycomics analyses.

Outcomes

The primary outcome was sPTB <37 weeks of gestational age. PTB was defined as either sPTB either following spontaneous onset of labour with cervical dilatation, or prelabour premature rupture of membranes (PPROM), or iPTB following a delivery initiated by care providers in response to maternal or foetal complications of pregnancy. Secondary outcomes were sPTB <34 weeks of gestational age, cervical length (CL) as measured by transvaginal ultrasound scan (TVUS). ABO blood group and information on self-reported ethnicity were obtained from electronic records.

Sample collection

In the high-risk population, women attending the preterm birth prevention clinics were given the opportunity to be recruited to the VMET-2 study. Consenting women provided vaginal swabs at timepoint 1 (12 + 0 – 15 + 6 weeks) and at timepoint 2 (20 + 0 - 23 + 6 weeks) unless delivery occurred before. Swabs were inserted into the mid-high region of the vaginal canal prior to performing an ultrasound scan. After collection, the BBL™ culture swab™ MaxV Liquid Amies swabs (Becton Dickinson, Oxford, UK) were placed on ice immediately and stored at –80 °C. Cervical length was measured at each visit by transvaginal ultrasound as part of routine care. Metadata including ABO blood group, maternal age, BMI, self-reported ethnicity, past obstetric history and PTB prevention interventions, were collected at each visit and stored securely. Women were followed up to determine pregnancy outcome. In a subset, CVF was collected using a disposable menstrual disc (Softcup®) that was placed past the vaginal canal for 20 min. After its removal, contents were resuspended in a 1:5 weight:volume ratio with sterile phosphate buffer saline (PBS) and stored at −80 °C until use.

Microbiome composition

Swabs containing cervicovaginal fluid (CVF) were thawed and supernatant was retrieved by pressurising the sponge of the swab with a sterile syringe in preparation for DNA extraction, and confirmation of DNA integrity by polymerase chain reaction (PCR) amplification was performed as previously described46. The V1-V2 hypervariable regions of the 16S rRNA genes were amplified for sequencing, using forward and reverse primers. The forward primer set (28f-/YM) consisted of a mixture of the following primers mixed to a 4:1:1:1 ratio; 28F-Borrellia GAGTTTGATCCTGGCTTAG; 28F-Chlorlex GAATTTGATCTTGGTTCAG; 28F-Bifido GGGTTCGATTCTGGCTCAG; 28FYM GAGTTTGATCNTGGCTCAG. This mixed formulation of the 27 F forward primer (27F-YM) has been shown to maintain the rRNA gene ratio of Lactobacillus spp. to Gardnerella47. The reverse primer consisted of 388 R GCTGCCTCCCGTAGGAGT. Sequencing was then performed using an Illumina MiSeq platform, at RTL Genomics (Lubbock, TX, USA). Sequence data was then analysed, aligned and classified to operational taxonomic units (OTUs) (phylum to genus level) using Ribosomal Database Project reference sequence files and species-level taxonomies using USEARCH; or to amplicon sequence variant (ASV)- level using the QIIME2 bioinformatics pipeline (version 2022.2.1)48 with taxonomic assignments mapped to the STIRRUPS database49. Community state type (CST) classifications were generated from counts summed to species-level or the next highest taxa-level as per naming conventions required by the Vaginal Community State Type Nearest Centroid Classifier (VALENCIA) algorithm50. Based on VALENCIA CST assignments, samples were grouped into five main CSTs: 1) CST I (L. crispatus dominated); 2) CST II (L. gasseri dominated); 3) CST III (L. iners dominated); 4) CST IV (Gardnerella vaginalis dominated); and 5) CST V (L. jensenii dominated). For canonical correlation analysis the 20 most abundant taxa were identified across all samples using total read counts for all taxons.

Immune profiling

CVF was extracted from thawed swabs by pressurising the sponge of the swab with a sterile syringe until dry and centrifuged at 4500g for 10 min at 4 °C to collect the supernatant, which was stored at −20 °C. Cytokines and complement proteins were quantified using Luminex ® immunoassays following manufacturer’s instructions, including the concentration of cytokines IL-1β, IL-6, IL-10 (Luminex Human Discovery Assay 3-plex, neat), IL-8 (Luminex Human Discovery Assay 1-plex, 1:10 dilution using Calibrator Diluent RD6-52) (R&D Systems) and complement proteins MBL, C5 C5a (Human Complement Magnetic Bead Panel 1, HCMP1MAG-19K) and C3b (Human Complement Magnetic Bead Panel 2, HCMP2MAG-19K (Milliplex®) and results expressed as nanograms or picograms/ml. All samples were analysed using a 96-well plate, acquired in a MagPix®analyser/Bio-Plex 200 system and analysed with xPONENT ® software. Samples were run in duplicates, with inter and intraplate controls.

ABO glycan profiles

For analyses of glycan profiles, CVF was collected using a menstrual cup (Softcup®) by placing it against the cervix for 20 min. After removal, material was retrieved by repeated pipetting of phosphate buffer saline (PBS) to resuspend at a 1:5 weight: volume ratio within 30 min of collection and stored at −80 °C. Glycomic sample processing was done following the protocol detailed previously30. Briefly, CVF samples were sonicated in 25 mM Tris, 150 mM NaCl, 5 mM EDTA, and 1% CHAPS, pH 7.4, dialysed in dialysis cassettes, reduced by DTT, carboxymethylated by IAA, and digested by trypsin. N-glycans were released by PNGase F, separated from O-glycopeptides by C18 Sep-Pak chromatography and were then permethylated following the NaOH procedure51. O-glycans were released from glycopeptides through reductive elimination: four hundred microliters of 0.1 M KOH containing potassium borohydride (54 mg/mL) was added to dried samples and incubated at 45 °C for 14–16 h. The reaction was terminated by adding a few drops of 5% (v/v) acetic acid followed by purification with Dowex 1-X8 desalting column. Excess borates in the samples were subsequently removed by co-evaporating with 10% (v/v) acetic acid in methanol under a stream of nitrogen at room temperature. The purified native O-glycans were then permethylated following the NaOH procedure. The methylated glycans were dissolved in 10 µl methanol. An aliquot of 1 µl of sample was mixed with 1 µl of 10 mg/ml DABP matrix in 75% ACN. The mixture was spotted on a MALDI plate for MALDI-TOF-MS and MS/MS analysis. Data were analysed using Data ExplorerTM version 4.6 from AB Sciex, Glycoworkbench72 and MALDIquant73. The glycomic data were annotated based on monosaccharide composition derived from the molecular ion m/z value, knowledge of glycan biosynthetic pathways, the isotopic peak cluster patterns, the glycosylation patterns in the low and medium mass range, and MS/MS fragmentation.

Bacterial culture and preparation for glycan array binding studies

Bacterial isolates were streaked onto their corresponding agar (Supplementary Table 5) and incubated anaerobically (37 °C; gas composition 10% CO2, 10% hydrogen, 80% nitrogen; 70% humidity). The culture was used to inoculate 10 ml of degassed broth (Supplementary Table 5) which was incubated anaerobically overnight. The overnight culture was then used to inoculate fresh degassed broth which was incubated anaerobically until the optical density (600 nm) reached 1 (±0.05).

All bacteria were prepared live and fluorescently labelled. Aliquots (2 ml) of culture were centrifuged at 5000 g for 10 min and the pellet was washed twice using cold, degassed Hanks Balanced Salt Solution (HBSS; Gibco). The culture was then stained by incubating the cells in 10 μM CellTrace™ Far Red (Molecular Probes) in PBS for 1 h on a rotating wheel protected from light. The culture was washed twice using HBSS (as above). The prepared culture was resuspended in HBSS and kept on ice and protected from light until further use.

Preparation and analysis of AEAB-terminating glycan probes

AEAB-terminating probes were prepared from reducing glycans by reductive amination. Neutral blood group terminating glycans from glycan probes #3158, #3159, #3161, #3162, #3330 and #3164 were purchased from Elicityl (Crolles, France) (Supplementary Table 6). Preparation of the fluorescent reagent AEAB and conjugation with reducing sugars by reductive amination were performed as previously described52. For purification of the conjugation product, a Sep-Pak Aminopropyl cartridge (Waters, Wilmslow, England) was used with elution by acetonitrile /H2O for the neutral sugars and acetonitrile /0.05 M ammonium acetate for the sialylated sugars. Further purification was carried out by HPLC using Amide column (XBridge, Waters). The gradient elution was acetonitrile /H2O at a flow rate of 1 ml/min with detection by UV 330 nm.

The micro-scale dot assay on TLC plates was performed as described elsewhere53. Briefly, hexose-containing probes were dissolved in H2O (approx. 0.5–1.0 mg/ml), before 1 μl was deposited on silica gel TLC plate. Standard solutions (Gal in H2O) were also deposited alongside the sample. The plate was sprayed with orcinol/H2SO4 reagent before heating in an oven at 105 oC to develop colour. Quantitation was carried out by scanning at 550 nm on a CAMAG TLC scanner (Omicron Research, Hungerford, England).

MS analysis of most amino-terminating probes was carried out by electrospray mass spectrometer on a Waters (Manchester, UK) Q-TOF-type mass spectrometer SYNAPT-G2 in either positive- or negative ion mode. Cone voltage was at 50 eV or 80 eV and capillary voltage at 3 kV. Source temperature was at 80 oC and the desolvation temperature at 150 oC. A scan rate of 1.5 sec/scan was used and the acquired spectra were summed for presentation. For analysis, glycan probes were dissolved in H2O typically at a concentration of 10-20 pmol/μl, of which 1 μl was injected via a HPLC injector. Solvent acetonitrile/2 mM ammonium bicarbonate 1:1 was delivered by a HPLC pump (Waters) at a flow rate of 10 μl/min. Selected probes were also analysed by MALDI-MS on an Axima MALDI Resonance mass spectrometer with a QIT-TOF configuration (Shimadzu). A nitrogen laser was used to irradiate samples at 337 nm, with an average of 100-200 shots accumulated. A matrix solution (0.5 μl) of 2,5-dihydroxybenzoic acid (20 mg/ml) in a mixture of methanol/water (1:1) was deposited on the sample target before application of the sample solution (0.5 μl).

Glycan microarray preparation and analyses

The generation of covalent sequence-defined glycan arrays on NHS-functionalised glass slides (Nexterion H) was performed essentially as described previously54,55. Two oligosaccharide subarrays were generated. Details of the preparation of the microarrays and the methods used for binding assays and data analysis are in accordance with MIRAGE (Minimum Information Required for A Glycomics Experiment) guidelines for reporting of glycan microarray-based data (Supplementary Information)56.

Microarray analyses were performed essentially as previously described57 at ambient temperature. No blocking was used for overlay assays on covalent glycan array glass slides. Details of the antibodies used for glycan microarray quality control are summarised in the MIRAGE document in Supplementary Information.

Microarray analyses of fluorescently labelled bacteria were performed as follows. Fluorescently labelled bacterial cultures at OD = 1 were resuspended in binding buffer (10 mM Hepes at pH 7.5 or pH 4, 150 mM NaCl, and 5 mM CaCl2) and were used for overlays on glycan microarrays. 100 µL of bacterial suspension was applied to the incubation chamber with the microarrays, incubated for 1 h at room temperature under mild agitation on an oscillating platform and washed three times with binding buffer followed by two washes with HPLC grade water to remove salts from the array. Slides were dried under a mild nitrogen flow and scanned for quantitation as described below.

Images of fluorescently bound antibodies and bacteria were acquired with a GenePix 4300 A scanner from Molecular Devices, quantified using GenePix software and analysed with CarbArryART58. In glycan microarray binding analyses each assay was performed with four technical replicates and mean (MFI) and standard deviation (SD) values of quadruplicates data can be found in Supplementary Data 1. Heatmaps of processed data were generated using excel.

Statistical analyses

Statistical analyses were performed using R (version 4.2.3)59. Summary statistics and demographics were derived and stratified according to ABO blood group phenotype, risk factor for sPTB and self-reported ethnicity. Statistical significance was assumed at p < 0.05. Univariable comparisons of categorical data were completed using Chi-square test or Fisher’s exact test. Continuous data were assessed for normality using Kolmorgorov–Smirnov tests and presented as mean (±standard deviation) or median (±inter-quartile range). Either t-tests or Mann–Whitney U tests were used for comparisons as appropriate. All tests and p values were adjusted for multiple comparisons. Logistical regression analysis was completed with sPTB <37 weeks as the outcome variable, with ABO blood group, self-reported ethnicity, history of previous PTB or MTL or history of cervical treatment included as dependent variables.

Ethics and reporting

Data from the anonymised general maternity population from iCARE was approved by the NIHR Imperial BRC Data Access and Prioritisation Committee (under HRA database approval (20HH6447)). The study of women at risk of preterm birth was approved by the National Health Service, National Research Ethics Committee in Stanmore, London (REC 14/LO/03238), and these participants provided written informed consent. This research was performed in accordance with the Declaration of Helsinki. Inclusion criteria were previous excisional cervical treatment, previous sPTB or mid-trimester loss, fully dilated caesarean section or a history of uterine anomaly. Exclusion criteria included women under the age of 18, women with HIV/ Hepatitis B/C, vaginal intercourse or bleeding within 72 h of sample collection.