Introduction

Preeclampsia (PE) is a hypertensive pregnancy complication, characterized by new-onset hypertension and proteinuria or other related symptoms after 20 weeks of gestation1. PE affects approximately 5% of all pregnancies, with higher prevalences in developing countries1,2. This complex disorder mostly affects first-time mothers and is influenced by several maternal risk factors including obesity, cardiovascular disease, or systemic lupus erythematosus (SLE)3. Approximately 70% of PE develops after 34 weeks of gestation, defined as late-onset PE4.

Although only the delivery of the placenta cures PE, its underlying placental etiology remains incompletely understood. PE involves pre-clinical and clinical stages5. First, poor placentation and/or placental malperfusion causes syncytiotrophoblast stress, leading to the release of pro-inflammatory and anti-angiogenic agents into the maternal circulation3,6. Second, placental stress promotes generalized vascular inflammation and new-onset hypertension and proteinuria or other signs of end-organ dysfunction in the mother5. While early-onset PE is linked to poor placentation and fetal growth restriction, late-onset PE is more related to maternal risk factors.

Prediction of PE before the onset of clinical syndrome could improve maternal and fetal outcomes by enabling timely interventions, reducing complications, and lowering healthcare costs7. Current risk assessment involves maternal history and combinations of biomarkers such as mean arterial blood pressure, uterine artery pulsatility index, and placental growth factor (PlGF)8,9 These are, however, only useful in predicting early-onset PE. To discover potential biomarkers for presymptomatic prediction of late-onset PE, we studied gene expression profiles of maternal peripheral blood samples taken at 20–24 weeks of gestation. We found that several inflammatory and immunological pathways were associated with late-onset PE, weeks before the onset of clinical syndrome. These included the JAK-STAT signaling, leukocyte transendothelial migration, SLE, and graft versus host disease (GVHD) pathways. Moreover, we identified 12 candidate second-trimester biomarkers to predict late-onset PE.

Materials and methods

Sample collection and clinical characteristics

Samples were collected as part of the InTraUterine sampling in early pregnancy (ITU) cohort study at the Helsinki University Hospital, Finland. Data on cohort collection and sampling have been described10. The study protocol was approved by the Coordinating Ethics Committee of the Helsinki and Uusimaa Hospital District (269/13/03/00/09). Register data had been merged with approval from the register authority (THL/887/14.02.00/2021). All participants gave their signed informed consent during the recruitment. All methods were carried out in accordance with relevant guidelines and regulations.

Peripheral blood samples were collected between gestational weeks 20 and 24. A total of 64 samples, including 12 PE and 52 normotensive pregnancies (NP) were selected. All women with PE had late-onset disease ≥ 34 weeks, diagnosed using the most recent criteria11. The following maternal demographic and clinical data were recorded: maternal age at delivery, parity, hypertension, diabetes, pre-pregnancy body mass index (BMI), obesity status (BMI ≥ 30 kg/m2 denoting obesity), gestational age at birth, PE status (PE or NP), sex of the baby, birth weight, and placental weight. Although diabetes and obesity status were recorded, they were not included in the analyses because none of the normotensive women were obese or had diabetes. Clinical characteristics were tabled using Table 1 package (version 1.4.3)12 in R (version 4.2.2)13. Group comparisons were performed using t-test for parametric (p > 0.05, Shapiro-Wilk test), Mann-Whitney U test for non-parametric (p ≤ 0.05), and Chi-square test for categorical variables.

Table 1 Clinical characteristics of the study population. Shown are the clinical characteristics of 49 pregnant women.

STRT library preparation and the sequencing

RNA was extracted from maternal peripheral blood samples, preserved in PAXgene Blood RNA Tubes (QIAGEN), using PAXgene Blood RNA kit (QIAGEN). A modified 5’-end RNA-seq method, STRT, with 8-bp unique molecular identifiers (UMIs)14 was applied by combining GlobinLock technique15. For the sample identification, 6-bp barcode sequences were included in the primers. Two STRT libraries, each having 48 samples and water as non-template controls (NTC), were sequenced with Illumina NextSeq 500 System, High Output (75 cycles, single-end). The sample layout was balanced for PE and NP status between the two libraries. In total, 97 spike-ins from the ERCC RNA Spike-in mix were tested.

Preprocessing and the quality control of the STRT libraries

STRTN pipeline (https://github.com/gyazgeldi/STRTN, commit e16a6d1)16 was employed for this step. The raw data were aligned to the hg38 reference genome with GENCODE v43 basic annotation (wgEncodeGencodeBasicV43), which was downloaded from the UCSC Genome Browser (https://genome.ucsc.edu/)17; then a count matrix for protein-coding genes, and several quality control metrics were obtained. Outlier samples were assessed based on quality metrics, including mapped reads, spike-in reads, spike-in 5’-end rate, mapped rate, mapped/spike-in ratio, and coding 5’-end rate, and identified as values outside the interquartile range-based threshold range (values below than Q1–1.5×IQR or above than Q3 + 1.5×IQR) in boxplots.

Reduction of technical bias and noise in the gene expression profiles

Genes with counts below five were filtered out, and library specific biases between the two STRT libraries were corrected using library_bias_correction function from NBGLM-LBC (Negative Binomial Generalized Linear Model – Library Bias Correction) package18 in R (version 4.2.2). Depth files were generated from BAM files using samtools19, followed by processing and classification by library. Thereafter, feature selection test20 was applied to identify fluctuating (highly variable or informative) genes to reduce technical noise. This test calculated the squared coefficient of variation for each gene and spike-ins in the library bias corrected and spike-in normalized count matrix. Technical variation was evaluated by analyzing the fluctuations in spike-in RNA levels across the cells, allowing for the comparison of each gene’s expression changes with predicted variations from the spike-in controls. Genes with adjusted p-values < 0.05 were considered fluctuating, indicating a higher gene-to-spike-in ratio in the squared coefficient of variation. These genes were kept, while those with adjusted p-values > 0.05 were filtered out. All spike-ins were kept without considering fluctuation as minor fluctuations reflected technical noise rather than biology.

Dimensionality reduction and visualization of gene expression profiles

Principal Component Analysis (PCA) was applied for dimensionality reduction and clustering and Uniform Manifold Approximation and Projection (UMAP) was used for two-dimensional visualization of the library bias corrected, fluctuating genes expression data using the Seurat package (version 5.0.1)21 in R, followed by spike-in normalization with offset and log transformation. PCAtools package (version 2.10.0)22 in R was then used, by removing the lower 20% of variables that explained minimal variance, to extract the most informative features and visualize the Pearson’s correlations between principal components (PCs) and clinical characteristics using eigencorplot and pairplots.

Identification of differentially expressed genes

Differential gene expression analysis (DGE) analysis was performed using the DESeq2 package (version 1.38.3)23. The analysis was based on the library bias corrected count matrix. Genes with fewer than 5 counts across the smallest group size (n = 10) were filtered out prior to analysis. Spike-in normalization was applied using all 88 non-zero expressed spike-ins as control genes in the DESeq2 function estimateSizeFactors(), where a logical vector was provided to specify the spike-ins. The design formula was ~ Condition (PE vs. NP). The Wald test was used, and p-values were adjusted for multiple testing using Benjamini-Hochberg method (α = 0.05). Log2 fold-change (log2FC) values were then used to show how the genes in the enriched pathways change between PE and NP, using the Pathview package (version 1.38.0)24.

Pathway and cell type enrichment analysis

Gene set enrichment analysis (GSEA)25 was performed using the curated pathway collection CP: KEGG_LEGACY26,27,28,29 to identify significant pathways associated with PE. Using the fgsea package (version 1.24.0)30 in R, genes were ranked based on PC loadings, reflecting their contributions to each principal component (PC). Pathways enriched at the top or bottom of the ranked gene list were identified using normalized enrichment scores (NES) degree, and expression trend. NES values were then calculated to determine the association between gene sets and PCs. Leading genes, which are the most influential in the biological processes and contribute to the observed enrichment signal, were also detected to highlight key genes associated with PE or distinguishing PE samples. For each enriched pathway, spike-in normalized gene expression values of member genes were transformed into Z-score, scaled, and visualized as heatmaps with ComplexHeatmap package (version 2.14.0)31. Additionally, GSEA was performed using only the fluctuating genes to identify enriched cell types in the PE samples compared to NP samples. For this, a custom gene set was prepared using the human blood cells GSE149938 dataset32. This dataset originally contained 32 cell types. However, we focused on circulating blood cells and excluded hematopoietic stem and progenitor cells, including hematopoietic stem cells, granulocyte-monocyte progenitors, B cell and natural killer cell progenitors, multi-lymphoid progenitors, lymphoid-primed multipotent progenitors, megakaryocyte-erythroid progenitors, common myeloid progenitors, and multipotent progenitors. To identify cell-type-specific markers, differentially expressed genes (log2FC > 2.32, padj < 0.05) were detected using Seurat (version 5.0.1). Marker genes were then compiled into a GMT file for GSEA, enabling enrichment analysis of the immune cell types associated with PE.

Prediction of PE status using logistic regression analysis

Spike-in normalized fluctuated data and sample metadata were combined, resulting in 6,982 variables. Logistic regression models were then fitted to predict PE status for each variable. Key metrics, including Area Under the Curve (AUC), sensitivity, specificity, and accuracy along with their confidence intervals at the fixed 10% false-positive rate, were calculated using pROC (version 1.18.5)33 and ROCR (version 1.0.11)34 packages in R. Confidence intervals (CI) were calculated at the 95% confidence level. Statistical significance of each ROC curve was evaluated by testing whether the AUC was greater than 0.5, with p-values calculated using the Mann–Whitney U test, which is equivalent to the DeLong test. Results were filtered to highlight variables with sensitivity above 0.65, focusing on those with the highest diagnostic potential. The gene expression levels of these highlighted variables were visualized in PE and NP using violin plots with the VlnPlot function from the Seurat package. Differences in gene expression between these two conditions were assessed using the two-sided Wilcoxon test.

Results

An overview of the study design is shown in Fig. 1, and demographics of the study population in Table 1.

Fig. 1
figure 1

Flow chart of the study.

For this study, we specifically selected late PE cases and uneventful pregnancies, for which we had biological samples available and taken at the same gestational age. While the overall prevalence of gestational hypertension was 7% in the whole ITU cohort10, our cohort included 12 (19%) PE cases among the total of 64 women. Mean maternal ages at delivery, parity, and pre-pregnancy BMIs in early pregnancy were similar between the groups. However, 3 (30%) of PE women were obese while all controls had normal weight. Moreover, PE women delivered at earlier gestational age than the NP group, and as expected, their neonates had lower mean birth weights35. One patient with PE delivered at 35 weeks, while all other deliveries occurred at or after 37 weeks. After the quality control (Supplementary Fig. 1), two PE and 13 NP samples were excluded as outliers, leaving 49 (10 PE and 39 NP) samples from two unbiased libraries (Supplementary Fig. 1) for downstream analyses.

Immune activity and inflammation pathways associated with late-onset PE at the presymptomatic phase

To identify transcriptomic alterations and pathway-level changes associated with late-onset PE at the presymptomatic phase, we started with the initial dataset detecting signals for 19,670 genes, including 97 spike-ins. After filtering steps, 12,670 genes and 91 spike-ins remained in the analysis. After the library bias correction and feature selection, we obtained the final dataset of 6,804 genes, including the spike-ins, for downstream analyses.

Conventional DEG analysis did not reveal differentially expressed genes between PE and NP women (adjusted p-values were ~ 0.9). Using standard DESeq2 settings with Benjamini-Hochberg correction (α = 0.05), 771 genes showed nominal significance (p < 0.05), but none remained significant after multiple testing adjustment. Since no clear signals were detected at the single-gene level, we therefore applied a PCA-GSEA approach to improve sensitivity36. This approach reduces the effect of individual variation by grouping correlated genes into principal components and then testing for enrichment at the pathway level, rather than focusing on single-genes differences as done in DEG. Using this approach, we identified significant associations between clinical characteristics and four PCs that strongly correlated with PE (Fig. 2a). Of them, PC1 explained 51.53% of the total variation and was negatively correlated with PE (Pearson’s correlation coefficient: −0.25). PC6, PC8, and PC18 explained 0.95 to 1.95% of the variation and were positively correlated with PE (Pearson’s correlation coefficients ≥ 0.25) (Fig. 2a). PE and NP samples were only partially separated by the PCs: PE samples were in the negative area of PC1 and positive area of other PCs (Fig. 2b).

Fig. 2
figure 2

Transcriptomic alterations in second-trimester maternal blood samples from women with late-onset preeclampsia during the presymptomatic phase. (a) PCA-correlation plot demonstrates that PC1, PC6, PC8, and PC18 are correlated (Pearson’s r ≥ 0.25) with preeclampsia (PE). Percentages of variation that each component represents are shown at the bottom line. (b) Biplot shows that PC1, PC6, PC8, and PC18 only partially distinguish PE and normotensive (NP) samples. PE samples are shown in red and NP in blue.

Using the KEGG Legacy dataset, we found that the genes highly contributing to the PE-correlated PC1, PC6, and PC18 were significantly enriched in several pathways (Table 2), whereas no pathways were enriched for PC8. For PC1, the leukocyte transendothelial migration pathway, JAK-STAT signaling pathway, and Vibrio cholerae infection pathway were significantly negatively enriched (Table 2). As these pathways contributed negatively to PC1, and PC1 was negatively correlated with PE (Fig. 2a), these pathway genes were upregulated in PE (Fig. 3a-b, Supplementary Figs. 2–4). As for PC6, being positively correlated with PE, the antigen processing and presentation pathway, and asthma pathway were negatively enriched (Table 2), indicating that their member genes were downregulated in PE (Supplementary Figs. 5 and 6). As for PC18, being also positively correlated with PE, the SLE pathway, GVHD and antigen processing and presentation pathways were positively enriched (Table 2), indicating that their member genes were upregulated in PE (Fig. 3c-d, Supplementary Figs. 7–9). Interestingly, the antigen processing and presentation pathway was negatively enriched in PC6 and positively enriched in PC18, reflecting variations captured by each PC and the influence of other pathway-specific genes, detailed in Fig. 3 and Supplementary Figs. 5 and 7.

Table 2 Pathways associated with transcriptomic alterations in second-trimester maternal blood samples before late-onset preeclampsia. Shown are enriched pathways with KEGG pathway ID, padj < 0.05, and their corresponding normalized enrichment scores.
Fig. 3
figure 3

Heatmaps showing differential second-trimester pathway activation in women with late-onset preeclampsia and normotensive women. Shown are the major upregulated pathways in preeclampsia (PE) related to (a) leukocyte transendothelial migration, (b) JAK-STAT signaling, (c) systemic lupus erythematosus (SLE) and (d) graft versus host disease pathways. Columns represent samples (ordered by rotated values in each principal component, PC), and rows represent pathway genes (ordered by loading values in each PC), with the left indicating leading genes. Red color indicates upregulation, and blue indicates downregulation, normalized across preeclampsia (red) and normotensive pregnancies (blue).

High-AUC biomarkers for late-onset PE prediction suggesting intermediate monocytes enrichment in maternal second-trimester blood RNA profiles

After observing pathway alterations associated with late-onset PE in the second-trimester maternal blood samples, we searched for potential marker genes for PE prediction. We performed logistic regression analysis followed by ROC analysis. We found that several genes distinguished PE and NP with AUC values of 0.74 to 0.87 (Table 3), as visualized in Fig. 4a and Supplementary Fig. 10. Their expression levels in PE and NP samples are shown in Fig. 4b and Supplementary Fig. 10. As shown in these figures, the identified biomarkers exhibited higher expression in NP compared with PE samples. Despite the high AUC and sensitivity values, the small sample size resulted in wide confidence intervals for sensitivity estimates (Table 3). Moreover, the genes identified through the ROC analysis, with a sensitivity threshold set at 0.65, were not directly involved in the pathways detected by the PCA analysis (Table 3).

Table 3 Performances of 12 candidate genes in disease prediction in maternal second-trimester blood samples of women with late-onset preeclampsia and normotensive women. Shown are area under curve (AUC) values, sensitivity, accuracy, and their 95% confidence intervals (CI) at the fixed 10% false-positive rate for the top biomarkers.
Fig. 4
figure 4

Performances of three candidate genes for late-onset preeclampsia prediction. (a) ROC curves show the performance of selected biomarkers in distinguishing preeclampsia (PE) and normotensive pregnancies (NP) with false positive rate (FPR) on the x-axis and true positive rate (TPR) on the y-axis. The area under curve (AUC) values and corresponding p-values (testing whether AUC > 0.5) provide a measure of the diagnostic power of each biomarker. (b) Violin plots showing variation, with p-value from the two-sided Wilcoxon test shown above, in gene expression of the same biomarkers between PE and NP. Horizontal lines indicate median expression.

Table 4 Genes associated with different cell populations for late-onset PE prediction. Shown are enriched cell type signatures with padj < 0.05 and their corresponding normalized enrichment scores.

Since the candidate markers (Table 3) were not part of known pathways, we further tested whether they were associated with different blood cell populations. Specifically, we analyzed a total of 24 circulating blood cell types, ranking biomarkers separately based on AUC to assess their ability to distinguish PE from NP, and on sensitivity to assess their ability to detect PE. We found that the sensitivity-based ranking did not identify any significantly enriched cell types associated with PE samples. In contrast, the AUC-based ranking identified significant enrichment in human blood cell marker genes for erythrocytes, progenitor neutrophils, myelocytes, intermediate monocytes, metamyelocytes, mature neutrophils, and regulatory B cells (Table 4). Among them, intermediate monocytes had a positive NES, indicating that these cell-type marker genes were most effective in distinguishing PE samples. Conversely, other cell types had negative NES values, suggesting a lower ability to distinguish PE from NP.

Discussion

Tools for identifying late-onset PE at the presymptomatic phase, before the onset of maternal syndrome, are currently lacking. To address this challenge, we utilized maternal second-trimester blood samples to explore early alterations in RNA expression profiles before late-onset PE. We found that subtle changes in RNA expression profiles arise weeks before the onset of the clinical syndrome. Notably, PE-associated changes were related to leukocytic immune surveillance and maternal immunity homeostasis. Based on these observations, we found 12 potential candidate genes for PE prediction.

In our study, second-trimester maternal RNA expression profiles revealed alterations in several pathways among women with subsequent development of late-onset PE. These included upregulation of genes of the leukocyte transendothelial migration pathway that regulates immune surveillance through the movement of immune cells across the endothelium37. This pathway has previously been linked to immune activation during pregnancy, and additionally, to infection-related and immunological PE subtypes in placental transcriptome36. Upregulation of leukocyte transendothelial migration pathway is in agreement with prior findings that indicate maternal endothelial and leukocyte activation in PE38. To support the biological role of this pathway in PE, leukocyte migration genes are even candidate predictors of gestational age in the peripheral blood of normal pregnancies39. Similarly, genes in the JAK/STAT pathway, a critical mediator of cytokine and growth factor signaling, were upregulated in late-onset PE, further supporting alterations in immune responses or homeostasiss40. Thus, there are two possible interpretations: one involves the activation of maternal inflammatory responses already at the presymptomatic phase of PE, and the other suggests primary maternal immune dysregulation in pregnancies predisposed to subsequent development of PE. To support the role of primary immune aberration, genes in the SLE pathway were upregulated among women with subsequent PE, even if none of the participants had SLE. Among other connective tissue diseases, SLE is a chronic inflammatory disease with altered immune homeostasis41, and a considerable risk factor of PE42. Interestingly, one of the leading genes of the SLE pathway, cathepsin G (CTSG), shows upregulation in normal pregnancies versus non-pregnant controls43 of first-trimester maternal plasma, and contributes to the processing of antigens and autoantigens44, proposing its primary role during human pregnancy. As a potential sign of immune rejection, genes in the GVHD pathway were among those upregulated in PE, supporting the role of enhanced maternal immune responses in the PE pathogenesis45.

The candidate biomarkers that we identified have not been previously reported in association with PE. Among the top genes, some are directly linked to PE or placental processes. PI4KB (Phosphatidylinositol 4-Kinase Beta) is a lipid metabolism enzyme involved in vascular morphogenesis through the control of fibronectin secretion in endothelial cells46. As vascular remodeling is essential for placentation, and maternal endothelial dysfunction is a hallmark of PE47, its upregulation in late-onset PE may reflect early signs of endothelial dysfunction. KICS2 (KICSTOR subunit 2) regulates autophagy via mTORC1 signaling48. Autophagy is an important process for placentation, especially for trophoblast invasion49, and impaired placental autophagy has been observed in PE50. Autophagy also contributes to monocyte-macrophage differentiation51, suggesting that the PE-related changes in monocytes populations observed previously52,53,54 and in our study might be at least partially related to these alterations. HIPK3 (particularly its circular isoform circHIPK3) is also to be noted, as it is downregulated in preeclamptic placentas, with functional studies showing reduced trophoblast proliferation, migration and invasion55,56.

Among the other identified genes, many link to immune or blood cell functions. PPM1G is a phosphatase that regulates multiple cellular processes, including mRNA splicing, DNA damage response, and cell cycle progression. Its elevated expression has been associated with poor prognosis in hepatocellular carcinoma and shown to correlate with immune cell infiltration and NF-κB–mediated inflammatory signaling57. RAB11B, a Ras-like small GTPase, is known to regulate intracellular trafficking processes and immune receptor signaling58. It has also been linked to autophagy regulation and macrophage polarization, thereby influencing inflammatory responses59. FAM117A, also known as C/EBP induced protein, contributes to B-cell to myeloid differentiation, macrophages, and granulocytes in blood cell development60 and NK cell metabolism in cancer61. APOL6 has been associated with ferroptosis-related pathways and increased infiltration of immune cells such as CD8 + T cells, macrophages, and dendritic cells in cancer62, and exosomal regulation of APOL6 was shown to affect endometrial stromal cell survival63. TRIM68 is an E3 ubiquitin ligase involved in immune regulation and apoptosis in cancer64,65. In osteosarcoma, its expression correlated with increased memory CD4 T cells and Tregs, and decreased mast cells66, suggesting a role in adaptive immune responses. Although direct evidence from blood cells or immune cells is limited, these findings support a potential role for immune regulation.

Other identified genes have been studied mainly in placental or general cellular processes. SFI1 is a centriolar protein essential for centriole stability and ciliogenesis67. ARSG encodes a lysosomal sulfatase linked to metabolic stress and proinsulin levels, a metabolic trait associated with obesity68 and a known risk factor for PE. PITRM1, a mitochondrial metallopeptidase, is expressed in human placenta69 and has been reported to be abnormally upregulated in mouse nuclear transfer placentas70. USO1 is a vesicle transport factor that mediates ER-to-Golgi trafficking and is part of the phosphatidylinositol 3-kinase (PtdIns3K) complex, essential for autophagy and autophagosome assembly in intestinal epithelial cells71. While these genes are involved in placental or cellular pathways, their specific relevance in maternal blood and late-onset PE remains to be clarified.

While we investigated the cellular mRNAs in peripheral blood, many previous studies used cell-free molecules, not only RNAs but also DNAs7. Those approaches measure cell content primarily leaked from damaged cells. For example, Farina et al.72 could predict (both early- and late-onset) PE using cell-free mRNAs of Fms Related Receptor Tyrosine Kinase 1 (FLT1) at 10–14 weeks of gestation; the detection rate at a 5% false positive rate was 72.3%. Because FLT1 gene is highly expressed in placental trophoblasts and endothelial cells throughout the body73, and secreted in soluble forms, its high circulating levels make it an ideal marker for disease detection when the tissue breakdown occurs. In contrast, our approach measures cellular mRNAs originating primarily from intact cells of the peripheral blood. Therefore, the placental status cannot be assessed by our method but instead, our study provides data on maternal immune status and dysregulation based on leukocyte-derived mRNA alterations.

The main limitation of our study was the moderate number of samples. A larger series would have been ideal due to the heterogeneous nature of late-onset PE6, and the complexity of PE pathogenesis1. Furthermore, we had neither data on PE subtypes nor the severity of PE. These limitations may also explain why conventional DEG analysis failed to identify significant single-gene changes, partly due to the subtle variation observed at the presymptomatic phase of late-onset PE. Therefore, pathway-level approaches such as PCA-GSEA are more suitable in such cases. Genes that interact with each other for a function tend to co-regulate, and PCA groups co-regulated genes into principal components, so that genes in the same pathway could be enriched by PCA. Unlike DEG, Broad GSEA evaluates all genes, including those without significant changes at the single-gene level. Therefore, if the pathway-level alteration is consistent across patients with the same disease, the combination of PCA and GSEA can detect it. The present study detected several pathways as early signals of late-onset PE using this approach, and the robustness must be validated in other cohorts. However, our study utilized several variables that affected the expression profiles, including maternal pre-pregnancy BMI (PC1 of Fig. 2), fetal gender (PC2), or gestational diabetes (PC3), suggesting that future studies should not only include larger sample sizes but also account for these characteristics. Unfortunately, we had no longitudinal samples or samples at the diagnosis, limiting the conclusions of this study. As a strength of the study, however, the wide coverage of protein-coding genes allowed us to search for potential biomarkers for disease prediction.

Collectively, these results provide novel insights into the complex pathogenesis of PE and its prediction at the presymptomatic phase. Our study suggests that maternal immune dysregulation is an early sign of the disease and is measurable by maternal blood RNA expression profiles. Future studies are needed to characterize and stratify these findings in larger series and clinical settings.