Introduction

Endometriosis is a systemic inflammatory disease characterized by the presence of endometrial-like tissue beyond the uterine cavity1,2,3,4. The condition is estimated to affect 10% of reproductive age women, with common symptoms including chronic pain and infertility5,6,7,8. Autoimmune diseases are conditions that occur when the body’s immune system attacks and damages its own healthy tissue9. Autoimmune conditions affect approximately 8% of people, with women accounting for 80% of these cases10.

Given the overlapping clinical features and impact of both endometriosis and autoimmune conditions on quality of life, several studies have investigated potential links. These studies have shown an association between endometriosis and a range of autoimmune conditions, including lupus erythematosus11,12,13,14,15,16, rheumatoid arthritis11,13,15,17,18,19, Hashimoto’s disease16, inflammatory bowel disease14,20, Grave’s disease21, multiple sclerosis12, Sjögren’s syndrome11,12,14,22, celiac disease16, and ankylosing spondylitis23. The rationale for exploring this co-occurrence are potential shared immunopathogenic mechanisms between autoimmune conditions and endometriosis, such as chronic inflammation and immune cell dysregulation. The presence of ectopic lesions in patients with endometriosis results in a cascade of inflammation that promotes further adhesions and scar formation, along with subsequent increases in the release of prostaglandins, cytokines, and inflammatory products24. This proposed role of immune cells in the development of endometriosis is supported by findings that women with endometriosis have increased levels of peritoneal neutrophils and macrophages, suppressed natural killer cell activity, reduced myeloid dendritic cell density in the endometrium, and abnormal T-cell functioning25,26,27. These patterns of chronic inflammation and immune cell dysregulation also play a role in mediating the pathogenesis in various autoimmune diseases, supporting the plausibility of a link between endometriosis and autoimmune conditions28,29,30.

While past findings suggest a link between endometriosis and autoimmune conditions, the temporal relationship between these diagnoses remains unclear. Determining whether these conditions are diagnosed within a similar timeframe is essential for understanding if immunodeficiencies associated with endometriosis contribute to the development of comorbid autoimmune conditions. To date, however, investigations into the association between endometriosis and autoimmune conditions have only explored the association over patients’ lifetime and have not assessed if the association remains significant within a defined diagnostic time-window. Thus, we conducted a retrospective cohort study to test if women with endometriosis have greater odds than matched control patients of receiving an autoimmune diagnosis within two-years of their initial endometriosis diagnosis. Additionally, most past studies were limited to specific research cohorts and moderate sample sizes; in contrast, we explore the association between endometriosis and autoimmune conditions within large observational cohorts, reflective of routine care patterns in the United States.

Results

Study population

A total of 332,409 women with endometriosis were identified: 278,757 from CCAE and 53,652 from MDCD. Our comparison cohort included 1,220,932 women without endometriosis: 1,017,539 from CCAE and 203,393 from MDCD. A demographic breakdown of patients is provided in Table 1. and a breakdown of patients with a diagnosis of each autoimmune condition analyzed is provided in Table 2. Among patients with endometriosis, 4.93% (n = 11,520) received a diagnosis of at least one autoimmune condition within two-years of their endometriosis diagnosis. The prevalence of autoimmune conditions among the comparison cohort was lower with only 1.42% (n = 17,380) receiving an autoimmune diagnosis within two-years of entering the cohort.

Table 1 Demographic characteristics of endometriosis and comparison cohorts in CCAE and MDCD databases (race and ethnicity data not available for CCAE)
Table 2 Autoimmune diagnoses prevalence within endometriosis and comparison cohorts in CCAE and MDCD databases

Autoimmune comorbidities

All reported odds ratios (ORs) in this analysis were Bonferroni corrected to account for multiple comparisons. Analysis using a 1-year observation period on the CCAE database (Table 3) showed patients in the endometriosis cohort (n = 196,009) had significantly higher odds than comparison patients (n = 705,781) of receiving a diagnosis of Rheumatoid arthritis (OR = 2.32–2.82), type 1 diabetes mellitus (OR = 1.27–1.60), Hashimoto’s disease (OR = 2.25–2.77), systemic lupus erythematosus (OR = 2.55–3.27), multiple sclerosis (OR = 2.55–3.27), pernicious anemia (OR = 2.44–3.55), Sjögren’s syndrome (OR = 3.43–5.03), Myositis (OR = 3.79–5.92), and Graves’ disease (OR = 1.46–4.55) within the 2-year diagnosis window. When the observation period was expanded to 3-years (Table 3), the association between endometriosis and type I diabetes mellitus (OR = 0.84–1.26) was no longer significant (n = 82,748 endometriosis cohort; n = 311,758 control cohort).

Table 3 Odds ratios of patients with endometriosis in CCAE databases receiving a diagnosis of ten common autoimmune conditions within 2-years of receiving an endometriosis diagnosis (1-year and 3-year observation periods)

Similarly, when a 1-year observation period was applied to patients in the MDCD database (Table 4), patients in the endometriosis cohort (n = 35,547) had significantly higher odds than comparison patients (n = 141,130) of being diagnosed with each of the 10 autoimmune conditions analyzed. When the Bonferroni correction was applied, the association between endometriosis and Grave’s disease (OR = 0.75–7.37) was no longer significant. Analysis using a 3-year observation period with the MDCD database (Table 4) also showed patients in the endometriosis cohort (n = 16,105) had significantly higher odds than comparison patients (n = 62,263) of being diagnosed with all 10 of the autoimmune conditions analyzed. When the Bonferroni correction was applied, the association between endometriosis and vitiligo (OR = 0.97–5.87) was no longer significant.

Table 4 Odds ratios of patients with endometriosis in MDCD databases receiving a diagnosis of ten common autoimmune conditions within 2-years of receiving an endometriosis diagnosis (1-year and 3-year observation periods)

When considering all autoimmune conditions combined (Table 5), women with endometriosis in both CCAE and MDCD databases had significantly greater odds (p < 0.01) of receiving an autoimmune diagnosis within the 2-year diagnosis window relative to women without endometriosis. Results were significant for both observation periods. These results remained significant after applying the Bonferroni correction.

Table 5 Odds ratios of women with endometriosis receiving an autoimmune diagnosis within 2-years before or after being diagnosed with endometriosis

Discussion

The present study is the first large-scale investigation to explore the association between endometriosis and 10 distinct autoimmune conditions on real world observational health data using a two-year diagnosis window. Compared to matched controls, patients with endometriosis had a modestly increased risk of being diagnosed with rheumatoid arthritis, Hashimoto’s disease, systemic lupus erythematosus, multiple sclerosis, or pernicious anemia, and a large increase in risk of being diagnosed with Sjögren’s syndrome or myositis within two-years of receiving their endometriosis diagnosis. We also found that the odds of receiving a diagnosis of at least one of the autoimmune conditions analyzed was roughly two times greater among patients with endometriosis compared to matched controls.

Our results are consistent with findings from other large-scale studies showing an association between endometriosis and a range of autoimmune conditions. These include Rheumatoid arthritis11,13,17,18,19, systemic lupus erythematosus11,12,13,14,15, multiple sclerosis12, Sjögren’s syndrome11,12,14,22, and Grave’s disease21. Of note, however, smaller scale studies exploring the association between endometriosis and Hashimoto’s disease have found conflicting results16,31.

These past findings, coupled with our results suggest endometriosis and autoimmune conditions may share biological pathways. However, further investigation is needed, with adjustments for potential confounding factors. Defects in T cell and regulatory cell functioning, as well as abnormal levels of prostaglandins, cytokines, and inflammatory products have all been observed among patients with endometriosis24,25,27 as well as patients with several autoimmune conditions32,33. To date, however, explorations into shared genetic variants between endometriosis and autoimmunity have shown mixed results34. For instance, a large genome-wide analysis showed genetic correlations between endometriosis and inflammatory conditions such as asthma and osteoarthrosis, but no genetic correlation between endometriosis and any of the autoimmune conditions analyzed, including inflammatory bowel disease, systemic lupus erythematosus, rheumatoid arthritis, and multiple sclerosis35. However, a more recent study examining phenotypic and genetic associations between endometriosis and 31 autoimmune conditions found significant positive genetic associations between endometriosis and rheumatoid arthritis and multiple sclerosis36.

While the evidence of a genetic correlation shows mixed results, a potential alternative explanation is that the systemic chronic inflammation experienced by patients with endometriosis1, increases risk for developing a comorbid autoimmune condition37. Both endometriosis and autoimmune conditions are systemic inflammatory diseases, impacting multiple organ systems throughout the body1,2,3,4. Notably, the presence of systemic chronic inflammation can increase risk for developing numerous chronic conditions, including autoimmune conditions17,38,39,40. Therefore, it is possible that the genetic correlation between endometriosis and inflammatory conditions, coupled with the systemic and chronic inflammatory nature of endometriosis contribute to the development of co-morbid autoimmune conditions observed among patients with endometriosis. Previous research has shown that patients with endometriosis present with elevated levels of pro-inflammatory cytokines, including Interleukin-1 (IL-1), Interleukin-6 (IL-6), and Interleukin-8 (IL-8)41. These same pro-inflammatory cytokines are also recognized as playing a role in mediating the pathogenesis in various autoimmune diseases, including rheumatoid arthritis and systemic lupus erythematosus28,29,30. If this is the case, it may also explain why the increased risk of concomitant autoimmunity among patients with endometriosis have ranged from minimal to large within our findings and in past research. Since systemic chronic inflammation can be influenced by age, as well as social, environmental, and lifestyle factors, it is possible that exposure to these factors, and their contribution to systemic chronic inflammation varied across the populations studied. Additionally, while pre-existing systemic chronic inflammation has yet to be investigated as a contributing factor to the development of endometriosis, it may also be possible that the inflammatory nature of autoimmune conditions similarly increases risk for developing endometriosis.

Our study is the first to show a significant association between endometriosis and autoimmune conditions within a two-year diagnosis window. Our results do suggest that prior to giving an endometriosis diagnosis, clinicians likely consider and test patients for a range of other conditions which likely results in many patients with endometriosis receiving multiple diagnoses, including autoimmune diagnoses, within a condensed timeframe. Additionally, preliminary findings also suggest that concomitant autoimmunity is a predictor of later stage endometriosis42, suggesting that by the time an undiagnosed endometriosis patient has developed and been diagnosed with a comorbid autoimmune condition, their endometriosis has likely progressed to a more severe stage, which in turn may increase the likelihood their endometriosis will finally be diagnosed.

A significant strength of the study is the use of two comprehensive databases. CCAE provides healthcare claims from over 30 million individuals annually, all beneficiaries of private insurance plans. Additionally, this study utilizes MDCD, which provides data from individuals enrolled in Medicaid. This combination of databases allows for a thorough investigation of healthcare patterns and outcomes across diverse insurance frameworks, thereby enhancing the generalizability of the research findings. Another key strength is the consistency of results observed across different databases and observation periods, indicating the reliability of our findings. Furthermore, the use of a validated phenotype for endometriosis ensures methodological rigor, enhancing the accuracy of our study’s conclusions.

Our findings should be considered in light of the study’s limitations. First, the study’s retrospective cohort design precludes the establishment of causality or directionality. Therefore, while our findings indicate an association, they do not confirm a causal relationship.

Second, our analysis relies on administrative health claims data which might introduce selection bias, as this type of data can be incomplete. Specifically, the data only identifies patients who have received a diagnosis, potentially capturing more symptomatic cases and missing milder cases. This raises the possibility that the observed association could partially reflect that women with both an autoimmune disease and endometriosis present with more severe symptoms, leading to a higher likelihood of both conditions being diagnosed, rather than reflecting shared underlying biological pathways. Additionally, identifying both autoimmune conditions and endometriosis relies on diagnosis codes within the claims data. While we used validated phenotypes when available, relying on diagnosis codes may result in misclassification due to coding errors. The use of claims data also reduces our ability to comprehensively control for confounders. While administrative claims data allows for a large sample size, it lacks the granularity of prospective, curated data sources. Specifically, important covariates such as Body Mass Index (BMI), inflammation, socioeconomic status, detailed pregnancy/family history, lifestyle factors, and clinical characteristics of endometriosis (e.g., stage, primary symptoms), are not reliably captured. Therefore, we conducted a bivariate analysis by matching on age and visit date, making our results subject to residual confounding.

Another limitation of our data source is the lack of race and ethnicity data in the CCAE database, which limits our ability to assess potential racial or ethnic differences in the observed associations. Given these limitations of claims data and retrospective studies, we acknowledge that these findings require confirmation through studies designed to minimize such biases. Future research could use prospective cohorts or data sources such as Electronic Health Records (EHRs) that allow for more detailed clinical and demographic assessment and comprehensive multivariable analysis.

The study faces challenges due to the known delays in diagnosing endometriosis43,44 and several autoimmune conditions45,46, which might cause discrepancies between the actual onset of diseases and their documentation in the databases. As a result, we are unable to ascertain if the onset of comorbid autoimmune conditions reported by patients with endometriosis in this study occurred before, after or in tandem with the onset of their endometriosis. Thus, further investigation is needed to determine if the onset of endometriosis coincides with the onset of the autoimmune conditions analyzed. Before such an investigation can be conducted, however, diagnostic lags must be shortened, which will require further research to improve our understanding of these conditions and ability to identify them.

In conclusion, the present study is the first to show a significant association between endometriosis and several autoimmune conditions within a two-year diagnosis window. While our results show an association between the diagnosis of endometriosis and autoimmune conditions, it is important to note that we are unable to draw conclusions about disease onset due to known diagnostic lags. Future work is therefore needed to shorten time to diagnosis, which will enable further research to assess if the onset of endometriosis and autoimmune conditions coincide. Additionally, investigations into the underlying mechanisms driving the observed associations are crucial to better understand the potential shared biological pathways.

Methods

The analysis and the use of the de-identified dataset presented in this work were carried out under Research Protocol AAAR6505 approved by the Columbia University IRB. All research activities were conducted in accordance with ethical regulations, including the Declaration of Helsinki.

This study leverages the Observational Health Data Sciences and Informatics (OHDSI) framework47 to conduct a retrospective cohort study at scale. OHDSI is an international research network working to support reliable medical evidence generation by developing open-source standards for the structure and content of observational health data48,49. In following OHDSI’s approach, surgically confirmed endometriosis cohorts and comparison cohorts were generated from two administrative claims databases formatted to the OHDSI standard.

Data source

This study utilized two comprehensive deidentified administrative claims databases: the Merative MarketScan Commercial Claims and Encounters (CCAE) database and the Merative Multi-State Medicaid (MDCD) database. CCAE is comprised of data from patients with US employer-based health insurance and contains detailed records of healthcare coverage and service utilization for active employees, early retirees, COBRA participants, and any dependents also insured under employer-sponsored plans. MDCD includes data on the healthcare coverage and service usage for individuals enrolled in Medicaid programs across multiple states, including those in Medicaid-managed care programs.

Both CCAE and MDCD are populated with patient demographics, healthcare visits, disease diagnoses, medications, procedures, and other data types such as laboratory tests. In OHDSI standardized databases, patient data pertaining to diagnoses and procedures can be identified and extracted using Systematized Nomenclature of Medicines50. Similarly, medications can be identified using RxNorm codes51, and Logical Observation Identifiers52 can be used to identify laboratory test data.

Cohort identification

We identified patients with endometriosis using a validated endometriosis cohort definition53,54. The cohort included women ages 15–49 who underwent an endometriosis-related surgical or prevalent procedure and had an endometriosis concept code documented 30 days before or after the procedure, along with an additional endometriosis concept code in their record at any time following the procedure. The phenotype has been validated with a manual chart review of 430 patients from claims and EHR data with a sensitivity of 70%, specificity of 93%, and positive predictive value of 85%. Additional information about our endometriosis cohort is available in the supplemental materials.

The control group included females aged 15–49 who had no previous diagnosis of endometriosis. These individuals were matched to the endometriosis cohort at a ratio of 4:1. Matching criteria were age ( ± 1 year), and the occurrence of an outpatient visit on the exact day the endometriosis patient entered the cohort.

Both cohorts required a minimum of one year of medical observation to ensure adequate time for outcome assessment. Additional analyses were also conducted for those with observations spanning at least three years. Patients with a prior autoimmune diagnosis were not excluded from the cohort. This decision was made considering that patients with one autoimmune condition are at a significantly increased risk of developing additional autoimmune conditions55,56. Therefore, excluding patients with a prior autoimmune diagnosis would not accurately represent the actual population and may underestimate the true association observed in our dataset, limiting generalizability.

Utilizing validated phenotype definitions57,58 that have been made publicly available by the OHDSI research network (https://ohdsi.github.io/PhenotypeLibrary/), we established 10 distinct cohorts corresponding to common and representative autoimmune diseases in the United States: Rheumatoid arthritis, type 1 diabetes mellitus, Hashimoto’s disease, systemic lupus erythematosus, multiple sclerosis, pernicious anemia, Sjögren’s syndrome, myositis, vitiligo, and Graves’ disease55. Only patients who received an autoimmune diagnosis two-years before or after their index endometriosis diagnosis date or the corresponding index date for matched controls were included in these cohorts. The selection of the two-year window was informed by examining the distribution of time differences between patients’ autoimmune diagnosis and endometriosis diagnosis in our data. This analysis revealed mean time differences of 677 days in the CCAE cohort and 669 days in the MDCD cohort. Therefore, a two-year window (representing a total of four years, −730 days to +730 days) captured a substantial portion of the observed autoimmune diagnoses. Additional details about these cohorts can be found in the supplemental materials.

Statistical analysis

Descriptive analyses were performed to summarize demographic characteristics. We also tabulated a breakdown of endometriosis and comparison patients who received a diagnosis of each autoimmune disease analyzed. To achieve our study aim, we employed the Fisher’s exact test to calculate odds ratios (OR) and 95% confidence intervals (CI) for each autoimmune disease in relation to endometriosis. This approach allowed us to obtain precise measurements of the odds ratios, ensuring exactness in our calculations59. Due to the limited covariates available in administrative claims data (detailed in Discussion), we did not perform a multivariable analysis. Separate analyses were run for one-year and three-year observation periods on each database. For each observation period and database, 11 separate tests were run; one for each distinct autoimmune disease and an additional test of the aggregate of all autoimmune diseases combined. To account for potential Type I error due to multiple comparisons, we applied the Bonferroni correction. This approach adjusts the significance threshold downward, based on the number of tests performed, ensuring that the results remained robust against potential false positives. We consider the association significant if p < 0.0045. All analyses were carried out using Python.