Introduction

Endometriosis is a common benign gynecological condition characterized by the growth of endometrial gland and stroma located outside the uterine cavity, resulting in scar tissue, adhesions and inflammatory reactions1. Previous studies reported main symptoms including dysmenorrhea, dyspareunia, pelvic pain and infertility, which has a negative impact on quality of life2.

Ultrasonography presents sensitivity over 90% to define an endometrioma cyst with the typic visualization3. However, the diagnosis is limited in women with suspected early-stage disease and in cases of deeply penetrated endometriosis. In these cases, the diagnosis of endometriosis depends on histopathologic examination after invasive laparoscopic surgery excision, remaining the only standard accurate diagnostic investigation4. Therefore, researchers are directing their attention towards novel non-invasive methodologies for endometriosis diagnosis, including the analysis of blood biomarkers and genetic predispositions. However, to date, no serum marker has been found to diagnose endometriosis with adequate sensitivity and specificity.

Attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy allows evidence into the chemical composition of biological samples at a molecular level. FTIR vibrational analysis performs a determination of functional groups, which build compounds of lipids, protein, carbohydrate, and deoxyribonucleic acid (DNA)5. Previous reports have shown the potential of vibrational spectroscopy in categorizing gynecological complications, such as ovarian, endometrial and cervical cancer6,7,8. Additionally, near-infrared (NIR) imaging is a promising technique for the enhanced detection of endometriotic lesions. This technique relies on the application of a fluorescent dye, which is absorbed by the tissue or transported intraluminally.

Infrared spectroscopy evaluates the absorbed energy in biomolecules within the mid-infrared (IR) region of the electromagnetic spectrum9, measuring the absorbance of IR within this region. Each chemical bond has unique vibrational properties, leading to its absorption of IR radiation at specific frequencies, usually expressed as wavenumbers. Plotting wavenumber against absorbance intensity creates a spectrum representing the types and amounts of bonds present in the material. This pattern is called the 'biochemical-cell fingerprint’ in biological samples. Considering that endometriosis leads to chemical alterations in the IR spectrum, this technique has the potential to distinguish this condition from healthy individuals. IR spectroscopy together with chemometric methods has played an increasingly important role in the field of medical and biological analysis, through quickly detecting gynecological conditions at early stages. This is the first description of NIR and ATR-FTIR spectroscopy in blood plasma patients diagnosed with early endometriosis, elucidating potential future applications of this technique.

Results

The pre-processed IR spectra of blood plasma for both the control group (healthy women) and the endometriosis group are illustrated in Fig. 1. The dataset comprises 75 samples, generating a total of 225 spectra, with each sample providing three spectra. To observe the reproducibility of the spectral data, the mean and the standard deviation were calculated for each ATR-FTIR and NIR replicate to compare the differences between them. For ATR-FTIR, it was obtained the values of 0.0625 ± 0.0027 (replicate 1), 0.627 ± 0.0029 (replicate 2) and 0.626 ± 0.024 (replicate 3) for endometriosis and 0.0618 ± 0.0021 (replicate 1), 0.0618 ± 0.0023 (replicate 2) and 0.0614 ± 0.0018 (replicate 3) for the control group. For NIR, the values were 0.5569 ± 0.0725 (replicate 1), 0.5274 ± 0.0542 (replicate 2) and 0.5262 ± 0.0583 (replicate 3) for endometriosis and 0.4734 ± 0.0198 (replicate 1), 0.4637 ± 0.0207 (replicate 2) and 0.4645 ± 0.0189 (replicate 3) for healthy controls. The small difference between the obtained mean ± standard deviation values demonstrate that the data presents a high reproducibility. These spectra were averaged before constructing the model. In the ATR-FTIR region (Fig. 1A), several characteristic absorption bands are discernible. Notable among these are the Amide I band of proteins (~ 1650 cm-1), a minor band at 1552 cm-1 corresponding to the C = O bond of Amide II, and a feature at 3366 cm-1 associated with O–H stretching10,11.

Fig. 1
figure 1

(A) ATR-FTIR spectra; (B) NIR spectra; the standard deviation values for (C) ATR-FTIR and (D) NIR; and the second-derivative spectra for (E) ATR-FTIR and (F) NIR for the endometriosis and healthy control groups.

The NIR spectra (Fig. 1B) show the –CH₂ second overtone at 1179 nm, as well as the first overtone of N–H stretching and the first overtone of O–H stretching. The band at 1788 nm is attributed to lipid structures, while the bands at 1950 nm and 2332 nm correspond to the second overtone of C–O stretching in carbohydrates and the stretching and bending of CH associated with methylene, respectively12. In addition, the average signal-to-noise ratio (SNR) values for the spectra dataset were investigated. The calculated SNR for ATR-FTIR spectra were 145.19 and 160.19 for the endometriosis and the control classes, respectively, while for NIR, the SNR were 103.29 and 113.86 for endometriosis and controls, respectively. These values suggest a relative high quality of both spectroscopic measures13.

To obtain the spectral variability within each of the classes, the relative standard deviation (RSD) was calculated. For ATR-FTIR data, the RSD values were 6.43% and 4.05% for the control and endometriosis classes, respectively, whereas for NIR, the RSD values were 18.31% and 18.59% for controls and endometriosis, respectively. These results suggest a low spectral variability for the studied classes, especially in relation to the ATR-FTIR spectra. The standard deviation values for each wavenumber/wavelength in the IR spectra (Fig. 1C, D) shows the variation of the IR variables around its average spectra. The NIR data presents more fluctuations on its signal resulting in higher values of standard deviation.

Figure 1E, F shows the average second derivative spectra of ATR-FTIR and NIR. By differentiating the spectra and then averaging the derivative spectra, bands with characteristic differences between healthy women and endometriosis group can be obtained. In the ATR-FTIR region (Fig. 1E), the characteristic differences between the two groups were mainly at 1632 cm−1 and 1548 cm−1, which corresponds to the bands of Amide I and Amide II11, respectively, indicating that these bands are responsible for the separation between the two groups. Meanwhile, the characteristic differences of NIR spectra were more evident. In the NIR range, the characteristic differences between the two groups were mainly at 1852 nm, 1900 nm, 2244 nm and 2332 nm, that corresponds to the O–H combination band, combination frequency of water absorption, stretching and bending of CH associated with methylene, and the -CH3 methyl combination band, respectively12.

Both the NIR and ATR-FTIR spectra display significant overlap in spectral features between the control and endometriosis groups, necessitating the use of multivariate tools to distinguish between these categories. Several supervised multivariate classification tools were tested, and their performances are summarized in Table 1.

Table 1 Quality parameters calculated for the test set using different supervised classification algorithms to distinguish women with endometriosis and the healthy control women.

The best discrimination results for ATR-FTIR data were obtained using PCA-SVM (77% accuracy), followed by GA-SVM (73% accuracy). This performance is consistent with F-scores of 86% and 72%, respectively, indicating that classification was not affected by imbalanced class sizes. For PCA-SVM, three principal components (PCs) were selected, representing 97.18% of the accumulated variance. Figure 2D–F depicts the PCA scores for an overview of the classes’ separation. However, the group distribution was not satisfactory just with the use of this exploratory analysis technique, probably due to the high spectral overlap between both classes. Thus, it is necessary the application of a discriminant algorithm, such as LDA, QDA or SVM, enabling an accurate classification of the samples. The loading profiles of these PCs, used for spectral assignment, are shown in Figs. 2A–C. The wavenumbers contributing to this discrimination are summarized in Supplementary Table S1.

Fig. 2
figure 2

PCA loadings on (A) PC1; (B) PC2, and (C) PC3; and PCA scores (D) PC1 vs. PC2; (E) PC1 vs. PC3, and (F) PC2 vs. PC3.

GA-SVM selected 12 spectral wavenumbers from the 1726 variables in the entire spectrum for class differentiation. These wavenumbers are 3602, 3476, 2644, 2404, 1850, 1502, 1454, 1156, 1094, 978, 702, and 640 cm⁻1 (Supplementary Figure S1A). The GA-SVM discriminant function (DF) is depicted in Supplementary Figure S1B, with the tentative biomarker assignments for selected wavenumbers shown in Supplementary Table S2. The best fit for NIR data was achieved using PCA-LDA and GA-LDA, both with 100% accuracy. The assignments for PCA-LDA loadings are depicted in Supplementary Table S3, while the loadings and the scores are shown in Fig. S3. Although PC1 vs. PC2 do not show the separation between the classes, this is evident in both scores plots in Fig. S3E-F, when it is possible observe the separation between endometriosis and control groups, due to some compositional differences between the two classes. Figure 3 shows the discrimination plot and the 14 selected variables for NIR using GA-LDA.

Fig. 3
figure 3

GA-LDA results for NIR data. (A) Selected wavenumbers responsible for class separation (B) Discriminant function (DF) for the samples.

The 14 spectral wavelengths responsible for class separation in NIR data using GA-LDA are 2332, 2289, 1498, 1479, 1393, 1370, 1361, 1304, 1223, 1187, 1096, 1073, 1037, and 901 nm. The assignments for these variables are summarized in Supplementary Table S4. To improve results, a low-level data fusion strategy was applied to IR data, with results depicted in Table 2.

Table 2 Quality parameters calculated for the test set using different supervised classification algorithms for the data fusion to distinguish women with endometriosis and the healthy control women.

The results indicate good performance for the PCA-LDA, SPA-LDA, and GA-LDA models in discriminating data using fused NIR and MIR spectra. The combination of overtones and combination bands from NIR with fundamental vibrations found in ATR-FTIR provides comprehensive data characterization14. SPA-LDA selected 15 variables, while GA-LDA selected 11 variables (Supplementary Table S5), with tentative biomarker assignments shown in Table 3.

Table 3 Wavenumbers and wavelengths selected by SPA-LDA and GA-LDA applied to the ATR-FTIR and NIR fused data to discriminate between healthy controls vs. endometriosis samples.

Additionally, 2D correlation spectroscopy was applied to study the relationship between different ATR-FTIR and NIR wavenumbers/wavelengths (Fig. 4). This correlation aids in identifying the molecular origin of NIR markers, as MIR bands are sharper and well-documented in specialized literature15. A correlation marker for endometriosis patients was found at ~ 1650 cm−1 of the MIR spectra combined with the 1170 nm NIR band, being related to the amide I band. The marker at 1552 cm⁻1, attributable to amide II, is consistent with the NIR band at 1100 nm, representing the second overtone of secondary amides10,16. Other markers include wavenumbers at 2782 cm⁻1, 1402 cm⁻1, and 1416 cm⁻1 MIR bands, attributed to N–H stretching, symmetric CH₃ bending modes of methyl groups of proteins, and N–H deformation/C–N stretching, respectively.

Fig. 4
figure 4

Synchronous map obtained from 2D correlation. Black arrows indicate the correlation between ATR-FTIR and NIR spectroscopy bands.

Discussion

There are few studies in specialized literature that search for new tools to predict the possible diagnosis of endometriosis using non-invasive methods. These methods include the analysis of blood biomarkers and genetic predispositions, demonstrating high sensitivity and specificity17,18. Kokot et al. 2022, developed a partial least squares discriminant analysis (PLS-DA) model based on selected serum biochemical parameters, specific regions of the serum’s ATR-FTIR spectra, and combined data to diagnostic screening of advanced endometriosis, reaching an overall accuracy of 87.5%19. Therefore, a novel method using IR spectroscopy coupled with multivariate classification tools was applied to analyze blood plasma collected from women with endometriotic patterns and healthy individuals to discriminate between the two groups.

In this study, 75 blood plasma samples from women were analyzed using ATR-FTIR and NIR spectroscopy to stratify patients with endometriosis (41 out of 75) and healthy controls (34 out of 75). Various chemometric algorithms were tested for category discrimination, with the wavenumbers/wavelengths most responsible for segregating the different classes designated as potential diagnostic biomarkers.

ATR-FTIR spectroscopy is a powerful tool for analyzing different biological structures based on spectral analysis, proving to be highly useful in clinical studies20. Previously, this tool has been successfully applied to detect gestational diabetes mellitus by our research group21 and endometrial cancer22. Herein, ATR-FTIR spectroscopy combined with chemometric algorithms was employed to detect patients with endometriosis. The best results were obtained using PCA-SVM, which achieved a classification accuracy of 77% and a sensitivity of 83%, followed by GA-SVM with an accuracy of 73%. Twelve spectral wavenumbers were responsible for class separation based on GA-SVM (Table S2). Bands related to Amide I at 1640 cm⁻1, Amide II at 1548 cm⁻1, and some features related to proteins at 1538 cm⁻1 were identified as potential discriminant biomarkers.

In addition, NIR spectroscopy is a valuable tool for analyzing different types of diseases in biologically-derived samples. This spectroscopy has been successfully applied to detect HIV in pregnant women23. PCA-LDA and GA-LDA were capable of discriminating patients with endometriosis from healthy women with 100% accuracy, 100% sensitivity, and 100% specificity. Furthermore, the NIR technique allows the use of portable instruments, which represents an operational advantage. However, NIR spectroscopy has limited chemical specificity due to the high degree of overlapping spectral features, making pure biomarker identification using only this technique infeasible24. In this context, IR data fusion (ATR-FTIR and NIR) appears as an alternative strategy, combining the advantages of both IR spectroscopies in a single model. This approach increases the reliability of classification algorithms compared to using a single analytical technique25, as demonstrated by Yang et al. 2021, that successfully applied an NIR-MIR spectral feature fusion based on PLS-DA to obtain a rapid and accurate diagnostic of Alzheimer’s disease, obtaining an accuracy of 100%26.

Applying the data fusion strategy, the classification models using all studied feature extraction/selection algorithms combined with LDA and SVM achieved 100% accuracy, 100% sensitivity, and 100% specificity, confirming the feasibility of this approach for identifying endometriosis patients. The successful application of this method in a clinical environment could aid in the early diagnosis of endometriosis in a low-cost and non-invasive manner.

Furthermore, the multimodal spectroscopy or data fusion approach can be considered an advanced analytical tool that provides comprehensive and complementary information previously inaccessible, enabling the elucidation of intricate details. The application of 2D correlation spectroscopy allowed for the study of relationships between different MIR and NIR variables (wavenumbers/wavelengths). Markers found in the synchronous map were related to the Amide I and Amide II MIR bands, which matched the variables selected by the chemometric models. Finally, this approach demonstrates that NIR bands are correlated with specific ATR-FTIR bands, enhancing the interpretability of NIR spectra.

Conclusions

To the best of our knowledge, this is the first study demonstrating the efficacy of a new tool for fast and non-invasive diagnosis of endometriosis using blood plasma samples analyzed with IR spectroscopy combined with multivariate classification. The PCA-LDA, SPA-LDA, and GA-LDA models showed excellent performance when applied to the fused ATR-FTIR and NIR data, with high discriminant power between the studied classes. In this context, spectroscopy emerges as a potential method for studying endometriosis and could be applied in clinical practice in the future, either as a screening or diagnostic test.

Methods

The methodology used in this study is summarized in the workflow in Fig. 5.

Fig. 5
figure 5

Workflow for the endometriosis versus healthy controls classification.

Study design and population

We conducted a case–control study at the Januário Cicco Maternity School, affiliated with the Federal University of Rio Grande do Norte. A total of forty-one women (n = 41) with a clinical diagnosis of endometriosis were recruited, alongside thirty-four healthy women (n = 34). Subjects were excluded if they had autoimmune diseases, immunodeficiencies, chronic illnesses, or genetic syndromes linked to sex chromosomes. The characteristics of the included patients are presented in Table 4. All methods used in this study adhered to approved guidelines. The study received ethical approval from the ethics committee at the Januário Cicco Maternity School, Federal University of Rio Grande do Norte, under protocol number 44352921.6.0000.5292, and written informed consent was obtained from all subjects. All procedures were conducted in compliance with the Declaration of Helsinki.

Table 4 Demographic and clinical characteristics of included women with and without diagnosis of endometriosis.

Sample collection and determination for analysis with ATR-FTIR and NIR

Venous blood samples were collected from all participants. The samples were centrifuged at 3600 rpm for 7 min to separate erythrocytes from the plasma. Subsequently, 100 µL aliquots of plasma were transferred to Eppendorf tubes and stored at −80 °C until ATR-FTIR and NIR analysis. This sample collection resulted in a total of 75 samples, with 41 from patients with endometriosis and 34 from the healthy control group.

ATR-FTIR spectroscopy

The blood plasma samples were left to thaw for 30–40 min at room temperature before spectrochemical measurement. 10 µL aliquots were used for analysis and all tests were obtained in triplicates using wet samples, with a new aliquot applied for each of the replicates. The spectral data were obtained using an attenuated total reflectance ATR-FTIR of PerkinElmer Spectrum Two FTIR type (PerkinElmer, USA) using a single-reflection diamond accessory. Measurements were performed using a KBr beamsplitter and a LiTaO3 (lithium tantalate) detector. The spectra were recorded with a total of 32 scans. Next, they underwent Norton-Beer apodization and Fourier transformation using a zero-filling factor of 2 with 4 cm−1 resolutions at the range between 4000 and 550 cm−1. The ATR crystal was cleaned with ethyl alcohol (70% v/v) for each sample, before acquiring the experimental background and after reading each sample to minimize atmosphere variations over time.

NIR spectroscopy

NIR were obtained using an ARCoptix FT-NIR Rocker spectrophotometer (Arcoptix S.A., Switzerland). The tests were taken in transflectance mode with a resolution of 5 nm. The methodology was carried out by transferring 10 µL of the sample to an aluminum paper surface and using an optical fiber positioned onto each paper. The measurements were obtained in triplicate for each sample, with a new aliquot being prepared for each replicate, in the 900–2600 nm range.

Data analysis

The data analysis was performed in MATLAB R2014b environment version 8.4 (Math Works, Inc., USA), with PLS Toolbox version 7.9.3 (Eigenvector Research, Inc., USA) and the Classification toolbox (version 7.0) by Milano chemometrics and QSAR research group27. The raw ATR-FTIR spectra were loaded and pre-processed by Savitzky-Golay (SG) smoothing (window of 7 points, 2nd order polynomial fitting) and automatic weighted least squares (AWLS) baseline correction. NIR spectra were pre-processed with multiplicative signal correction (MSC) and SG smoothing (window of 15 points, 2nd order polynomial fitting). In addition, a low-level data fusion strategy was applied by combining the NIR and the ATR-FTIR information to achieve more accurate results28. The ATR-FTIR/NIR fused data were normalized due to the different scales of the spectra. Both data were mean centered before the analysis. The samples were divided into training (70%) and test (30%) subsets by applying the Kennard-Stone (KS) sampling algorithm29, using the training set for model construction and test set for its validation. For building the models, feature extraction/selection algorithms combined with discriminant analysis and support vector machine (SVM) were applied. The genetic algorithm (GA)30 and the successive projections algorithm (SPA)31 were applied for feature selection, while principal components analysis (PCA)32 was used for feature extraction. The discriminant analysis was performed by linear discriminant analysis (LDA) and quadratic discriminant analysis (QDA). The algorithms PCA-LDA/QDA/SVM, SPA-LDA/QDA/SVM and GA-LDA/QDA/SVM were tested independently to achieve the best classification model.

In the PCA model, each PC is composed of scores (variance in the sample direction), used to assess similarities/dissimilarities between the samples; and the loadings (variance in the variables direction), used to show the weight of each variable (wavenumber or wavelength) towards the scores pattern. The PCA decomposition of a spectral dataset can be calculated by Eq. 1:

$$X=T{P}^{T}+E$$
(1)

where T is the score matrix; P is the loadings matrix; and, E is the residual matrix. The spectral variables selected by SPA and GA are based on minimizing the cost function G according to the Eq. 2:

$$G=\frac{1}{{N}_{V}}\sum_{n=1}^{{N}_{V}}{g}_{n}$$
(2)

where NV represents the number of validation samples and gn can be calculated according to Eq. 3:

$${g}_{n}=\frac{{r}^{2}({x}_{n},{M}_{I\left(n\right)})}{{{min}_{I(m)\ne I(n)}r}^{2}({x}_{n},{M}_{I\left(m\right)})}$$
(3)

where r2(xn,mI(n)) is the squared Mahalanobis distance between the object xn and the center of its true category, while r2(xn,mI(m)) represents the squared Mahalanobis distance between the object xn and the center of the closest wrong category mI(m).

The LDA (Lik) and QDA (Qik) discriminant analysis are calculated in a non-Bayesian form as described below 33,34:

$${L}_{ik}=({x}_{i}-{\overline{x} }_{k}{)}^{T}{C}_{pooled}^{-1}({x}_{i}-{\overline{x} }_{k})$$
(4)
$${Q}_{ik}=({x}_{i}-{\overline{x} }_{k}{)}^{T}{C}_{k}^{-1}({x}_{i}-{\overline{x} }_{k})$$
(5)

where xi is a vector containing the input variables for each sample; \({\overline{x} }_{k}\) is the mean spectrum of category k; and, Cpooled and Ck are the pooled covariance matrix and the variance–covariance matrix for category k, respectively.

SVM is a binary linear classifier using a nonlinear step that transforms the input sample space into a feature space by the application of a kernel function35. The SVM can be calculated by the following Eq. 636:

$$f\left(x\right)=sing\left(\sum_{i=1}^{{N}_{SV}}{\alpha }_{i}{y}_{i}k\left({x}_{i},{z}_{j}\right)+b\right)$$
(6)

where NSV represents the number of support vectors; αi, yi, and k(xi,zj) are the Langrage multiplier, the class membership (± 1) and the kernel function, respectively; and b is the bias parameter.

Model quality evaluation

The classification performance of each method was obtained based on the model accuracy, sensitivity and specificity. The accuracy (AC) represents the total number of samples correctly classified, while the specificity (SPEC) and the sensitivity (SENS) measure the proportion of negatives and positives that are correctly identified, respectively. The F-Score measures the overall classification performance considering imbalanced data, while the G-Score measures the overall classification performance not accounting for class sizes37. These validation parameters are calculated by the following equations:

$$AC\left({\%}\right)=\left(\frac{TP+TN}{TP+FP+TN+FN}\right)\times 100$$
(7)
$$SPEC\left({\%}\right)=\left(\frac{TN}{TN+FP}\right)\times 100$$
(8)
$$SENS\left({\%}\right)=\left(\frac{TP}{TP+FN}\right)\times 100$$
(9)
$$F-Score\left(\%\right)=(\frac{2\times SENS\times SPEC}{SENS+SPEC})$$
(10)
$$G-Score\left(\%\right)=\sqrt{SENS\times SPEC}$$
(11)

where TP, TN, FP and FN represent the numbers of true positives, true negatives, false positives and false negatives, respectively.