Spectrochemical differentiation in gestational diabetes mellitus based on attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy and multivariate analysis

Bernardes-Oliveira, Emanuelly; de Freitas, Daniel Lucas Dantas; de Morais, Camilo de Lelis Medeiros; Cornetta, Maria da Conceição de Mesquita; Camargo, Juliana Dantas de Araújo Santos; de Lima, Kassio Michell Gomes; Crispim, Janaina Cristiana de Oliveira

doi:10.1038/s41598-020-75539-y

Published: 06 November 2020

Scientific Reports volume 10, Article number: 19259 (2020)

Abstract

Gestational diabetes mellitus (GDM) is a hyperglycaemic imbalance first recognized during pregnancy that affects up to 22% of pregnancies worldwide, with negative maternal–fetal consequences in the short and long term. To better characterize GDM in pregnant women, 100 blood plasma samples (50 GDM and 50 healthy pregnant controls) were analysed by attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy. The spectra were cut to the biofingerprint region between 1800 and 900 cm⁻¹ and pre-processed by Savitzky–Golay smoothing, baseline correction and normalization to the Amide I band (~ 1650 cm⁻¹). They were then evaluated with chemometric approaches, including feature selection algorithms coupled to discriminant analysis, namely Linear Discriminant Analysis (LDA), Quadratic Discriminant Analysis (QDA) and Support Vector Machines (SVM). An initial exploratory analysis of the data by Principal Component Analysis (PCA) showed a separation tendency between the two groups, which were then classified by supervised algorithms. Overall, the results obtained by Genetic Algorithm Linear Discriminant Analysis (GA-LDA) were the most satisfactory, with an accuracy, sensitivity and specificity of 100%. The spectral features responsible for group differentiation were attributed mainly to the lipid/protein regions (1462–1747 cm⁻¹). These findings demonstrate, for the first time, the potential of ATR-FTIR spectroscopy combined with multivariate analysis as a screening tool for fast and low-cost GDM detection.

Introduction

Gestational diabetes mellitus (GDM) is a hyperglycaemic metabolic disorder that first appears during pregnancy and does not meet the criteria for manifest diabetes¹. It is characterized by glucose intolerance or beta cell dysfunction and insulin resistance, and affects up to 22% of all pregnancies worldwide².

One of the most widely used protocols for the diagnosis of GDM follows the recommendations of the American Diabetes Association (ADA)³. In addition to hyperglycemia, other glycemic markers have been used for the diagnosis of diabetes mellitus (DM), including fructosamine, glycated albumin, hemoglobin A1c (HbA1c), and 1,5-anhydroglucitol, each with its own limitations, particularly cost in developing countries⁴. Beyond these markers, several researchers are looking for new ways to identify women at risk for GDM, particularly in the first trimester.

GDM is a risk factor for many perinatal morbidities that affect maternal and foetal/neonatal health¹. In women, GDM promotes increased weight and triglyceride levels, changes in blood pressure, heart problems, induction of caesarean section, and type II diabetes after childbirth. For new-borns, the most common risks are excessive birth weight (macrosomia), shoulder dystocia at birth, congenital heart defects, hyperbilirubinemia, polycythemia, respiratory distress and stillbirth, in addition to the risk of developing metabolic syndrome⁵,⁶.

Individuals with GDM during pregnancy are known to suffer physiological changes, with the appearance of diabetogenic placental hormones (oestrogen and progesterone), placental factors (human placental lactogen), and increased lipids and adipokines including leptin, resistin and visfatin from the first trimester. These contribute to a predisposition to metabolic diseases, insulin resistance, obesity and chronic inflammation, with the release of different pro-inflammatory cytokines and C-reactive protein (CRP), especially when these women are obese⁷.

The contribution of biomolecules to the pathophysiology of GDM is not yet well known. However, recent studies have shown that growth differentiation factor 15 (GDF15), also known as macrophage inhibitory cytokine-1 (MIC-1), is highly expressed in the placenta. It is a pleiotropic protein that plays key roles in prenatal development, is induced by both acute and chronic inflammatory states, and acts directly on the metabolism of carbohydrates and lipids in women with GDM⁸,⁹. Due to the metabolic impact of GDM during pregnancy, screening and appropriate management of GDM are essential, especially in the first weeks of pregnancy, to improve the quality of prenatal care for these women. The diagnosis of GDM and early intervention are of great significance for reducing short- and long-term consequences for mothers and new-borns¹⁰. This is critical in less developed countries, where most pregnant women do not have the opportunity to obtain an early GDM diagnosis.

Therefore, there is a need for accurate and low-cost techniques for GDM detection. Attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy can be used to extract spectrochemical information from biological samples by capturing signals from the vibrational motions of the chemical bonds in their biomolecules. This generates a biofingerprint spectrum in the region between 1800 and 900 cm⁻¹, where many important biomolecules (DNA/RNA, lipids, proteins and carbohydrates) contribute metabolic features related to disease appearance¹¹.

Chemometric methods are often employed to analyse complex spectral data acquired with ATR-FTIR spectroscopy. Feature extraction and selection methods, such as principal component analysis (PCA), successive projections algorithm (SPA) and genetic algorithm (GA) can be employed to reduce data complexity and redundant information¹². PCA is an exploratory analysis algorithm capable of reducing the original data into a low number of principal components (PCs), where each PC represents a piece of the original data variance¹¹, while SPA and GA are able to select the most significant wavenumbers from the spectral dataset responsible for class differentiation¹³. These algorithms are commonly associated with linear discriminant analysis (LDA), quadratic discriminant analysis (QDA) and support vector machines (SVM). These classification algorithms are used to build supervised training models which allow us to predict unknown samples based on their spectral response¹².

ATR-FTIR together with chemometric methods has played an increasingly important role in the field of medical and biological analysis, through quickly detecting pathological conditions, even at very early stages.

Previous studies have demonstrated the value of infrared spectroscopy for biological samples from diabetics, for example when analyzing glycation in nail clippings. These studies have shown that ATR-FTIR is sensitive enough to detect the presence of glucose when compared to the reference population¹⁴. ATR-FTIR has also been used in the diagnosis of diseases such as cancer¹⁵, neurodegenerative diseases¹⁶, zika and chikungunya¹⁷ and chronic diseases¹⁸, as well as in blood plasma analysis, where it separated the disease group from the healthy group on the basis of biomolecular features.

Material and methods

Study design and population

We performed a case–control study at a Reference Obstetrics and Gynecology Hospital between January and October 2018. A total of 50 women with GDM were recruited, all with a single pregnancy at a gestational age between 12 and 38 weeks. Only participants with complete clinical information were included in the analysis. Subjects were excluded if they had chronic medical conditions, including hypertension, heart or kidney diseases, or type 2 diabetes mellitus, or if they were declared diabetic (blood glucose ≥ 126 mg/dL). The study was approved by the Ethics Committee of Federal University of Rio Grande do Norte. Written informed consent was obtained from every participant. All procedures were performed in compliance with the Declaration of Helsinki.

Clinical measurements

Baseline anthropometric measurements were completed at recruitment using a standardized protocol for BMI classification by week of gestation. The classifications were underweight, adequate weight, overweight and obesity. Clinical data were collected from medical record reviews. Pregnant women in the GDM group had already been diagnosed during prenatal care with blood glucose ≥ 92 mg/dL and < 126 mg/dL, while patients with blood glucose ≥ 126 mg/dL were considered to be declared diabetic, according to the guidelines of the American Diabetes Association (ADA)³. These women were given medical nutrition therapy and/or insulin treatment during their antenatal follow-up. The anthropometric, socioepidemiological and metabolic characteristics of the GDM and healthy control groups are summarized in Table 3.

Healthy pregnant control group

Fifty healthy pregnant women attending a low-risk maternity hospital were enrolled. The pregnant women were between 19 and 44 years old, at a gestational age between 9 and 39 weeks. The healthy pregnant control group had blood glucose < 92 mg/dL, and all underwent fasting glucose testing and oral glucose tolerance test (OGTT) screening at 24–28 weeks to rule out GDM.

Sample collection and determination for analysis with ATR-FTIR

Venous blood samples were collected from participants following an overnight fast of 8 h. After 4 h, the blood samples were centrifuged at 3600 rpm for 7 min to separate erythrocytes from blood plasma. Aliquots of 100 µL of plasma were transferred to Eppendorf tubes and stored at − 80 °C until ATR-FTIR analysis. The blood plasma glucose levels were determined as described in Table 3.

ATR-FTIR spectroscopy

The blood plasma samples [n = 100; GDM group = 50 and healthy pregnant control group = 50] were thawed at room temperature for 30–40 min, and 10 μL aliquots (in triplicate) were used for analysis. The spectral data were acquired using an IRAffinity-1S FTIR spectrophotometer (Shimadzu Corp., Japan) equipped with an ATR accessory.

The instrument was set to acquire 32 scans at 4 cm⁻¹ spectral resolution for both background and sample spectra, recorded over the range between 4000 and 600 cm⁻¹, as described by Santos et al. with some modifications¹⁷.

Data analysis

The data analysis was performed in the MATLAB R2014b environment, version 8.4 (MathWorks, Inc., USA). The raw spectral data were loaded and pre-processed by cutting the biofingerprint region between 1800 and 900 cm⁻¹, followed by Savitzky–Golay (SG) smoothing (window of 15 points, 2nd order polynomial fitting), automatic weighted least squares (AWLS) baseline correction and normalisation to the Amide I band (1650 cm⁻¹). The data were mean-centred before analysis.
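The pre-processing chain described above can be illustrated with a minimal pure-Python sketch. It is not the authors' MATLAB code: it covers only the region cut, the SG smoothing (15-point window, 2nd-order polynomial, using the closed-form smoothing weights) and the Amide I normalisation, and it omits the AWLS baseline correction. The function names, the choice to leave the first and last seven points unsmoothed, and the use of the point nearest 1650 cm⁻¹ as the normalisation reference are all assumptions made for illustration.

```python
def savgol_quadratic_weights(window):
    """Closed-form Savitzky-Golay smoothing weights for a 2nd-order polynomial."""
    if window % 2 == 0:
        raise ValueError("window must be odd")
    half = window // 2
    denom = 4.0 * window * (window ** 2 - 4)
    return [3.0 * (3 * window ** 2 - 7 - 20 * i ** 2) / denom
            for i in range(-half, half + 1)]


def preprocess(wavenumbers, absorbance, lo=900.0, hi=1800.0,
               window=15, ref_band=1650.0):
    """Cut the biofingerprint region, smooth, and normalise to the Amide I band."""
    # 1) keep only the region between `lo` and `hi` (cm^-1)
    pairs = [(w, a) for w, a in zip(wavenumbers, absorbance) if lo <= w <= hi]
    wn = [w for w, _ in pairs]
    ab = [a for _, a in pairs]

    # 2) SG smoothing; edge points are left unsmoothed (assumption)
    weights = savgol_quadratic_weights(window)
    half = window // 2
    smooth = list(ab)
    for k in range(half, len(ab) - half):
        smooth[k] = sum(c * ab[k + j - half] for j, c in enumerate(weights))

    # 3) divide by the absorbance at the point closest to the reference band
    ref_idx = min(range(len(wn)), key=lambda k: abs(wn[k] - ref_band))
    ref = smooth[ref_idx]
    return wn, [a / ref for a in smooth]
```

Calling `preprocess(wavenumbers, absorbance)` on one raw spectrum returns the cut wavenumber axis and a spectrum whose value at the Amide I reference point is 1. Mean-centring across samples would follow as a separate step.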

Samples were divided into training (70%), validation (15%) and test (15%) sets for all classification models by applying the Kennard–Stone (KS) algorithm¹⁹ to the pre-processed spectra. The training set was used in the modelling procedure, the validation set for internal model optimisation, and the test set was only used in the final classification evaluation. Initially, the data were analysed by principal component analysis (PCA). Each PC is composed of scores (variance in sample direction) and loadings (variance in wavenumber direction), where the scores are used to assess similarities/dissimilarities between the samples, and the loadings show the weight of each wavenumber towards the scores pattern. The PCA decomposition of a spectral dataset $\mathbf{X}$ takes the following form:

$$ \mathbf{X} = \mathbf{T}\mathbf{P}^{T} + \mathbf{E} $$

(1)

where $\mathbf{T}$ is the scores matrix; $\mathbf{P}$ is the loadings matrix; and $\mathbf{E}$ is the residual matrix. The PCA scores were used for exploratory analysis of the data, and as input data for supervised classification models: linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), and support vector machines (SVM).
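To make the scores/loadings decomposition concrete, the sketch below extracts only the first principal component of a small data matrix. It uses power iteration on the covariance matrix, which is swapped in here for simplicity and is not claimed to be the routine used in the study; the function name `first_pc` and the iteration count are assumptions.

```python
import math


def first_pc(X, n_iter=200):
    """Scores, loadings and explained-variance fraction of the first PC."""
    n, p = len(X), len(X[0])
    # mean-centre each column (variable)
    means = [sum(row[j] for row in X) / n for j in range(p)]
    Xc = [[row[j] - means[j] for j in range(p)] for row in X]
    # covariance matrix C = Xc^T Xc / (n - 1)
    C = [[sum(r[a] * r[b] for r in Xc) / (n - 1) for b in range(p)]
         for a in range(p)]
    # power iteration converges to the dominant eigenvector (the loadings)
    v = [1.0] * p
    for _ in range(n_iter):
        w = [sum(C[a][b] * v[b] for b in range(p)) for a in range(p)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # scores are the projections of the centred samples onto the loadings
    scores = [sum(r[j] * v[j] for j in range(p)) for r in Xc]
    eigenvalue = sum(v[a] * sum(C[a][b] * v[b] for b in range(p))
                     for a in range(p))
    explained = eigenvalue / sum(C[j][j] for j in range(p))
    return scores, v, explained
```

For a spectral dataset, each row of `X` would be one pre-processed spectrum and each column one wavenumber. The outer product of the returned scores and loadings gives the rank-one part of $\mathbf{T}\mathbf{P}^{T}$, and what remains of the centred data is the residual.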

In addition to PCA, the spectral dataset was reduced to a few spectral features by feature selection methods: genetic algorithm (GA) and successive projections algorithm (SPA). These were coupled to LDA, QDA and SVM for classification, and their performances were compared with the PCA-based approaches. GA²⁰ is a type of variable selection algorithm that performs this task by mimicking the evolution process, recombining and promoting mutations in different subsets of variables until a determined fitness criterion is reached. The goal of this algorithm is to reduce the total number of variables without changing the type of variable, as occurs when using data reduction via PCA. In this case, GA was used with 100 generations and 200 chromosomes each, and mutation and crossover probabilities were set to 10% and 60%, respectively. SPA²¹ also works by reducing the pre-processed spectral data to a low number of variables while maintaining the original spectral information. It is an iterative process that projects the spectral variables and selects those which minimise the data collinearity. The optimum number of variables for SPA and GA was determined by the minimum of the cost function G calculated for the validation set as follows¹⁰:

$$ G = \frac{1}{N_{V}} \sum_{n = 1}^{N_{V}} g_{n} $$

(2)

where $N_{V}$ is the number of validation samples and $g_{n}$ is defined as:

$$ g_{n} = \frac{r^{2}\left(x_{n}, m_{I(n)}\right)}{\min_{I(m) \ne I(n)} r^{2}\left(x_{n}, m_{I(m)}\right)} $$

(3)

where $r^{2}\left(x_{n}, m_{I(n)}\right)$ is the squared Mahalanobis distance between the object $x_{n}$ (of class $I(n)$) and the centre of its true class ($m_{I(n)}$), and $r^{2}\left(x_{n}, m_{I(m)}\right)$ is the squared Mahalanobis distance between the object $x_{n}$ and the centre of the closest wrong class ($m_{I(m)}$).
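The cost function in Eqs. (2) and (3) can be sketched as follows, assuming the squared Mahalanobis distances from each validation sample to every class centre have already been computed. The function name and the input layout (a list of pairs of true class label and distance dictionary) are hypothetical and chosen only for illustration.

```python
def cost_function_g(validation):
    """Average risk G over validation samples.

    `validation` is a list of (true_class, r2) pairs, where r2 maps each
    class label to the squared Mahalanobis distance between the sample
    and that class centre (assumed precomputed).
    """
    g_values = []
    for true_class, r2 in validation:
        r2_true = r2[true_class]
        # distance to the closest wrong class
        r2_wrong = min(d for label, d in r2.items() if label != true_class)
        g_values.append(r2_true / r2_wrong)
    return sum(g_values) / len(g_values)
```

A sample lying much closer to its own class centre than to any other gives a small $g_{n}$, so a variable subset that yields a lower G separates the validation samples better.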

Like the PCA scores, the wavenumbers selected by GA and SPA were used as input variables for LDA, QDA and SVM. LDA and QDA are discriminant analysis algorithms based on a Mahalanobis distance calculation between the classes. LDA assumes that the classes have similar variance structures and therefore uses a pooled covariance matrix for the distance calculation, while QDA assumes that the classes have different variance structures and therefore uses the individual variance–covariance matrix of each class²². SVM is a linear classification algorithm that uses a non-linear step called the kernel transformation²³. The kernel function (in this case, the radial basis function (RBF)) transforms the input spectral data into a feature space in which the margin of separation between the classes is maximised. Although more powerful than LDA or QDA for classification, SVM is more susceptible to overfitting²⁴.
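The difference between pooled and per-class variance can be seen in a deliberately simplified sketch on a single variable, where the squared Mahalanobis distance reduces to a squared difference divided by a variance. This is an illustration under stated assumptions, not the full LDA/QDA used in the study: the log-determinant and prior-probability terms are omitted, and the function names and toy values are hypothetical.

```python
from statistics import mean, variance


def squared_distances(x, classes, pooled):
    """Squared Mahalanobis distance from x to each class centre (one variable).

    `classes` maps a class label to its list of training values.
    pooled=True mimics the LDA assumption, pooled=False the QDA assumption.
    """
    means = {k: mean(v) for k, v in classes.items()}
    variances = {k: variance(v) for k, v in classes.items()}  # sample variance
    if pooled:
        dof = sum(len(v) - 1 for v in classes.values())
        s2 = sum((len(v) - 1) * variances[k] for k, v in classes.items()) / dof
        variances = {k: s2 for k in classes}
    return {k: (x - means[k]) ** 2 / variances[k] for k in classes}


def classify(x, classes, pooled):
    """Assign x to the class with the smallest squared distance."""
    d = squared_distances(x, classes, pooled)
    return min(d, key=d.get)
```

With a tight class `[1.0, 2.0, 3.0]` and a spread-out class `[6.0, 8.0, 10.0]`, the value 4.5 is assigned to the first class under the pooled variance and to the second under per-class variances, which is the kind of disagreement between the two assumptions that the paragraph describes.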

Model quality evaluation

Model accuracy, sensitivity and specificity were calculated for the test set in order to evaluate the classification performance and validate the models. The accuracy (AC) represents the proportion of samples correctly classified; the sensitivity (SENS) and specificity (SPEC) measure the proportion of positives and negatives that are correctly identified, respectively. These metrics are calculated as follows²⁵:

$$ {\text{AC }}\left( {\text{\% }} \right) = \left( {\frac{{{\text{TP}} + {\text{TN}}}}{{{\text{TP}} + {\text{FP}} + {\text{TN}} + {\text{FN}}}}} \right) \times 100 $$

(4)

$$ {\text{SENS }}\left( {\text{\% }} \right) = \left( {\frac{{{\text{TP}}}}{{{\text{TP}} + {\text{FN}}}}} \right) \times 100 $$

(5)

$$ {\text{SPEC }}\left( {\text{\% }} \right) = \left( {\frac{{{\text{TN}}}}{{{\text{TN}} + {\text{FP}}}}} \right) \times 100 $$

(6)

where TP stands for true positive; TN for true negative; FP for false positive; and FN for false negative.
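Eqs. (4) to (6) translate directly into code. The sketch below counts TP, TN, FP and FN from lists of true and predicted labels and returns the three figures of merit as percentages; the function name and the label strings are assumptions, with the GDM class taken as the positive class.

```python
def classification_metrics(y_true, y_pred, positive="GDM"):
    """Return accuracy, sensitivity and specificity in percent."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    accuracy = 100.0 * (tp + tn) / (tp + fp + tn + fn)
    sensitivity = 100.0 * tp / (tp + fn)
    specificity = 100.0 * tn / (tn + fp)
    return accuracy, sensitivity, specificity
```

The function expects at least one positive and one negative sample in `y_true`; otherwise one of the denominators is zero.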

Results

ATR-FTIR is considered a valuable tool capable of analysing different types of diseases by measuring biologically derived samples. Therefore, we used this technique and evaluated the specificity, sensitivity and accuracy with which it differentiates the GDM group.

The raw ATR-FTIR mean spectra of GDM vs. healthy pregnancy control groups are shown in Fig. 1A. The data set consists of 100 samples of blood plasma, 50 from the GDM group and 50 from the healthy pregnancy control group. Three spectra were acquired for each sample, giving a total of 300 spectra. In the region of interest between 1800 and 900 cm⁻¹, known as the biofingerprint region, some characteristic IR absorption bands can be observed in the spectra²⁶, such as the major peaks at ~ 1650 cm⁻¹ for Amide I of proteins, as well as methylene groups of lipids at ~ 1750 cm⁻¹.

The spectral data were pre-processed by Savitzky–Golay smoothing, baseline correction and normalisation to the Amide I band (~ 1650 cm⁻¹) (Fig. 1B). The spectra present strong similarity related to absorption bands, in addition to being highly overlapped, in a way that it becomes difficult to categorise samples only considering the visual spectral information available. In this sense, application of multivariate algorithms is an essential strategy to extract important spectral information, allowing for the discrimination between samples of GDM vs. healthy pregnancy control groups based on their pathophysiological condition reflected in the spectral features. Furthermore, variable selection algorithms are powerful tools used to search for biomarkers in blood plasma, allowing less complex models to be obtained.

To predict whether pregnant women are affected by GDM, it is necessary to use chemometric models capable of finding spectral features that differentiate GDM spectra from the healthy pregnancy control group spectra. Initially, a PCA model was built for exploratory analysis of the data, as shown in Fig. 2. Three principal components (PCs) were used, accounting for > 90% of cumulative explained variance.

The PC1 (68.18% explained variance) vs. PC2 (16.56% explained variance) scores plot (Fig. 2A) and the PC1 (68.18% explained variance) vs. PC3 (7.16% explained variance) scores plot (Fig. 2B) show some visual distinction between GDM and healthy pregnancy control groups, while the PC2 (16.56% explained variance) vs. PC3 (7.16% explained variance) scores plot (Fig. 2C) was better able to differentiate the sample groups, showing that a low percentage of spectral variance is responsible for class separation.
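As a quick arithmetic check on the explained variances quoted for the three PCs, the cumulative value is:

$$ 68.18\% + 16.56\% + 7.16\% = 91.90\% $$

which is consistent with the stated > 90% of cumulative explained variance. The two components that give the clearest separation together account for:

$$ 16.56\% + 7.16\% = 23.72\% $$

that is, less than a quarter of the total spectral variance.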

The PCA loadings are shown in Fig. 2D; the spectral features with higher absolute coefficients are those responsible for the segregation pattern observed in the PCA scores plots. PC1 and PC2 show very similar loading profiles, with many overlapping bands between 900 and 1500 cm⁻¹, and a mirroring profile between 1500 and 1700 cm⁻¹, while PC3 shows a loading profile quite distinct from those of PC1 and PC2.

Supervised classification models were built for systematic discrimination of GDM and healthy pregnancy control groups. For this, the pre-processed spectral data were split into training (70%), validation (15%) and test (15%) sets using the Kennard–Stone (KS) uniform sample selection algorithm. Several classification algorithms were tested (Table 1), and figures of merit were calculated for the test set: accuracy (AC) (percentage of total correct classification), sensitivity (SENS) (percentage of correct classification for the GDM group), and specificity (SPEC) (percentage of correct classification for the healthy pregnancy control group). The genetic algorithm linear discriminant analysis (GA-LDA) model achieved the best classification results, with 100% accuracy, sensitivity and specificity for the test set. The GA-LDA Fisher's discriminant scores show an almost complete separation for all samples (training, validation and test sets) (Fig. 3A), and a perfect separation for the test samples (Fig. 3B). GA-LDA selected 10 spectral wavenumbers responsible for group differentiation, principally associated with the regions for water (901; 1047 cm⁻¹) and the lipid/protein regions (1462; 1539; 1560; 1582; 1645; 1661; 1693; 1747 cm⁻¹) (Fig. 3C). The tentative biochemical assignments of these variables based on Movasaghi et al.²⁶ are shown in Table 2.
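The Kennard–Stone selection used for the split above can be sketched in a few lines. This is a minimal pure-Python version assuming Euclidean distances between samples; the function name `kennard_stone` is hypothetical, and how the samples left over after selecting the training set were divided between validation and test sets is not specified in the text, so that step is not shown.

```python
import math


def kennard_stone(X, n_select):
    """Return the indices of n_select samples chosen by Kennard-Stone."""
    n = len(X)
    if not 2 <= n_select <= n:
        raise ValueError("n_select must be between 2 and the number of samples")
    dist = [[math.dist(a, b) for b in X] for a in X]
    # start with the two samples that are farthest apart
    i0, j0 = max(((i, j) for i in range(n) for j in range(i + 1, n)),
                 key=lambda ij: dist[ij[0]][ij[1]])
    selected = [i0, j0]
    remaining = [k for k in range(n) if k not in (i0, j0)]
    while len(selected) < n_select:
        # add the candidate whose nearest selected sample is farthest away
        nxt = max(remaining, key=lambda k: min(dist[k][s] for s in selected))
        selected.append(nxt)
        remaining.remove(nxt)
    return selected
```

Because each new sample is the one least well represented by those already chosen, the selected set covers the measured spectral space uniformly, which is why the algorithm is described as a uniform sample selection method.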

Table 1 Quality parameters for the test set.

Table 2 Selected wavenumbers by the GA-LDA to distinguish GDM and controls samples.

Regarding the characteristics of both groups, the present study found some differences in demographic, clinical and obstetric data, as shown in Table 3. Most pregnant women with GDM were older and had previous pregnancies when compared to the healthy pregnancy control group (p < 0.05). Fasting blood glucose in the GDM group differed significantly from that of the healthy pregnancy control group (p < 0.05). The mean BMI of the GDM group was higher (30.78 ± 5.00) than that of the healthy pregnancy control group (28.24 ± 4.09), and women in the GDM group were obese or overweight (p < 0.05).

Table 3 Demographic factors, clinical and obstetric history of pregnant women with and without diagnosis of GDM.

Discussion

The development of a novel tool for the diagnosis of different diseases is extremely important, principally when they affect women during pregnancy, as is the case with GDM which is capable of harming both the mother and the fetus.

ATR-FTIR is considered a powerful tool, as it analyzes different biological structures based on spectral analysis, proving to be of great use in clinical health care and opening future perspectives through technological advances¹¹.

In our study, blood plasma from 100 pregnant women (50 GDM and 50 healthy controls) was analyzed by ATR-FTIR spectroscopy in order to predict GDM from the samples' spectrochemical profile. Our data showed that the unsupervised PCA model revealed a discriminating pattern between the groups, with the best separation in the PC2 vs. PC3 scores. In PC3, the main difference is the amount of protein versus water. The negative loading appears around 1635 cm⁻¹ (water band). This appears oppositely correlated with the Amide II band, indicating a difference in the protein/water ratio between the two groups. PC2 and PC3 show a large difference in scores between the sample groups, indicating that their respective loadings can be used to identify spectral markers associated with class differences. The spectral region around 1640 cm⁻¹, near the water band, showed one of the highest absolute loadings, indicating that water is a discriminating feature between the samples. In a related study, Caixeta et al.²⁷, when analyzing saliva samples from male Wistar rats that were diabetic (treated with insulin), pre-diabetic or healthy, demonstrated the applicability of ATR-FTIR associated with PCA-LDA using six PCs, showing the effectiveness of mathematical algorithms in monitoring DM. Moreover, in a recent study analyzing peripheral blood samples from pre-diabetic patients, ATR-FTIR and PCA combined with eXtreme Gradient Boosting (XGBoost), in a model termed SG-PCA-XGBoost, responded to glucose levels and differentiated these patients from healthy people¹⁸.

When we compared different supervised models, GA-LDA was the best classification model, systematically distinguishing GDM samples from controls. GA-LDA is a powerful feature selection algorithm based on iterative combinations inspired by Mendelian genetics, where the fittest variables (wavenumbers) that maximize class separation are selected¹³. It commonly outperforms feature extraction methods such as PCA²⁸. There are few studies that address the use of ATR-FTIR in diabetes, and fewer still in GDM. To date, no study has analyzed blood plasma samples from pregnant women with GDM using GA-LDA models. This demonstrates the novelty of this model in the prediction of GDM, and confirms that GA-LDA is an excellent classification algorithm for blood plasma samples from pregnant women, which could play a fundamental role during prenatal care by assisting in diagnosis and monitoring.

Although many studies on the pathophysiology of GDM have been conducted, the potential of biomarkers in its development remains unclear. In our study it was possible to verify that the wavenumbers selected by GA-LDA were responsible for group separation, according to the biomolecule regions referring to lipid and protein/water ratio. This information, combined with the GA-LDA selected wavenumbers at 1046 cm⁻¹, 1537 cm⁻¹ and 1640 cm⁻¹, indicates that some relation between water and protein levels is a discriminant factor between the groups.

However, GDM emerges as an insulin-related disorder in which the relevant metabolomic pathways involve lipid and amino acid metabolism, as well as bile acids and abnormal protein turnover²⁹. Protein oxidation intensifies during GDM, as the hyperglycemic state gives rise to protein hydroperoxides, protein carbonyls, C-reactive protein and glycated hemoglobin (HbA1c). In addition, it is considered an important mediator of adipocyte disorders, intensifying the inflammatory response and contributing to the complications of diabetes³⁰.

To reinforce our data and assessment of the associated factors with GDM, we can observe that there is an increase in BMI, one of the precursors for insulin resistance, since during obesity there is an increase in lipids and there is the release of inflammatory cytokines. In addition, we emphasize that maternal age and obesity are factors that can directly interfere with pregnancy, contributing to the development of GDM.

Conclusions

According to the results of the present study, blood plasma samples from pregnant women with GDM could rapidly be differentiated from our healthy pregnant control group based on their FTIR spectra. A chemometric model based on the GA-LDA algorithm was able to distinguish between the GDM and healthy pregnant control groups with 100% accuracy, sensitivity and specificity in an external test set.

References

Giannakou, K. et al. Risk factors for gestational diabetes: An umbrella review of meta-analyses of observational studies. PLoS ONE 14, e0215372. https://doi.org/10.1371/journal.pone.0215372 (2019).
Sifnaios, E. et al. Gestational diabetes and T-cell (Th1/Th2/Th17/Treg) immune profile. In Vivo 33, 31–40. https://doi.org/10.21873/invivo.11435 (2019).
American Diabetes Association. Classification and diagnosis of diabetes: Standards of medical care in diabetes-2018. Diabetes Care 41(Supplement 1), S13–S27. https://doi.org/10.2337/dc18-S002 (2018).
Katchunga, P. B. et al. Glycated nail proteins as a new biomarker in management of the South Kivu Congolese diabetics. Biochem. Med. 25(3), 469–473. https://doi.org/10.11613/BM.2015.04 (2015).
Donovan, B. M. et al. Development and validation of a clinical model for preconception and early pregnancy risk prediction of gestational diabetes mellitus in nulliparous women. PLoS ONE 14, e0215173. https://doi.org/10.1371/journal.pone.0215173 (2019).
Yasuda, S. et al. Weight control before and during pregnancy for patients with gestational diabetes mellitus. J. Diabetes Investig. 10, 1075–1082. https://doi.org/10.1111/jdi.12989 (2019).
Kianpour, M., Saadatmand, F., Nematbakhsh, M. & Fahami, F. Relationship between c-reactive protein and screening test results of gestational diabetes in pregnant women referred to health centers in Isfahan in 2013–2014. Iran J. Nurs. Midwifery Res. 24, 360–364. https://doi.org/10.4103/ijnmr.IJNMR_352_14 (2019).
Desmedt, S. et al. Growth differentiation factor 15: A novel biomarker with high clinical potential. Crit. Rev. Clin. Lab. Sci. 56(5), 333–350. https://doi.org/10.1080/10408363.2019.1615034 (2019).
Tang, M. et al. Serum growth differentiation factor 15 is associated with glucose metabolism in the third trimester in Chinese pregnant women. Diabetes Res. Clin. Pract. 156, 107823. https://doi.org/10.1016/j.diabres.2019.107823 (2019).
Nielsen, K. K., O’Reilly, S., Wu, N., Dasgupta, K. & Maindal, H. T. Development of a core outcome set for diabetes after pregnancy prevention interventions (COS-DAP): A study protocol. Trials 19, 708. https://doi.org/10.1186/s13063-018-3072-y (2018).
Kelly, J. G., Trevisan, J., Scott, A. D., Carmichael, P. L. & Pollock, H. M. Biospectroscopy to metabolically profile biomolecular structure: A multistage approach linking computational analysis with biomarkers. J. Proteome Res. 10, 1437–1448. https://doi.org/10.1021/pr101067u (2011).
Morais, C. L. M. et al. Standardization of complex biologically derived spectrochemical datasets. Nat. Protoc. 14, 1546–1577. https://doi.org/10.1038/s41596-019-0150-x (2019).
Theophilou, G. et al. Synchrotron- and focal plane array-based Fourier-transform infrared spectroscopy differentiates the basalis and functionalis epithelial endometrial regions and identifies putative stem cell regions of human endometrial glands. Anal. Bioanal. Chem. 410, 4541–4554. https://doi.org/10.1007/s00216-018-1111-x (2018).
Coopman, R. et al. Glycation in human fingernail clippings using ATR-FTIR spectrometry, a new marker for the diagnosis and monitoring of diabetes mellitus. Clin. Biochem. 50(1–2), 62–67. https://doi.org/10.1016/j.clinbiochem.2016.09.001 (2017).
Siqueira, L. F. S. & Lima, K. M. G. MIR-biospectroscopy coupled with chemometrics in cancer studies. Analyst 141, 4833–4847. https://doi.org/10.1039/C6AN01247G (2016).
Paraskevaidi, M. et al. Differential diagnosis of Alzheimer’s disease using spectrochemical analysis of blood. Proc. Natl. Acad. Sci. U.S.A. 114, E7929–E7938. https://doi.org/10.1073/pnas.1701517114 (2017).
Santos, M. C. D., Morais, C. L. M., Nascimento, Y. M., Araujo, J. M. G. & Lima, K. M. G. Spectroscopy with computational analysis in virological studies: A decade (2006–2016). Trends Anal. Chem. 97, 244–256. https://doi.org/10.1016/j.trac.2017.09.015 (2017).
Yang, X. et al. Pre-diabetes diagnosis based on ATR-FTIR spectroscopy combined with CART and XGBoots. Optik 180, 189–198. https://doi.org/10.1016/j.ijleo.2018.11.059 (2019).
Kennard, R. W. & Stone, L. A. Computer aided design of experiments. Technometrics 11, 137–148. https://doi.org/10.1080/00401706.1969.10490666 (1969).
McCall, J. Genetic algorithms for modelling and optimisation. J. Comput. Appl. Math. 184, 205–222. https://doi.org/10.1016/j.cam.2004.07.034 (2005).
Soares, S. F. C., Gomes, A. A., Araujo, M. C. U., Galvão Filho, A. R. & Galvão, R. K. H. The successive projections algorithm. Trends Anal. Chem. 42, 84–98. https://doi.org/10.1016/j.trac.2012.09.006 (2013).
Morais, C. L. M. & Lima, K. M. G. Principal component analysis with linear and quadratic discriminant analysis for identification of cancer samples based on mass spectrometry. J. Braz. Chem. Soc. 29, 472–481. https://doi.org/10.21577/0103-5053.20170159 (2018).
Cortes, C. & Vapnik, V. Support-vector networks. Mach. Learn. 20, 273–297. https://doi.org/10.1007/BF00994018 (1995).
Morais, C. L. M., Lima, K. M. G. & Martin, F. L. Uncertainty estimation and misclassification probability for classification models based on discriminant analysis and support vector machines. Anal. Chim. Acta 1063, 40–46. https://doi.org/10.1016/j.aca.2018.09.022 (2019).
Morais, C. L. M. & Lima, K. M. G. Comparing unfolded and two-dimensional discriminant analysis and support vector machines for classification of EEM data. Chemometr. Intell. Lab. Syst. 170, 1–12. https://doi.org/10.1016/j.chemolab.2017.09.001 (2017).
Movasaghi, Z., Rehman, S. & Rehman, I. U. Fourier Transform Infrared (FTIR) spectroscopy of biological tissues. Appl. Spectrosc. Rev. 43, 134–179. https://doi.org/10.1080/05704920701829043 (2008).
Caixeta, D. C. et al. Salivary molecular spectroscopy: A sustainable, rapid and non-invasive monitoring tool for diabetes mellitus during insulin treatment. PLoS ONE 15(3), e0223461. https://doi.org/10.1371/journal.pone.0223461 (2020).
Siqueira, L. F. S., Araújo Júnior, R. F., de Araújo, A. A., Morais, C. L. M. & Lima, K. M. G. LDA vs. QDA for FT-MIR prostate cancer tissue classification. Chemometr. Intell. Lab. Syst. 162, 123–129. https://doi.org/10.1016/j.chemolab.2017.01.021 (2017).
Huynh, J., Xiong, G. & Bentley-Lewis, R. A systematic review of metabolite profiling in gestational diabetes mellitus. Diabetologia 57, 2453–2464. https://doi.org/10.1007/s00125-014-3371-0 (2014).
Urbaniak, S. K., Boguszewska, K., Szewczuk, M., Kaźmierczak-Barańska, J. & Karwowski, B. T. 8-Oxo-7,8-dihydro-2’-deoxyguanosine (8-oxodG) and 8-hydroxy-2’-deoxyguanosine (8-OHdG) as a potential biomarker for gestational diabetes mellitus (GDM) development. Molecules (Basel, Switzerland) 25(1), 202. https://doi.org/10.3390/molecules25010202 (2020).


Acknowledgements

The authors would like to thank the pregnant women who participated in the study, the Januário Cicco Maternity School and Divine Motherhood Love, the Federal University of Rio Grande do Norte, the Post-Graduate Program in Technological Development and Innovation in Medicines (PPGDITM/UFRN), the Post-Graduate Program in Chemistry (PPGQ/UFRN), and the Laboratory of Biological Chemistry and Chemometrics of the Institute of Chemistry. Emanuelly Bernardes-Oliveira and Daniel Lucas Dantas de Freitas would like to thank CAPES (Brazil) for their research grants.

Author information

Authors and Affiliations

Post-Graduate Program in Technological Development and Innovation in Medicines, Federal University of Rio Grande do Norte, Natal, RN, 59072-970, Brazil
Emanuelly Bernardes-Oliveira & Janaina Cristiana de Oliveira Crispim
Biological Chemistry and Chemometrics, Institute of Chemistry, Federal University of Rio Grande do Norte, Natal, RN, 59072-970, Brazil
Daniel Lucas Dantas de Freitas & Kassio Michell Gomes de Lima
Lancashire Teaching Hospitals NHS Trust, Royal Preston Hospital, Fulwood, Preston, PR2 9HT, UK
Camilo de Lelis Medeiros de Morais
School of Pharmacy and Biomedical Sciences, University of Central Lancashire, Preston, PR1 2HE, UK
Camilo de Lelis Medeiros de Morais
Januario Cicco Maternity School, Federal University of Rio Grande do Norte, Natal, RN, 59072-970, Brazil
Maria da Conceição de Mesquita Cornetta, Juliana Dantas de Araújo Santos Camargo & Janaina Cristiana de Oliveira Crispim

Authors

Emanuelly Bernardes-Oliveira
Daniel Lucas Dantas de Freitas
Camilo de Lelis Medeiros de Morais
Maria da Conceição de Mesquita Cornetta
Juliana Dantas de Araújo Santos Camargo
Kassio Michell Gomes de Lima
Janaina Cristiana de Oliveira Crispim

Contributions

E.B.O. and D.L.D.F. designed the experiments. E.B.O. and M.C.M.C. contributed to the collection of biological samples. K.M.G.L. and J.C.O.C. analyzed the data and contributed reagents, materials, and/or analysis tools. E.B.O. and D.L.D.F. contributed to manuscript preparation. K.M.G.L., C.L.M.M. and J.C.O.C. refined the manuscript for publication. J.D.A.S.C. contributed to data analysis. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Emanuelly Bernardes-Oliveira or Janaina Cristiana de Oliveira Crispim.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Bernardes-Oliveira, E., de Freitas, D.L.D., de Morais, C.d. et al. Spectrochemical differentiation in gestational diabetes mellitus based on attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy and multivariate analysis. Sci Rep 10, 19259 (2020). https://doi.org/10.1038/s41598-020-75539-y

Received: 05 June 2020
Accepted: 30 September 2020
Published: 06 November 2020
Version of record: 06 November 2020
DOI: https://doi.org/10.1038/s41598-020-75539-y
