Introduction

Celiac disease (CD) is an immune-mediated disease characterized by small bowel enteropathy triggered by dietary exposure to gluten. The classic clinical presentation of CD includes signs of malabsorption and gastrointestinal symptoms such as abdominal pain, bloating and diarrhea1,2. However, many patients experience predominantly non-specific extra-intestinal symptomatology3,4, or are asymptomatic5. The diverse manifestations of CD can make it challenging for primary care providers (PCPs) to identify and diagnose CD in the general population6. Indeed, a majority of adult patients with CD today are likely undiagnosed5,6,7,8. Patients who eventually receive a diagnosis experience a mean delay of eleven years from symptom onset to diagnosis, with more than half reporting a delay of five or more years until the diagnosis is established6. CD has an estimated global prevalence of over 1%9, and a rising incidence in recent years6,10,11,12. Diagnostic delays and underdiagnosis of CD therefore represent an important healthcare problem13.

Screening for CD is typically performed by highly sensitive and widely available serum tests for CD autoimmunity (CDA): the most accurate and commonly used being the assay for antibodies to tissue-transglutaminase (tTG-IgA)14. Seropositivity is suggestive of underlying CD, and such patients typically undergo endoscopic evaluation to establish or rule out the diagnosis of CD via biopsy of intestinal mucosa15. High seropositivity (tTG-IgA > 10X ULN) is associated with a high (> 95%) positive predictive value (PPV) for villous atrophy and, in the proper clinical setting, is considered sufficient for diagnosis without a biopsy in children and possibly in adults7,14,15,16.

Clinical guidelines do not provide clear and consistent recommendations on when to screen for CDA. General population screening is not currently recommended17, although screening high-risk patients may be warranted18,19. What defines high-risk groups for CD varies somewhat between reports, although the focus has typically been on a family history of CD and medical comorbidities with established associations with CD20. Beyond these factors, laboratory abnormalities such as iron-deficiency anemia are common among patients with CD21. While PCPs may be aware of specific risk factors, signs and symptoms of undiagnosed CD, more subtle combinations of clinical features may go unrecognized.

Machine learning (ML) algorithms have the potential to use existing data within a patient’s electronic medical record (EMR) to provide risk assessments to providers22. ML models have been developed to alert intensive care unit physicians to patients at risk of circulatory failure23, to identify clinically significant portal hypertension in non-alcoholic steatohepatitis patients from pathology reports24, to predict incident hypertension25 and hypertension outcomes26, to identify patients with undiagnosed psoriatic arthritis27 or hepatitis C28, to predict dementia onset29, IgA nephropathy30 and future Parkinson’s disease diagnoses31, and to flag patients at risk of advanced colorectal cancer32. One previous study that attempted to develop ML models to identify patients with incident CD using a variety of modeling methods found that the models were not consistently better than chance33. Another study showed positive results, but the study size was small and the models relied on symptoms extracted from unstructured clinical documents and diagnostic codes34. Neither study included objective laboratory test results as predictive input features, which may have hindered performance and limited generalizability.

The goal of the current study was to develop and assess a prescreening EMR-based tool to classify adult and adolescent patients by risk of having unidentified CD autoimmunity using commonly available clinical features. Five algorithms were trained and tested: logistic regression, decision tree, random forest, XGBoost and multilayer perceptron. Each algorithm was then assessed on discriminative ability as measured by the area under the receiver operating characteristic curve (AUC). Input features included age, biological sex and results from commonly available laboratory tests performed as part of complete blood counts and comprehensive metabolic, iron and lipid panels. Incident cases were identified from a large retrospective community-based dataset using results from tTG-IgA testing. Highly seropositive cases (tTG-IgA > 10X ULN) with probable underlying CD were used for model training and evaluation against cohorts of controls with no evidence of disease. Performance was additionally assessed for the highest-performing model in a test set consisting of a cohort of seropositive cases (tTG-IgA > 2X ULN) who may require endoscopic evaluation for CD and a cohort of controls. In both test sets, AUC was assessed at multiple time points before the first documented evidence of CD autoimmunity.

Methods

Dataset

The dataset for this retrospective study consisted of deidentified EMR data from Maccabi Health Services (MHS), Israel’s second-largest health maintenance organization (HMO)35. Data were accessed via the Kahn Sagol Maccabi Research and Innovation Centre (KSM) and extracted using the MDClone platform (version 5.5.0.4; https://www.mdclone.com/), a proprietary software. The dataset was de-identified by KSM, and no personal identifying information was made available to the researchers. The dataset contains records from 2,963,864 unique patients with longitudinal data, as members rarely change HMOs. Unstructured data, including progress notes and pathology or procedure reports, were not made available by KSM. The study therefore relied entirely on the structured data, specifically patient demographics and laboratory results. Approval for use of the dataset and the retrospective analysis was obtained from the Maccabi institutional review board (approval #0052-20-MHS), and the study was conducted in accordance with the Declaration of Helsinki and all relevant guidelines and regulations. Informed consent was waived as all identifying information had been removed by KSM.

Study cohort definitions

Patients were eligible for inclusion if they (1) were MHS members during the study period (2005–2021), and (2) joined MHS before 2005. The eligible population was randomly split into train (80%) and test (20%) sets. The split was performed by assigning patients whose randomly generated hash ID had 2 or 6 as its third-to-last digit to the test set, and all others to the train set.
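For illustration, a minimal sketch of this hash-based split logic is given below; the column name "hash_id" and its string format are assumptions, as the actual field names in the MDClone extract were not described.

```python
# Minimal sketch of the 80/20 split described above (assumed column name "hash_id").
import pandas as pd

def assign_split(patients: pd.DataFrame) -> pd.DataFrame:
    """Assign 'test' if the third-to-last digit of the hash ID is 2 or 6, else 'train'."""
    third_to_last = patients["hash_id"].astype(str).str[-3]
    patients = patients.copy()
    patients["split"] = third_to_last.isin(["2", "6"]).map({True: "test", False: "train"})
    return patients

# Toy example: two of ten possible digits (2 and 6) map to test, giving the ~20% share.
print(assign_split(pd.DataFrame({"hash_id": ["0012345", "0098261", "0000167"]})))
```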

In this study we distinguish between seropositive cases in general and highly seropositive cases with likely underlying CD. Cohorts of highly seropositive cases were selected in both train and test sets, and an additional cohort of seropositive cases was identified using case identification criteria (CIC) described below.

Highly seropositive cases were defined as patients having at least one documented tTG-IgA test ≥ 10X ULN, which has an extremely high (> 95%) PPV for duodenal biopsy proven CD7,36,37,38. CIC for seropositive cases included all patients with at least one documented tTG-IgA > 2X ULN. This definition is more sensitive for underlying CD, but also includes a higher proportion of cases that would not have pathologic evidence of disease7. Reports of procedures including endoscopic evaluations were not available to researchers, so pathologic evidence of CD could not be confirmed or ruled out.

To reduce the possibility of including previously diagnosed cases, cases were excluded if they had a history of tTG-IgA levels within normal limits before their first positive tTG-IgA, or a CD diagnosis code more than one year before their first positive tTG-IgA. Providers may order serology on patients with known CD to check compliance with a gluten-free diet (GFD), so such cases may have been previously diagnosed and therefore not incident cases relevant to this study. For each seropositive case, the earlier of the first positive serology or the first diagnosis of CD was defined as that patient’s index date.

Screening serology was performed with one of two commercial kits used during the study period: Celiakey (Thermo Fisher Scientific, USA) for the years 2005–2011 and Elia (Thermo Fisher Scientific, USA) for the years 2012–2021, with manufacturer-established ULN values of 5 U/mL and 7 U/mL, respectively.

Controls were identified as eligible MHS patients with no documented CD diagnosis code and no serologic evidence of CD autoimmunity. One cohort of controls was selected to match the train-set cases, and two cohorts of controls in the test set were selected to match the two cohorts of cases (highly seropositive and seropositive). Controls were matched to cases by years of data availability at the maximal possible ratio of controls to cases. Controls were assigned the same index dates as the cases to which they were matched. Controls were not matched by demographic characteristics, to allow the model to learn the relationship between these features and incident CD seropositivity.

Each patient included in the train or test sets was additionally assigned a run date, defined as the first of July of the calendar year preceding the patient’s index date. This gap between run date and index date is referred to as the one-year gap, reflecting the mean time between run dates and index dates across the cohort. The gap between the run date and the index date was added to account for potential clinical suspicion of CD immediately preceding the index date. Analyses were also conducted at further time gaps of up to four years prior to index dates to test the ability of the model to identify patients years before initial suspicion of disease. This methodology was described in detail in a previous report on patients with psoriatic arthritis27. Patients were additionally excluded if on their run date they did not meet the following criteria: (i) age ≥ 12 and ≤ 85 years, (ii) MHS membership for at least four years, and (iii) at least one complete blood count (CBC) during the four years prior to their index date. The patient selection process is depicted in Fig. 1.
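A minimal sketch of the run-date assignment is shown below, assuming the index date is available as a Python date; extending the calculation to the wider two- to four-year gaps by shifting the run date back additional whole years is an assumption about the mechanics, shown only for illustration.

```python
# Sketch of run-date assignment: 1 July of the calendar year preceding the index date.
# Extension to wider gaps (shifting back additional whole years) is an assumption.
from datetime import date

def run_date(index_date: date, gap_years: int = 1) -> date:
    return date(index_date.year - gap_years, 7, 1)

print(run_date(date(2018, 3, 15)))               # 2017-07-01 (one-year gap)
print(run_date(date(2018, 3, 15), gap_years=4))  # 2014-07-01 (four-year gap)
```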

Fig. 1
figure 1

Flow chart of patient selection. Highly seropositive (tTG-IgA > 10X ULN) cases with no previous evidence of celiac disease seropositivity were identified and excluded if at their index date they (i) were younger than 12 or older than 85 years, (ii) had been members of MHS for fewer than four years, or (iii) had no documented complete blood count (CBC) during the four years prior to their index date. Controls were matched to cases by years of data availability at the assigned run date where they met eligibility criteria.

Model development

Five candidate models were developed to classify patients as at-risk or not at-risk for having undiagnosed CD autoimmunity. Each model was fit to the training data using one of the following algorithms: logistic regression, decision tree, random forest, XGBoost and multilayer perceptron. Logistic regression estimates the log-odds of an outcome as a linear combination of weighted input features. The estimate can then be converted into a probability using the logistic (inverse logit) function, which can be used for binary classification given a predefined cutoff. A decision tree performs classification by recursively partitioning the data based on the input features. Random forest is an ensemble method that fits multiple decision trees and performs classification based on the majority vote of the trees39,40. XGBoost builds multiple decision trees sequentially, using gradient boosting to minimize the errors made by previous trees26,39,40,41. A multilayer perceptron is a type of artificial neural network that consists of an input layer, one or more hidden layers and an output layer40. Neurons of the input layer represent the input features, which activate neurons in the hidden layers to produce an output from the output layer. This output can then be converted into a classification. The models were trained and evaluated using data available during the three years prior to the run date. Training was performed at a one-year time gap between run dates and index dates. All models were implemented using the scikit-learn library v1.342 and the scikit-learn compatible XGBoost library.
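A minimal sketch of instantiating the five candidate classifiers with scikit-learn and the scikit-learn compatible XGBoost API is shown below; the hyperparameter values are placeholders, since the actual values were selected by cross-validation (Supplementary Table 2).

```python
# Sketch of the five candidate classifiers named above (placeholder hyperparameters).
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.neural_network import MLPClassifier
from xgboost import XGBClassifier

candidate_models = {
    "logistic_regression": LogisticRegression(max_iter=1000),
    "decision_tree": DecisionTreeClassifier(),
    "random_forest": RandomForestClassifier(n_estimators=200),
    "xgboost": XGBClassifier(eval_metric="logloss"),
    "multilayer_perceptron": MLPClassifier(hidden_layer_sizes=(64,)),
}

# X_train, y_train are the feature matrix and case/control labels (not shown here).
# for name, model in candidate_models.items():
#     model.fit(X_train, y_train)
```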

Candidate independent predictors consisted of basic demographic information (biological sex and age at run date) and laboratory test results extracted from the structured data (Supplementary Table 1). Laboratory features included all individual components of the comprehensive metabolic panel and complete blood count with differential, as well as ferritin and high-density lipoprotein (HDL), as patients with celiac disease often have iron-deficiency anemia4 and low HDL43. The most recent available laboratory result of each type was log-transformed by sex and age group. Missing data were given a special indication and the models treated these as null values. For the logistic regression, null values were replaced by median values for the feature, calculated by biological sex and age group. SHapley Additive exPlanations (SHAP) values were used both for model selection and to explain the output of the XGBoost model44.
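The sketch below illustrates this feature preparation, under the assumption of a per-patient table with one column per most-recent lab value plus "sex" and "age_group" columns; the exact form of the sex- and age-stratified log transformation was not fully specified, so a simple log1p is used as a stand-in.

```python
# Sketch of feature preparation: log-transform the most recent lab values and, for the
# logistic regression only, impute missing values with medians computed within each
# sex and age group. Column names ("sex", "age_group") are illustrative assumptions.
import numpy as np
import pandas as pd

def prepare_features(df: pd.DataFrame, lab_cols: list, impute_for_lr: bool) -> pd.DataFrame:
    out = df.copy()
    out[lab_cols] = np.log1p(out[lab_cols])          # placeholder log transform
    if impute_for_lr:                                 # logistic regression cannot use NaN directly
        for col in lab_cols:
            group_median = out.groupby(["sex", "age_group"])[col].transform("median")
            out[col] = out[col].fillna(group_median)
    return out                                        # other models receive NaN as the missing indication
```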

Models were developed to provide each patient with an output score given the patient’s input features. At a given threshold, the score can be converted into a binary classification of positive (at-risk for undiagnosed CD) or negative (not at-risk for undiagnosed CD). The predicted classes assigned to each patient are then evaluated against the ground-truth labels established by the CIC as described above (i.e., incident case of CD autoimmunity or control with no documented evidence of CD).
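A minimal sketch of this thresholding step is shown below; the threshold value itself is a deployment choice and is not fixed by the study.

```python
# Convert continuous risk scores into binary at-risk / not-at-risk flags at a threshold.
import numpy as np

def classify(scores: np.ndarray, threshold: float) -> np.ndarray:
    """1 = at-risk for undiagnosed CD autoimmunity, 0 = not at-risk."""
    return (scores >= threshold).astype(int)

print(classify(np.array([0.05, 0.40, 0.92]), threshold=0.5))  # [0 0 1]
```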

The hyperparameters for each algorithm were selected by performing five-fold cross-validation within the train set and optimizing for average precision (Supplementary Table 2). Each algorithm was then retrained on the entire train set with its selected hyperparameters, yielding one final model per algorithm for evaluation on the test set.
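A sketch of this tuning step for the XGBoost model, using scikit-learn's GridSearchCV with five folds and average precision as the scoring metric, is given below; the parameter grid shown is an illustrative assumption, not the grid actually searched (see Supplementary Table 2).

```python
# Five-fold cross-validated hyperparameter search optimizing average precision.
from sklearn.model_selection import GridSearchCV
from xgboost import XGBClassifier

param_grid = {                         # illustrative grid only
    "max_depth": [3, 5, 7],
    "n_estimators": [100, 300],
    "learning_rate": [0.05, 0.1],
}
search = GridSearchCV(
    estimator=XGBClassifier(eval_metric="logloss"),
    param_grid=param_grid,
    scoring="average_precision",       # optimization target described above
    cv=5,                              # five-fold cross-validation within the train set
)
# search.fit(X_train, y_train)         # X_train/y_train not shown
# final_model = search.best_estimator_ # refit on the full train set with the selected values
```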

Model selection and evaluation

To test the ability of the models to identify incident CD seropositivity prior to the first documented evidence of disease, the performance of each model was assessed in the test set of highly seropositive cases and controls at run dates one year prior to patients’ index dates. The model with the highest AUC was then additionally evaluated at run dates with two-, three- and four-year time gaps prior to each patient’s index date, as well as at all four time gaps in the test set consisting of seropositive patients and controls. Tests at each time gap were performed on the same base cohort selected according to the criteria described above. Patients from each test cohort who did not meet eligibility criteria at the earlier run dates were removed from those analyses.
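A minimal sketch of this AUC assessment on a held-out test cohort is shown below; the variable names and the helper that rebuilds the feature matrix at each time gap are assumptions for illustration.

```python
# Discrimination assessment: AUC of the positive-class score on the held-out test cohort.
from sklearn.metrics import roc_auc_score

def evaluate_auc(model, X_test, y_test) -> float:
    scores = model.predict_proba(X_test)[:, 1]    # probability of the at-risk class
    return roc_auc_score(y_test, scores)

# for gap_years in (1, 2, 3, 4):                             # time gaps described above
#     X_gap, y_gap = build_features(test_cohort, gap_years)  # hypothetical helper
#     print(gap_years, evaluate_auc(best_model, X_gap, y_gap))
```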

Results

Cohorts of cases of CDA and controls in the train and test sets are described in Table 1.

Table 1 Descriptive characteristics of the train and test cohorts. The first test cohort includes cases of celiac disease autoimmunity with high seropositivity (tTG-IgA > 10X ULN) and matched controls. The second test cohort includes all cases of seropositivity with tTG-IgA > 2X ULN.

Performance of each model on the high-seropositivity test cohort at a gap of one year is shown in Fig. 2. At the one-year gap, discriminatory AUC was highest for the XGBoost model at 0.86, followed by the models using logistic regression (AUC = 0.85), random forest (AUC = 0.83), multilayer perceptron (AUC = 0.80) and decision tree (AUC = 0.77). Feature contributions for the XGBoost model in this test are depicted by SHAP analysis (Fig. 3). The XGBoost model also had the best performance at the two-, three- and four-year gaps, with AUC values of 0.83, 0.82 and 0.81, respectively (Supplementary Table 3).

Fig. 2
figure 2

Receiver operating characteristic (ROC) plot for identification of patients with highly seropositive (tTG-IgA > 10X ULN) celiac disease autoimmunity versus controls one year prior to first documented evidence of disease. The performance of five modeling modalities is compared: XGBoost (XGB), logistic regression, random forest, multilayer perceptron (MLP) and decision tree. Figure prepared with Matplotlib v3.8 (https://matplotlib.org/).

Fig. 3
figure 3

SHAP plot depicting the contribution of the input features at the one-year time gap prior to first documentation of celiac disease (CD) autoimmunity for the XGBoost model. Positive SHAP values contribute positively to classification of a patient as at-risk for undiagnosed CD autoimmunity. A higher value for a given feature is indicated in red, with lower values in blue. Biological sex was coded as a binary variable: 1 = Female; 0 = Male. Figure prepared with SHAP v0.42 (https://shap.readthedocs.io).
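For reference, a plot like Fig. 3 can be generated with the SHAP library roughly as sketched below, assuming a fitted XGBoost model and the test-set feature matrix (variable names are illustrative).

```python
# Sketch of producing a SHAP summary (beeswarm) plot for a fitted XGBoost model.
import shap

explainer = shap.TreeExplainer(xgb_model)        # xgb_model: fitted XGBClassifier (assumed)
shap_values = explainer.shap_values(X_test)      # one value per feature per patient
shap.summary_plot(shap_values, X_test)           # red = high feature value, blue = low
```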

The XGBoost model was then tested on the test set consisting of seropositive cases and controls. Its ability to distinguish between cases and controls in this test set, assessed by AUC, was 0.79, 0.77, 0.75 and 0.75 at gaps of one, two, three and four years, respectively (Supplementary Table 4).

Discussion

In the current study, we describe the development and comparative evaluation of five ML models for identifying adult and adolescent patients with incident CD autoimmunity prior to the first documented evidence of disease. Based on AUC, XGBoost exhibited the strongest ability to distinguish between highly seropositive cases of CD autoimmunity and controls one year prior to initial documentation, followed by the models using logistic regression, random forest, multilayer perceptron and decision tree, respectively. XGBoost is a particularly robust ML method for tabular data and often produces the best performance compared with other ML methods26,30,39,40,41. The model showed excellent discriminatory ability (AUC > 0.80) between highly seropositive cases with likely CD and controls at gaps of one, two, three and four years. The model also showed good discriminatory ability (AUC > 0.7) at all time gaps for more broadly defined seropositive cases compared to controls. These findings suggest the potential utility of this model as a prescreening tool to identify patients at risk of having CDA for eventual evaluation for CD. The final model achieved these results using commonly available laboratory results and demographic features.

The relationships between the demographic and laboratory features of the XGBoost model, as depicted in the SHAP plot, are consistent with established phenomena among patients with untreated CD. The model identified decreased hemoglobin, ferritin, mean cell hemoglobin (MCH), mean cell hemoglobin concentration (MCHC) and mean cell volume (MCV) as predictive of undiagnosed CD autoimmunity. These laboratory findings are characteristic of anemia secondary to malabsorption of dietary iron and chronic inflammation in the setting of CD1. Increased liver function tests (LFTs), specifically alanine aminotransferase (ALT), aspartate aminotransferase (AST) and alkaline phosphatase, contributed positively to classification as at-risk for the model. Among patients with newly diagnosed CD, 20–50% have elevated LFTs, and undetected CD accounts for an estimated 4% of cases of unexplained transaminitis45,46. Low high-density lipoprotein (HDL) has frequently been associated with untreated CD47, and the combination of unexplained iron-deficiency anemia and low HDL may be particularly suggestive of CD48. For the model, low HDL contributed to a positive classification as at-risk for undiagnosed CD autoimmunity. Longitudinal studies have found that these laboratory abnormalities typically resolve after initiation of a GFD, including anemia49, transaminitis50 and low HDL51,52, further highlighting the importance of identifying undiagnosed cases of CD early.

Earlier identification and treatment of CD have been shown to have clinical benefits: screen-detected asymptomatic and mildly symptomatic adult patients typically show improved intestinal histology and reduced serum autoantibody levels after adherence to a GFD53,54. Late diagnosis, in contrast, has been associated with unfavorable clinical outcomes. Patients diagnosed after 40 years of age are more likely than younger patients to show persistent signs of intestinal mucosal injury, including villous blunting and increased intraepithelial lymphocytes in the duodenum, despite following a strict GFD55. Among symptomatic patients, diagnostic delays are associated with poorer long-term outcomes even after initiation of a GFD, including persistent gastrointestinal and extra-gastrointestinal symptoms3,4,56, increased utilization of healthcare resources, and lower reported quality-of-life measures8,57,58. Nevertheless, CD is a highly heterogeneous disease, and the long-term benefits of early identification by screening should be established in future studies59.

To our knowledge, this is the first report of an ML model able to distinguish cases of incident CDA from controls within a large community-based setting using only commonly available laboratory results, biological sex and age in adults and adolescents. The model may have clinical value as a prescreening tool to identify patients who should be evaluated for CDA. Patients who are found to be seropositive can then undergo further evaluation for CD according to clinical guidelines, including additional serologic testing or endoscopic evaluation with multiple biopsies60.

A major strength of the study was the size and continuity of follow-up in the longitudinal dataset, which allowed for large patient cohorts with rich historical data. An additional strength is that, by using commonly available laboratory results and demographic information, the model can be portably implemented in most EMR systems. Follow-up studies are needed to evaluate the robustness of the model across other datasets with different population demographics and EMR systems. Additionally, prospective studies should explore the PPV of a model flag for CDA seropositivity and for undiagnosed CD. This model, if validated, may assist PCPs in identifying patients with CDA at the point of care, or assist health systems in identifying patients in their populations who may benefit from screening for CDA.

The study also had limitations. Due to lack of access to unstructured data such as clinical documentation, patients were selected by the proxy of serologic test results rather than by endoscopy results. Some patients selected as cases with CDA may not have had CD, as diagnostic status reported in clinical documents was not made available to establish CD status; highly seropositive patients are, however, very likely to have CD7. Some controls may also have had CD that was undocumented or undiagnosed at the time the study was conducted. Additionally, there may be heterogeneity in the full population that was not captured in the cohorts of cases and controls used for model training and evaluation. Future studies should examine the robustness of the model and its ability to perform on populations and health care systems that may differ from MHS, through retrospective and prospective validation studies.

In conclusion, this study presents a ML model based on a large population dataset that can identify adults and adolescents at risk of undiagnosed CD autoimmunity using commonly available structured clinical and demographic data.