Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

npj Digital Medicine
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. npj digital medicine
  3. articles
  4. article
Prediction of antibiotic-associated cutaneous adverse drug reactions using electronic health record foundation models
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 04 March 2026

Prediction of antibiotic-associated cutaneous adverse drug reactions using electronic health record foundation models

  • Junmo Kim1 na1,
  • Kyunghoon Kim2,3 na1,
  • Jeong-Eun Yun4,
  • Yu-Kyoung Hwang5,
  • Min-Gyu Kang5,
  • Seok Kim6,
  • Sooyoung Yoo6,
  • Chaiho Shin7,
  • Suhyun Kim8,9,10,
  • Kwangsoo Kim8,10,11 &
  • …
  • Sae-Hoon Kim12,13,14 

npj Digital Medicine , Article number:  (2026) Cite this article

  • 1441 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Diagnosis
  • Health care

Abstract

Cutaneous adverse drug reactions (CADRs) are the most common form of adverse drug reactions, ranging from mild rashes to life-threatening diseases, such as Stevens–Johnson syndrome and toxic epidermal necrolysis. However, there is no effective tool to predict antibiotic-associated CADRs. In this study, we propose an antibiotic-associated CADR prediction model using electronic health record (EHR) foundation models (FMs). EHR FMs are based on the pretraining-finetuning paradigms of language models, corresponding medical codes and their sequences to words and sentences. We included 802,131 inpatients across three tertiary hospitals in Korea, combining EHR data with nursing statements and reports to extract skin rash records. Our approach achieved the best predictive performance compared to all the other baseline models across all datasets. To enhance clinical relevance, we classified CADRs into immediate and delayed types and conducted a detailed sub-analysis. Finally, we found that properly configured EHR FMs can effectively predict the risk of developing antibiotics-associated CADRs, particularly for delayed-type reactions where predictive testing options are limited.

Similar content being viewed by others

Effects of implementing a clinical pathway on antibiotic prophylaxis for patients who underwent an elective surgery

Article Open access 23 November 2022

Meta-analysis of randomized controlled trials of electronic health interventions to reduce medication errors

Article Open access 17 December 2025

Phenotyping people with a history of injecting drug use within electronic medical records using an interactive machine learning approach

Article Open access 30 November 2024

Data availability

The raw data used in this study are not publicly available to preserve participant privacy. The data generated and analyzed during the study are available from the corresponding author upon reasonable request.

Code availability

Source code for the experiments is publicly available at https://github.com/kicarussays/CDM-BERT.

References

  1. Edwards, I. R. & Aronson, J. K. Adverse drug reactions: Definitions, diagnosis, and management. Lancet 356, 1255–1259 (2000).

    Google Scholar 

  2. Lazarou, J., Pomeranz, B. H. & Corey, P. N. Incidence of adverse drug reactions in hospitalized patients: A meta-analysis of prospective studies. JAMA 279, 1200–1205 (1998).

    Google Scholar 

  3. Seo, B. et al. Incidence and economic burden of adverse drug reactions in hospitalization: A prospective study in Korea. J. Korean Med. Sci. 38, e56 (2023).

    Google Scholar 

  4. Gomes, E. R. & Demoly, P. Epidemiology of hypersensitivity drug reactions. Curr. Opin. Allergy Clin. Immunol. 5, 309–316 (2005).

    Google Scholar 

  5. Thong, B. Y., Leong, K. P., Tang, C. Y. & Chng, H. H. Drug allergy in a general hospital: Results of a novel prospective inpatient reporting system. Ann. Allergy Asthma Immunol. 90, 342–347 (2003).

    Google Scholar 

  6. Frey, N. et al. The epidemiology of stevens-johnson syndrome and toxic epidermal necrolysis in the UK. J. Invest Dermatol 137, 1240–1247 (2017).

    Google Scholar 

  7. Lee, E. Y., Knox, C. & Phillips, E. J. Worldwide prevalence of antibiotic-associated Stevens-Johnson syndrome and toxic epidermal necrolysis: A systematic review and meta-analysis. JAMA Dermatol 159, 384–392 (2023).

    Google Scholar 

  8. Singh, M. P. et al. Antimicrobial utilization in a paediatric intensive care unit in India: A step towards strengthening antimicrobial stewardship practices. PLOS ONE 19, e0310515 (2024).

    Google Scholar 

  9. Rossi, G., da Silva Cartell, A. & Marchiori Bakos, R. Dermoscopic aspects of cutaneous adverse drug reactions. Dermatol. Pract. Concept. 11, e2021136 (2021).

    Google Scholar 

  10. Finkelstein, Y., Macdonald, E. M., Li, P., Hutson, J. R. & Juurlink, D. N. Recurrence and mortality following severe cutaneous adverse reactions. Jama 311, 2231–2232 (2014).

    Google Scholar 

  11. Duong, T. A., Valeyrie-Allanore, L., Wolkenstein, P. & Chosidow, O. Severe cutaneous adverse reactions to drugs. Lancet 390, 1996–2011 (2017).

    Google Scholar 

  12. Parasrampuria, S. & Henry, J. Hospitals’ use of electronic health records data, 2015–2017. ONC Data Brief. 46, 13 (2019).

    Google Scholar 

  13. Liang, J. et al. Adoption of electronic health records (EHRs) in China during the past 10 years: consecutive survey data analysis and comparison of sino-american challenges and experiences. J. Med. Internet Res. 23, e24813 (2021).

    Google Scholar 

  14. Lee, K. et al. Digital health profile of South Korea: A cross sectional study. Int. J. Environ. Res. Public Health 19, 6329 (2022).

    Google Scholar 

  15. Huang, D., Cogill, S., Hsia, R. Y., Yang, S. & Kim, D. Development and external validation of a pretrained deep learning model for the prediction of non-accidental trauma. npj Digital Med. 6, 131 (2023).

    Google Scholar 

  16. Shang, J., Ma, T., Xiao, C. & Sun, J. In 28th International Joint Conference on Artificial Intelligence, IJCAI 2019, 5953–5959 (International Joint Conferences on Artificial Intelligence, 2019).

  17. Li, Y. et al. BEHRT: Transformer for electronic health records. Sci. Rep. 10, 7155 (2020).

    Google Scholar 

  18. Rasmy, L., Xiang, Y., Xie, Z., Tao, C. & Zhi, D. Med-BERT: Pretrained contextualized embeddings on large-scale structured electronic health records for disease prediction. npj Digital Med. 4, 86 (2021).

    Google Scholar 

  19. Pang, C. et al. In Proceedings of Machine Learning for Health Vol. 158 (eds Roy Subhrajit et al.) 239–260 (PMLR, Proceedings of Machine Learning Research, 2021).

  20. Li, Y. et al. Hi-BEHRT: Hierarchical transformer-based model for accurate prediction of clinical events using multimodal longitudinal electronic health records. IEEE J. Biomed. Health Inform. 27, 1106–1117 (2023).

    Google Scholar 

  21. Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. In Proc. 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers). 4171–4186 (2019).

  22. Wornow, M., Thapa, R., Steinberg, E., Fries, J. & Shah, N. Ehrshot: An ehr benchmark for few-shot evaluation of foundation models. Adv. Neural Inf. Process. Syst. 36, 67125–67137 (2023).

    Google Scholar 

  23. Kim, J. et al. Pretrained patient trajectories for adverse drug event prediction using common data model-based electronic health records. Commun. Med. 5, 232 (2025).

    Google Scholar 

  24. Fiszenson-Albala, F. et al. A 6-month prospective survey of cutaneous drug reactions in a hospital setting. Br. J. Dermatol 149, 1018–1022 (2003).

    Google Scholar 

  25. Hernández-Salazar, A. et al. Epidemiology of adverse cutaneous drug reactions. A prospective study in hospitalized patients. Arch. Med. Res. 37, 899–902 (2006).

    Google Scholar 

  26. Park, C. S. et al. The use of an electronic medical record system for mandatory reporting of drug hypersensitivity reactions has been shown to improve the management of patients in the university hospital in Korea. Pharmacoepidemiol Drug Saf. 17, 919–925 (2008).

    Google Scholar 

  27. Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).

    Google Scholar 

  28. Jerome, H. F. Greedy function approximation: A gradient boosting machine. Ann. Stat. 29, 1189–1232 (2001).

    Google Scholar 

  29. Graves, A. in Supervised Sequence Labelling with Recurrent Neural Networks (ed Graves A.) 37–45 (Springer Berlin Heidelberg, 2012).

  30. Chung, J., Gulcehre, C., Cho, K. & Bengio, Y. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555 (2014).

  31. Choi, E. et al. Retain: An interpretable predictive model for healthcare using reverse time attention mechanism. Advances in neural information processing systems 29 https://doi.org/10.48550/arXiv.1608.05745 (2016).

  32. Guo, L. L. et al. EHR foundation models improve robustness in the presence of temporal distribution shift. Sci. Rep. 13, 3767 (2023).

    Google Scholar 

  33. Vovk, V. Cross-conformal predictors. Ann. Math. Artif. Intell. 74, 9–28 (2015).

    Google Scholar 

  34. Sun, J. et al. Applying mondrian cross-conformal prediction to estimate prediction confidence on large imbalanced bioactivity data sets. J. Chem. Inf. Model. 57, 1591–1598 (2017).

    Google Scholar 

  35. Ding, T., Angelopoulos, A., Bates, S., Jordan, M. & Tibshirani, R. J. Class-conditional conformal prediction with many classes. Adv. neural Inf. Process. Syst. 36, 64555–64576 (2023).

    Google Scholar 

  36. Zhao, J., Henriksson, A., Asker, L. & Boström, H. Predictive modeling of structured electronic health records for adverse drug event detection. BMC Med Inf. Decis. Mak. 15, S1 (2015).

    Google Scholar 

  37. Bampa, M. & Papapetrou, P. Aggregate-eliminate-predict: Detecting adverse drug events from heterogeneous electronic health records. ArXiv abs/1907.06058 (2019).

  38. Khan, D. A. et al. Drug allergy: A 2022 practice parameter update. J. Allergy Clin. Immunol. 150, 1333–1393 (2022).

    Google Scholar 

  39. Broyles, A. D. et al. Practical guidance for the evaluation and management of drug hypersensitivity: Specific drugs. J. Allergy Clin. Immunol. Pr. 8, S16–s116 (2020).

    Google Scholar 

  40. Doña, I. et al. An algorithm for the diagnosis of beta-lactam allergy, 2024 update. Allergy 80, 633–637 (2025).

    Google Scholar 

  41. Yoon, S. Y. et al. Validation of the cephalosporin intradermal skin test for predicting immediate hypersensitivity: A prospective study with drug challenge. Allergy 68, 938–944 (2013).

    Google Scholar 

  42. Kwon, J. W. et al. Results of intradermal skin testing with cefazolin according to a history of hypersensitivity to antibiotics. J. Korean Med Sci. 34, e319 (2019).

    Google Scholar 

  43. Sousa-Pinto, B. et al. Accuracy of penicillin allergy diagnostic tests: A systematic review and meta-analysis. J. Allergy Clin. Immunol. 147, 296–308 (2021).

    Google Scholar 

  44. Kardaun, S. H. et al. Drug reaction with eosinophilia and systemic symptoms (DRESS): an original multisystem adverse drug reaction. Results from the prospective RegiSCAR study. Br. J. Dermatol 169, 1071–1080 (2013).

    Google Scholar 

  45. Broyles, A. D., Banerji, A. & Castells, M. Practical guidance for the evaluation and management of drug hypersensitivity: General concepts. J. Allergy Clin. Immunol. Pract. 8, S3–s15 (2020).

    Google Scholar 

  46. Brockow, K. et al. EAACI position paper on how to classify cutaneous manifestations of drug hypersensitivity. Allergy 74, 14–27 (2019).

    Google Scholar 

  47. Biedermann, P. et al. Standardizing registry data to the OMOP Common Data Model: experience from three pulmonary hypertension databases. BMC Med. Res. Methodol. 21, 238 (2021).

    Google Scholar 

  48. Benson, T. Principles of health interoperability HL7 and SNOMED. (Springer Science & Business Media, 2012).

  49. Liu, S., Wei, M., Moore, R., Ganesan, V. & Nelson, S. RxNorm: prescription for electronic drug information exchange. IT Professional 7, 17–23 (2005).

    Google Scholar 

  50. McDonald, C. J. et al. LOINC, a universal standard for identifying laboratory observations: A 5-year update. Clin. Chem. 49, 624–633 (2003).

    Google Scholar 

  51. Ke, G. et al. in Proceedings of the 31st International Conference on Neural Information Processing Systems 3149–3157 (Curran Associates Inc., Long Beach, California, USA, 2017).

  52. Wolf, T. et al. Huggingface’s transformers: State-of-the-art natural language processing. arXiv preprint arXiv:1910.03771 https://doi.org/10.48550/arXiv.1910.03771 (2019).

  53. Vaswani, A. et al. Attention is all you need. Advances in neural information processing systems 30 (2017).

  54. DeLong, E. R., DeLong, D. M. & Clarke-Pearson, D. L. Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics 44, 837–845 (1988).

    Google Scholar 

  55. Wilson, E. B. Probable inference, the law of succession, and statistical inference. J. Am. Stat. Assoc. 22, 209–212 (1927).

    Google Scholar 

Download references

Acknowledgements

This work was supported by the Korea Institute of Drug Safety & Risk Management and the AI Institute at Seoul National University. S.H.K. receives funding from Korea Institute of Drug Safety & Risk Management, and J.K. is supported by the fellowship program of the AI Institute at Seoul National University. The funder played no role in study design, data collection, analysis and interpretation of data, or the writing of this manuscript.

Author information

Author notes
  1. These authors contributed equally: Junmo Kim, Kyunghoon Kim.

Authors and Affiliations

  1. Interdisciplinary Program in Bioengineering, Seoul National University, Seoul, Republic of Korea

    Junmo Kim

  2. Department of Pediatrics, Seoul National University Bundang Hospital, Seongnam, Republic of Korea

    Kyunghoon Kim

  3. Department of Pediatrics, Seoul National University College of Medicine, Seoul, Republic of Korea

    Kyunghoon Kim

  4. Department of Internal Medicine, Chung-Ang University College of Medicine, Seoul, Republic of Korea

    Jeong-Eun Yun

  5. Department of Internal Medicine, Chungbuk National University College of Medicine and Chungbuk National University Hospital, Cheongju, Republic of Korea

    Yu-Kyoung Hwang & Min-Gyu Kang

  6. Healthcare ICT Research Center, Office of eHealth Research and Businesses, Seoul National University Bundang Hospital, Seongnam, Republic of Korea

    Seok Kim & Sooyoung Yoo

  7. Interdisciplinary Program of Medical Informatics, Seoul National University, Seoul, Republic of Korea

    Chaiho Shin

  8. Department of Transdisciplinary Medicine, Seoul National University Hospital, Seoul, Republic of Korea

    Suhyun Kim & Kwangsoo Kim

  9. Department of Clinical Medical Sciences, Seoul National University College of Medicine, Seoul, Republic of Korea

    Suhyun Kim

  10. Center for Data Science, Healthcare AI Research Institute, Seoul National University Hospital, Seoul, Republic of Korea

    Suhyun Kim & Kwangsoo Kim

  11. Department of Medicine, Seoul National University College of Medicine, Seoul, Republic of Korea

    Kwangsoo Kim

  12. Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam, Republic of Korea

    Sae-Hoon Kim

  13. Department of Internal Medicine, Seoul National University College of Medicine, Seoul, Republic of Korea

    Sae-Hoon Kim

  14. Institute of Allergy and Clinical Immunology, Seoul National University Medical Research Center, Seoul, Republic of Korea

    Sae-Hoon Kim

Authors
  1. Junmo Kim
    View author publications

    Search author on:PubMed Google Scholar

  2. Kyunghoon Kim
    View author publications

    Search author on:PubMed Google Scholar

  3. Jeong-Eun Yun
    View author publications

    Search author on:PubMed Google Scholar

  4. Yu-Kyoung Hwang
    View author publications

    Search author on:PubMed Google Scholar

  5. Min-Gyu Kang
    View author publications

    Search author on:PubMed Google Scholar

  6. Seok Kim
    View author publications

    Search author on:PubMed Google Scholar

  7. Sooyoung Yoo
    View author publications

    Search author on:PubMed Google Scholar

  8. Chaiho Shin
    View author publications

    Search author on:PubMed Google Scholar

  9. Suhyun Kim
    View author publications

    Search author on:PubMed Google Scholar

  10. Kwangsoo Kim
    View author publications

    Search author on:PubMed Google Scholar

  11. Sae-Hoon Kim
    View author publications

    Search author on:PubMed Google Scholar

Contributions

S.H.K. and J.K. contributed to the conceptualization and design of the study. J.K. handled and mainly analyzed the research data, and all authors interpreted the results. J.K. constructed the machine learning and deep learning models and conducted statistical analysis. J.K. and K.K. wrote the original draft of the paper. S.H.K. and K.K. reviewed the clinical evidence of the study. K.S.K., S.Y., and M.G.K. provided the data, and J.K. verified the quality of the data. J.K., S.H.K., K.S.K., M.G.K., and S.Y. had full access to the raw data. All authors had the final responsibility to submit for publication.

Corresponding authors

Correspondence to Kwangsoo Kim or Sae-Hoon Kim.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary information (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kim, J., Kim, K., Yun, JE. et al. Prediction of antibiotic-associated cutaneous adverse drug reactions using electronic health record foundation models. npj Digit. Med. (2026). https://doi.org/10.1038/s41746-026-02503-x

Download citation

  • Received: 07 May 2025

  • Accepted: 19 February 2026

  • Published: 04 March 2026

  • DOI: https://doi.org/10.1038/s41746-026-02503-x

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • Reviews & Analysis
  • News & Comment
  • Collections
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims and scope
  • Content types
  • Journal Information
  • About the Editors
  • Contact
  • Editorial policies
  • Calls for Papers
  • Journal Metrics
  • About the Partner
  • Open Access
  • Early Career Researcher Editorial Fellowship
  • Editorial Team Vacancies
  • News and Views Student Editor
  • Communication Fellowship

Publish with us

  • For Authors and Referees
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

npj Digital Medicine (npj Digit. Med.)

ISSN 2398-6352 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing