Abstract
Our aim was to design and validate a software application, based on the Sepsis-3 criteria, capable of retrospectively identifying community-onset sepsis among emergency department patients requiring hospital admission.The application was developed using QlikView (Qlik, King of Prussia, PA, USA) software, and accessed data from the electronic health records TakeCare (CompuGroup Medical, Koblenz, Germany), and CliniSoft (CliniSoft, Kuopio, Finland). The application utilized indicators such as blood culture data, antibiotic administration, and Sequential Organ Failure Assessment scores to detect sepsis cases according to Sepsis-3 criteria. The application was tested retrospectively against a cohort from a large city hospital in Stockholm over a 2-year period, and its performance was compared to physician record reviews in a subset of cases identified by stratified random sampling. The results showed that among 229,195 emergency department visits leading to 60,213 hospital admissions, the application detected 7027 cases of sepsis. Validation using physician record review of a random selection of 426 cases demonstrated a sensitivity, specificity, positive predictive value, and negative predictive value of 95%, 99%, 92%, and 99%, respectively. The lower respiratory tract was the most common site of infection. This software application effectively identified community-onset sepsis patients using electronic health record data with high performance. It has the potential to improve sepsis identification as it operates independently of diagnostic codes and may, therefore, facilitate research in many areas of sepsis. Furthermore, it can be used as a tool within the healthcare system to enhance sepsis surveillance and evaluate quality improvement interventions.
Similar content being viewed by others
Introduction
The World Health Organization recognizes sepsis as a significant global health issue, being a leading cause of morbidity and mortality worldwide1. The definition of Sepsis has evolved over time and is now, according to the Third International Consensus Definition for Sepsis and Septic Shock, considered to be a life-threatening organ dysfunction caused by a dysregulated host response to infection2. Organ dysfunction is identified through the scoring system, Sequential Organ Failure Assessment (SOFA)3,4.
The real burden of sepsis is difficult to estimate. Although the International Classification of Disease (ICD) coding method is the most practical tool to measure sepsis incidence at scale, this method has been proven unreliable when compared to medical record review. A recent study found that the incidence of sepsis among hospitalized patients, based on the Sepsis-3 criteria, was 4.1%, as determined by medical record reviews, while the incidence based on ICD-10 codes was only 1.0%5. Intensive care Unit (ICU) patients had higher rates of correct coding, maybe because ICU doctors are more commonly caring for septic patients and thus more aware of the diagnosis. Other factors affecting the accuracy of ICD coding for sepsis are severity of disease, community-aquired or nosocomial sepsis, time and source of sepsis. A study from Hong Kong showed that using administrative data tends to underestimate the local burden of sepsis when compared to Electronic Health Record (EHR)-based methods, probably due to inadequate training and resources to perform the coding6.
The rational for developing this application was to accurately identify true sepsis patients, enabling epidemiological studies, monitoring of incidence to improve planning of health care, and evaluation of quality improvement interventions.
We have developed a new software application to identify patients with sepsis in the EHR system TakeCare (TC) (CompuGroup Medical, Koblenz, Germany), generally used for hospital and outpatient records, as well as Clinisoft (CCC) (CliniSoft, Kuopio, Finland), a patient data management system (PDMS) used in Intensive Care Units (ICUs).
This software application retrieves data from a large patient cohort. Since this process can be automated, it has the potential to be nearly as efficient as using ICD codes, but potentially more reliable.
The objective of this study was to evaluate the diagnostic accuracy of this fully automated software application to identify patients with community-acquired sepsis using physician medical record review as the reference standard.
Results
Out of the 229,195 patient visits to the emergency department (ED), 60,213 resulted in hospitalization. The application identified 7027, visits (3.1%) as cases of sepsis. Out of these, 3099 (44%) were female and the median age was 74 years. The maximum SOFA-score achieved was a median of 3 points. A total of 732 (10%) of the sepsis visits resulted in an admission to the ICU. Additional descriptive data are outlined in Table 1. Except for bilirubin, measuring the liver function component of the SOFA score, there was minimal missing data in the SOFA scores (Table 2).
According to the reviewers’ medical record review, a total of 129 visits of 140 in group A fulfilled the true Sepsis-3 criteria while 11 were deemed false positive. Four visits of 143 in group B and 1 visit of 143 in group C fulfilled the true Sepsis-3 criteria and these were thus considered to be false negative. Based on these figures, the QV application achieved a sensitivity of 0.95 (95% CI 0.76–0.98), a specificity of 0.99 (95% CI 0.98–0.99), positive predictive value 0.92 (95% CI 0.86–0.96) and negative predictive value 0.99 (95% CI 0.96–1.00). The positive likelihood ratio was 91 (95% CI 56–199), the negative likelihood ratio was 0.061 (95% CI 0.002–0.16) and Youden’s index was 0.93 with (95% CI 0.83–0.99) (Table 3 and Supplemental File 1).
The false positive patients in group A, were further reviewed. One patient had a SOFA score of 2 points due to trauma, while another experienced respiratory failure unrelated to infection. Four patients initially had a baseline SOFA score of 1 point, which increased to 2 points at time zero, and therefore did not meet the criterium of an increase of two SOFA scores. Additionally, one patient had myelodysplastic syndrome, leading to a high SOFA score. Another patient failed to meet the infection criteria, and one received an elevated SOFA score due to an error in GCS (Glasgow Coma Scale) recording. Furthermore, one patient had a high SOFA score due to dehydration, and the last patient experienced a stroke followed by seizures, resulting in increased SOFA score. Initially, in all cases, sepsis could not be ruled out, so antibiotics were started as a precaution but were discontinued shortly after admission.
In group B, the four false negative patients had a SOFA score attributed to respiration, which the application failed to recognize due to inaccuracies in the patient records regarding the amount of oxygen administered. This information was not entered correctly into the EHR. Regarding the false negative patients in group C, one individual had been blood-cultured in a geriatric ward prior to transfer to the emergency department, where new blood cultures were not obtained (Table 4).
Infection characteristics of the patients in group A (n = 129) can be seen in Table 5. The most common site of infection was lower respiratory tract (41/129) followed by urinary tract (31/129). Among subjects in group A that fulfilled the true sepsis criteria, 60% fulfilled the criteria for possible infection, 22% for probable and 19% for definite infection and 16% were classified as possible infection due to unknown or uncertain source of infection (Table 5).
Discussion
The application was designed to analyze a large clinical data set and provide a more reliable and consistent way of identifying septic patients from the EHR. We found that the application was able to capture 95% of sepsis visits, including those admitted to the ICU, with a specificity of 99%. Compared to a previous Swedish study, the application performs slightly better, and one possible reason could be the use of ICU data in addition to the data from EHRs7. Moreover, we found that 16.2% of admissions were classified as possible infection due to an unknown source of infection, highlighting that the application can detect cases of sepsis where diagnosis is difficult. Overall, the application has the potential to be used in all hospitals working with the EHR system TC and the PDMS system CCC, to facilitate further studies on sepsis incidence, mortality, and adherence to the treatment bundles of the Surviving Sepsis Campaign8.
Currently, the strategies used to register sepsis using ICD codes are not sufficient in accurately determining the true incidence of the condition9. In Hong Kong, EHR-based surveillance revealed a rise in standardized sepsis incidence and mortality at the population level from 2009 to 2018. This method proved to be significantly more accurate than administrative definitions6. These results highlight the feasibility and benefits of using an EHR-based approach for large-scale sepsis monitoring. Estimates derived from claims data may not accurately reflect the true clinical picture and can be influenced by variations in how diagnoses and codes are assigned over time10. For example, according to Rhee et al.11, if a hospital uses administrative codes to track sepsis, there is a high probability that they will see higher sepsis case counts and lower sepsis mortality rates. On the other hand, Swedish studies have shown that clinicians rarely use ICD-10 codes indicating sepsis, meaning that many cases would be missed and sepsis incidence probably underestimated5. Apparently, ICD-10 coding seem to differ substantially between different settings12. In the application developed for the present study, clinical data relevant for the Sepsis-3 definition has been used, ensuring a more correct detection of sepsis visits.
Using the application can help hospitals to measure the impact of their sepsis prevention and treatment initiatives and improve the quality of care for patients. This creates the potential for valuable feedback that can guide quality improvement interventions, such as educational programs, systems for earlier sepsis recognition, treatment bundles, and infection control measures.
One limitation of the study is that the application relies on data input by nurses and physicians, including clinical judgments such as decisions regarding blood cultures, antibiotic prescriptions, or initiation of vasopressor therapy. This reliance on individual practitioner judgments introduces a potential source of variability in the data. Additionally, there is no guarantee that all patients with sepsis underwent blood culture testing, meaning some cases may have been missed by the application.
Furthermore, there was a significant level of missing data regarding the liver function (bilirubin). Even though it is probable that physicians have been more likely to have ordered bilirubin in sicker patients or with a suspicion of liver engagement, it is still possible that a subset of these patients would have a SOFA score of ≥ 2 if bilirubin had been analyzed. Thus, these patients should have classified as group A and ultimately this could have resulted in an underestimation of the number of visits diagnosed with sepsis by the application This highlights the importance of adequately training physicians in conducting all tests contributing to the SOFA score in order to recognize patients with potential sepsis.
In relation to the timeframe for data collection, during the assessment of clinical criteria for the sepsis-3 definition, it was allowed for cultures to be drawn up to 24 h after the initiation of antimicrobial therapy, or for antimicrobials to be commenced up to 72 h after culture collection13. The initial intention of this study was to encompass all three time frames. However, during the early stages of application development, it became apparent that utilizing all three windows was intricate and time-intensive, particularly when cross-referencing with the EHR. There was a concern that such complexity might introduce errors into the validation process. Consequently, a decision was made to simplify the approach. We opted for a 48-h time window from the patient’s arrival at the ED, as utilized for the SOFA score, for the timing of both blood cultures and the administration of antimicrobials following the definition of community acquired sepsis14. However, given the requirement for at least two doses of antibiotics for sepsis treatment, only the initial dose needed to be administered within 48 h.
Another limitation is that, although the reviewing physicians independently assessed the medical records to determine sepsis or infection for all cases, they were not blinded to the application software. This could potentially introduce bias and impact the consistency of the results. However, we believe this effect is minimal, as a strict protocol was followed. Also, different reviewers took responsibility for the different validation groups. The participation of one researcher (CD) in the validation process across all three groups (A, B, and C), and the strict protocol should vouch for consistency of our results.
Also, the study was only conducted in one center. Even though the application could be used also in other hospitals utilizing TC and CCC systems, local validation will be needed to use the application in other hospitals..
The present study is limited in its scope to identify community onset sepsis, however, the application could be adapted to detect also hospital onset sepsis cases15.
Methods
Setting and study population
This study was conducted at Södersjukhuset (Stockholm South General Hospital), Sweden, which has one of the largest EDs in the Nordic countries. The study covers a period from May 1st, 2016, to April 30th, 2018. During these two years 229,195 adult visits were registered to the ED and out of these, 60,213 were admitted to the hospital.
Electronic health records
We used the EHR system TakeCare (TC) (used for hospital and outpatient records) as well as Clinisoft (CCC), a patient data management system used in ICUs. Both systems store and retrieve clinical information, including patient history, vital signs, laboratory results, drug administration, other treatments provided, and diagnosis codes.
Application development
A software engineer developed the application in the QlickView (QV) software (Qlik Technologies Inc, King of Prussia, PA, USA.), enabling QV to retrieve and store data from TC, as well as from CCC. Data was generated, coded, and analyzed from a population that consisted of all adult patient visits (≥ 18 years of age) to the ED. The patients were monitored for the initial 48-h period or until they were discharged or passed away, whichever occurred first.
To identify patients likely to have community-onset sepsis according to the Sepsis-3 criteria, the QV application was programmed to identify patients who (1) had blood cultures taken within 48 h from the arrival to the ED, (2) received two doses of antibiotics used for sepsis, and (3) obtained a SOFA score ≥ 2 during at least one 24 h window within 48 h from arrival to the ED. “Time zero”, i.e. the exact time when a patient was considered to have sepsis was defined as the first time a patient obtained a SOFA score ≥ 2. Antibiotics used for sepsis were defined by the recommendations of the Swedish Society of Infectious Diseases (SILF) for treating community onset sepsis in Sweden, namely benzylpenicillin, cloxacillin, piperacillin + tazobactam, cefotaxime, ceftazidime, meropenem, imipenem + cilastatin, tobramycin, gentamicin and amikacin16.
Two doses of antibiotics used for sepsis were required; the first should be administered within 48 h from the patients’ registration at the ED, aiming to identify community-acquired sepsis14. The second dose had to be administered within 18 h from the first dose. The time frame was selected to ensure the inclusion of patients on a twice-daily regimen, even if there were delays in administering the second dose, as all antibiotics recommended by SILF for sepsis treatment require at least two doses per day. Also, the time between prescription and administration may be delayed due to the nurses’ workload.
Patients reaching a SOFA score ≥ 2 for the first time later than 48 h from admission were not included since these were considered as having a hospital acquired sepsis15. The SOFA score’s respiratory component was calculated using the Ellis equation17, converting the peripheral capillary oxygen saturation (SpO2) to partial pressure of oxygen (PaO2) when PaO2 from arterial blood was not available. GCS was not used as a SOFA score criterion for intubated (and thus sedated) patients admitted to the ICU. Urine output was not used as a criterion due to a lack of data. If a SOFA score component was lacking, the value was considered to be zero.
The application did not incorporate the calculation of the patient’s baseline SOFA score. This decision was made due to the potential risk of missing information from previous hospital contacts and the possibility of patients being admitted to other hospitals that do not utilize TC or CCC as their EHR systems.
Validation process
The QV software application was used for the entire cohort of the 60,213 consecutive visits to the emergency department that led to hospitalization. For validation of the application with medical record review, a stratified random sampling was made from three groups18. Stratification was made to ensure a sufficient number of patients from the relatively small groups A and B. From group A, consisting of 7027 ED visits considered to have sepsis according to the application, (i.e. blood cultures taken, antibiotics administered and a SOFA score of ≥ 2) a random sample of 140 visits was selected. From group B, consisting of 1 958 visits that had a suspicion of a serious infection but not sepsis according to the application (i.e. blood cultures taken, antibiotics administered but a SOFA score of < 2) a random sample of 143 visits was selected. Finally, from group C, consisting of 51 228 visits with low likelihood of sepsis or serious infection (i.e. not fulfilling the criteria for groups A and B) a random sample of 143 visits was selected. Thus, a total of 426 ED visits were selected by stratified random sampling from the total cohort and included for medical record review (Fig. 1).
Three emergency physicians, one licensed physician from the UK, and one ICU physician were tasked with reviewing whether patients met the Sepsis-3 clinical criteria. They conducted this assessment independently.
The application’s accuracy in detecting community-onset sepsis was compared to manual examination of the EHR.
The reviewers manually calculated baseline SOFA scores. For each component of the SOFA score, the baseline was determined as the best value measured within 90 days prior to arrival at the ED. If the value was unknown, it was assumed to be zero. The extent of missing data for the various SOFA components is detailed in Table 2. The manual examination revealed only one discrepancy between the missing data from the script and the missing data in the manual examination. In group A, the application identified five patients as having fewer points in the respiration SOFA compared to the manual examination, and the same occurred with one patient in group B. This discrepancy arose because the implementation of oxygen in some patients was only recorded manually by physicians or nurses, which the application could not detect in free text. However, in the manual examination, it was observed that all the six patients had a higher SOFA point when the manually recorded implementation of oxygen was added. The rest of the missing data did not present any discrepancy between the reviewers and the application.
Admissions that met the Sepsis-3 clinical criteria of having an increase in SOFA score of ≥ 2, as determined by physician review, were further evaluated for the likelihood of infection. For this purpose, the reviewers scrutinized clinical, microbiological and radiological findings of each patient. The episodes were classified into four categories: no infection, possible infection, probable infection, and definite infection, in accordance with the definitions outlined by Klouwenberg et al.19. Additionally, infections from an unknown source were included based on criteria established by Valik et al., defined as symptoms indicating an infection according to the attending physician, and the patient receiving a full course of anti-infective treatment, but no source could be determined. This category could only be classified as a possible infection. To be considered a true sepsis case in the assessment of the application’s sensitivity and specificity, patients had to meet Sepsis-3 clinical criteria (increase in SOFA score and the presence of possible, probable, or definite infection)7.
Statistical analyses
Weighted estimates of sensitivity, specificity, Positive Predicted Value, Negative Predictive Value, and Youden’s index were used to account for bias caused by the stratified sampling scheme.
Confidence intervals with a 95% confidence level were calculated from 10 000 bootstrap samples18. Furthermore, the positive and negative likelihood ratios were calculated. Data handling and statistical analyses were conducted using R 4.1.1.
Large language models (LLM), ChatGPT
A large language model, ChatGPT, was used solely for the purpose of correcting the English language in some parts of this manuscript; no content or images were created by the LLM.
Conclusion
Our study demonstrates the feasibility of utilizing a software application, based on the Sepsis-3 criteria, to automatically identify patients with community onset sepsis, achieving high sensitivity, specificity, and predictive values in patients admitted through the emergency department. This application has the potential to improve sepsis identification as it operates independently of diagnostic codes and may, therefore, facilitate research in many areas of sepsis. Furthermore, it can be used as a tool within the healthcare system to enhance sepsis surveillance and evaluate quality improvement interventions.
Data availability
The data used for this study is available upon request to Dr. Cristian Duré at cristian.dure@ki.se.
References
WHO, W.H.O. Protocol for the assessment of national communicable disease surveillance and response systems: Guidelines for assessment teams. 2017 9 January 2017; Available from: https://apps.who.int/iris/bitstream/handle/10665/273181/B140_12-en.pdf?sequence=1&isAllowed=y.
Singer, M. et al. The Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3). JAMA 315(8), 801–810 (2016).
Benjamin Flam, A.O.S.n.d.o.k.f.L.s.M.K.-.-c., 113:DY3A; Available from: https://lakartidningen.se/klinik-och-vetenskap-1/kommentar/2016/03/sepsis-nya-definitioner-och-kriterier-foreslas/.
https://www.who.int/news-room/fact-sheets/detail/sepsis., W.H.O.S.A.f.
Mellhammar, L. W. E. et al. Estimating sepsis incidence using administrative data and clinical medical record review. JAMA Netw. Open. 6(8), e2331168. https://doi.org/10.1001/jamanetworkopen.2023.31168 (2023).
Ling, L. Z. J. et al. Population sepsis incidence, mortality, and trends in Hong Kong between 2009–2018 using clinical and administrative data. Clin. Infect Dis. https://doi.org/10.1093/cid/ciad491 (2023).
Valik, J. K. W. L. et al. Validation of automated sepsis surveillance based on the Sepsis-3 clinical criteria against physician record review in a general hospital population: Observational study using electronic health records data. BMJ Qual. Saf. 29(9), 735–745 (2020).
Evans, L., Rhodes, A., Alhazzani, W. et al. Surviving sepsis campaign: international guidelines for management ofsepsis and septic shock 2021. Intensive Care. Med. 47, 1181–1247 (2021).
Johnson, A. E. W. et al. A comparative analysis of sepsis identification methods in an electronic database. Crit. Care Med. 46(4), 494–499. https://doi.org/10.1097/CCM.0000000000002965 (2018).
Wilhelms, S. B., Granath, G. & Sjöberg, F. Assessment of incidence of severe sepsis in Sweden using different ways of abstracting International Classification of Diseases codes: Difficulties with methods and interpretation of results. Crit. Care Med. 38(6), 1442–1449. https://doi.org/10.1097/CCM.0b013e3181de4406 (2010).
Rhee, C. et al. Incidence and trends of sepsis in us hospitals using clinical vs claims data, 2009–2014. JAMA 318(13), 1241–1249. https://doi.org/10.1001/jama.2017.13836 (2017).
Johansson D, E.H., Beronius E, Rasmussen M. Systematisk journalgranskning i Skåne - Diagnoskoder var ofta fel vid svår sepsis och septisk chock [Systematic medical record review in Skåne. Diagnostic codes were often wrong in severe sepsis and septic shock]. Lakartidningen. 2015 Sep 14;112:DI7S. Swedish. PMID: 26371482.
Seymour CW, L.V., Iwashyna TJ, Brunkhorst FM, Rea TD, Scherag A, Rubenfeld G, Kahn JM, Shankar-Hari M, Singer M, Deutschman CS, Escobar GJ, Angus DC. , Assessment of Clinical Criteria for Sepsis: For the Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3).
Tsertsvadze, A. et al. Community-onset sepsis and its public health burden: A systematic review. Syst. Rev. 5, 81. https://doi.org/10.1186/s13643-016-0243-3 (2016).
Rhee, C., Zhang, Z., Fram, D., Kadri, S. S. & Klompas, M. CDC Prevention Epicenters Program. Epidemiology of hospital-onset versus community-onset sepsis in US hospitals and association with mortality: A retrospective analysis using electronic clinical data. Crit. Care Med. 47(9), 1169–1176. https://doi.org/10.1097/CCM.0000000000003817 (2019).
Infektionsläkarförening. S. Sepsis och septisk chock – Vårdprogram från SILF. . [cited 2022. https://infektion.net/knowledge/vardprogrammet-for-sepsis-och-septisk-chock/.
Shrirang M Gadrey1, C.E.L., RussClay, Garret TRhodes, Douglas E Lake, ChristopherC Moore,, and J.D.A.J. Moorman. Imputation of partial pressures of arterial oxygen using oximetry.
Cadieux, G. M., Tamblyn, R., Buckeridge, D. L. & Dendukuri, N. Validation of diagnostic groups based on health care utilization data should adjust for sampling strategy. Med. Care. 55(8), e59–e67 (2017).
Klein Klouwenberg, P. M. et al. Interobserver agreement of centers for disease control and prevention criteria for classifying infections in critically ill patients. Crit. Care Med. 41(10), 2373–2378. https://doi.org/10.1097/CCM.0b013e3182923712 (2013).
Funding
Open access funding provided by Karolinska Institute.
This work was supported by The Swedish Society of Medicine (Grant number sls-934824) and a local research fund from Stockholm South General Hospital (Södersjukhuset).
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design. The application was designed by Cristian Duré (CD), Sandra Jonmarker (SJ) and Liivi Rimling assisted by Maria Cronhjort, Eva Joelsson-Alm, Hampus Nordqvist (HN), Katarina Bohm, and Kristian Ängeby (KÄ). C.D. and S.J. reviewed the clinical records in with the assistance of K.Ä. and H.N. Statistical advising and calculations was done by Mikael Andersson Franko. The first draft of the manuscript was written by C.D. and all authors commented on previous versions of the manuscript. MC and KÄ have jointly supervised the work. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethics approval
This study was performed in line with the principles of the Declaration of Helsinki. Approval was granted by the Swedish Ethical Review Authority (Ref. number 2018/56931 and complementary ethical permit Ref. number 2019/02683 and Ref. number 2021/00446). The need for obtaining informed consent was waived by the Swedish Ethical Review Authority due to the retrospective nature of the study.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Duré, C., Jonmarker, S., Joelsson-Alm, E. et al. Validation of a software application using electronic health records for automatic detection of community onset sepsis. Sci Rep 15, 16412 (2025). https://doi.org/10.1038/s41598-025-99879-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-025-99879-9