Introduction

Kawasaki disease (KD), an acute febrile illness in infants and children, is characterized by systemic vasculitis affecting medium-sized arteries, especially coronary arteries.1,2 Since the development of coronary artery abnormality (CAA) is a serious complication,2,3 it is clinically important to control as quickly as possible any inflammation causing vasculitis. Although high-dose (2 g/kg) intravenous immunoglobulin (HD-IVIG) therapy has been established as a standard initial treatment,2,4 the development of CAA in the acute phase and residual CAA beyond the acute phase remains unsolved. Indeed, in a recent nationwide survey in Japan,5 approximately 8% and 2% of the KD patients developed CAA in the acute phase and had a residual CAA beyond the acute phase, respectively. Thus, to further improve therapeutic outcomes regarding the development of CAA, it is important to clarify the clinical characteristics of high-risk cases for developing CAA.

Recently, the utility of hierarchical clustering analysis by the machine learning approach has been demonstrated for assisting the clinical subgroups of diverse diseases.6 In a series of KD cases, Wang H et al. applied an unsupervised, data-driven cluster analysis and successfully identified four subgroups with distinct clinical features and treatment outcomes.7 In the present study, to more straightforwardly identify clinical characteristics associated with the development of CAA through a different approach from the previous report by Wang H et al.7 we performed a cluster analysis in 103 KD cases who developed CAA (Z-score >2.5).8 in the acute phase among 726 consecutive KD cases treated with an identical treatment protocol. We identified four subgroups and evaluated their association with clinical features and therapeutic outcomes.

Materials and Methods

Study participants

The study is a retrospective review of 726 consecutive KD cases in Japan who were diagnosed between July 2015 and December 2022 (7.5 years) in all 11 inpatient facilities for the care of pediatric patients in Yamanashi, as well as in 1 facility in Nagano (Supplementary Table 1). The study was performed with the central approval of the Research Ethics Committee of the University of Yamanashi Hospital (Approval Number 1698). The registration database was constructed with anonymized clinical records, which were provided every year from each facility under the agreement of each ethics committee. Diagnosis of KD was retrospectively confirmed based on criteria defined in the sixth edition of the Japanese Kawasaki Disease Diagnostic Guidelines.8 All 726 cases, including 75 incomplete KD cases, were treated within 9 days after disease onset. The first day of illness was defined as the day when at least one of the major symptoms appeared. During the COVID-19 pandemic (from March 2020 to December 2022), 169 cases were diagnosed with KD, none of whom were considered to be multisystem inflammatory syndrome in children (MIS-C) due to negative SARS-CoV-2 test (polymerase chain reaction analysis or antigen test) results on admission and no recognizable direct contact with COVID-19 cases within 2 months prior to diagnosis.

Echocardiographic evaluation was routinely performed before IVIG treatment for all the cases enrolled in this cohort. Additionally, echocardiographic evaluation was performed 48 hours after the initial treatment, before and after any additional treatment, and before discharge from the hospital. Further evaluation was performed at least every few days in cases that developed coronary artery abnormalities, until it was normalized in the acute phase. The development of coronary artery dilation was defined whenever the body surface area-adjusted Z score of any coronary arteries (left main, anterior descending, and circumflex arteries and right coronary arteries) was > 2.0, and the development of CAA was defined whenever it was > 2.5.8 When CAA was developed in the patients who were treated in the facilities other than University of Yamanashi Hospital in the acute phase (within 30 days after disease onset), they were transferred to the University of Yamanashi Hospital for further cardiac evaluation by pediatric cardiologists and acute-phase treatment including plasma exchange if required. Then, they were followed up in the outpatient clinic of the University of Yamanashi Hospital. Regular cardiac echo evaluation of the case with CAA was independently performed by at least two pediatric cardiologists.

Treatment of Kawasaki disease

All facilities performed an identical treatment protocol, as confirmed in the annual meeting, in which a representative pediatrician from each facility participated. All patients were treated with 2 g/kg/dose of IVIG in combination with oral aspirin (30 mg/kg/day) or its analogs as a first-line treatment immediately after the definitive diagnosis based on the criteria.8 IVIG treatment was completed within 24 hours after diagnosis, without any other anti-inflammatory agents, including steroids and cyclosporine A. The response to the initial treatment was evaluated 48 hours after the initiation of IVIG administration. The cases were considered to be ‘IVIG resistant’ when their body temperature was over 37.5 °C and serum C-reactive protein (CRP) level was higher than half of the peak value.9,10

The second-line treatment was mainly intravenous administration of infliximab at 5 mg/kg. Meanwhile, additional IVIG was administered as the second-line treatment in cases aged under 12 months old, in cases complicated with any active infection, or in cases with disease that developed within 12 weeks after live virus vaccination. The patients were considered to be resistant to the second-line treatment when the fever was unresolved within 48 hours after initiation of the second-line treatment. All cases resistant to the second-line treatment were transferred to the University of Yamanashi Hospital, and plasma exchange was performed in cases refractory to further IVIG treatment as the third-line treatment or in cases with rapidly progressive CAA.11

Machine learning

Cluster analysis using the Python software (version 3.11.5) was performed in 103 cases that developed CAA in the acute phase. For any missing laboratory data, the median value was complementary used in the machine learning. In the first step, using 33 continuous variables before starting initial treatment (Supplementary Table 2), we evaluated Spearman’s rank correlation coefficient between each pair of the 33 continuous variables. When the r-value between two variables was 0.5 or higher, we chose the variable with fewer missing values. Resultantly, we identified 20 variables (Table 1 and Supplementary Fig. 1). The number of missing values for each of the 20 items is indicated in Supplementary Table 3. In the second step, we normalized input data by a standard scaler.12,13, and performed a principal component analysis to ensure that the contribution ratio remained above 80%.14,15,16 In the final step, we performed hierarchical cluster analysis.16,17 The Hopkins statistic was used to assess the clustering tendency of data.18

Table 1 List of the variables used for clustering analysis.

Statistical analysis

In comparisons of continuous variables between the two subgroups, the Mann–Whitney U test was performed when the Kruskal–Wallis rank sum test results were significant among the four subgroups. Pearson’s Chi-squared test was performed for comparisons of categorical variables. The Bonferroni method was used to compare significance levels for multiple comparisons. All statistical analyses were performed using EZR software (version 1.41; Saitama Medical Center, Jichi Medical University, Saitama, Japan).19

Results

During the study period, among 726 consecutive cases, 103 cases (14.2%) developed CAA (Z-score > 2.5)8 within 30 days after disease onset; 18 cases and 3 cases had residual CAA on day 30 and 1 year after disease onset, respectively. No cases developed giant aneurysm during the study period. When the demographic characteristics and laboratory findings were analyzed (Supplementary Table 4), the 103 cases with CAA were significantly younger; had significantly higher rates of incomplete KD and treatment resistance; higher maximum coronary artery Z-scores before initial treatment (Pre-max CA Z-score); lower levels of hemoglobin, total cholesterol, and high-density lipoprotein (HDL) cholesterol; and higher levels of serum albumin, alanine aminotransferase (ALT), and CRP than the 623 cases without CAA.

In the cluster analysis of the 103 cases with CAA, we identified four subgroups (Clusters 1 to 4) (Fig. 1 and Supplementary Fig. 2) with a reliable cluster structure (Hopkins statistic: 0.76) except for two cases; one developed KD shock syndrome and macrophage activation syndrome, and the other developed severe hepatic dysfunction. Thus, we focused on the 101 cases with CAA that were clustered into Cluster 1 (22 cases, 21.8%), Cluster 2 (32 cases, 31.7%), Cluster 3 (30 cases, 29.7%), and Cluster 4 (17 cases, 16.8%). In each facility (Supplementary Table 1), there were some variations in the distribution of each cluster, but no statistically significant differences were observed as a whole (p = 0.23 in Chi-square test).

Fig. 1: Dendrogram of the hierarchical map in the hierarchical clustering analysis.
figure 1

The horizontal axis indicates each case in each subgroup, and the vertical axis indicates Euclidean distance.

In the demographics at diagnosis (Table 2), the cases in Clusters 1 and 2 were significantly younger than those in Clusters 3 and 4 (Fig. 2a). Days of initial treatment were significantly earlier in Cluster 4 than in the other subgroups (Fig. 2b). There were no significant differences in the rate of incomplete KD among the four subgroups.

Fig. 2: Violin diagrams with swarm plots in four subgroups.
figure 2

a age, b days of initial treatment, and c max coronary artery Z-score before treatment (Pre-max CA Z-score). *p < 0.05, **p < 0.01, ***p < 0.001 in the Mann–Whitney U test.

Table 2 Characteristics of each subgroup.

In the treatment responses (Table 2), IVIG resistance in the first and second-line treatments was significantly more common in Cluster 4 but uncommon in Cluster 1. Plasma exchange was significantly more common in Clusters 2 and 4 than in Clusters 1 and 3. In the cardiac echo evaluation (Table 2), maximum coronary artery Z-scores before initial treatment (Pre-max CA Z-scores) were significantly lower in Cluster 4 than in the other subgroups (Fig. 2c). Although it was the highest in Cluster 1, no statistically significant differences were observed in the incidence of residual CAA 30 days from disease onset among the four subgroups. In echocardiographic follow-up of residual CAA (Table 2 and Fig. 3), regression within six months from disease onset was frequently observed in Clusters 1 and 3, while it was rarely observed in Clusters 2 and 4. Regression of residual CAA within six months after diagnosis was observed in seven of eight cases with max Z-score within day 30 < 5.0, while it was observed in only one of eight cases with max Z-score within day 30 > 5.0.

Fig. 3: Regression of residual CAA.
figure 3

The horizontal axis indicates the time from onset, and the vertical axis indicates Z-score. CAA was considered to be regressed when the Z-score fell below 2.5.

In the peripheral blood counts (Table 2), leukocytosis was the most remarkable in Cluster 1 among the four subgroups (Fig. 4a), while anemia was the most remarkable in Cluster 2 (Fig. 4b). Thrombocytosis was significantly less remarkable in Cluster 4 than in the other subgroups (Fig. 4c). Regarding blood chemistry (Table 2), levels of serum aspartate aminotransferase (AST), ALT, and total bilirubin were significantly higher in Cluster 4 than in the other subgroups (Fig. 4d–f), while levels of serum sodium were significantly lower in Cluster 4 (Fig. 4g). There were no significant differences in the level of serum CRP among the four subgroups (Fig. 4h). Serum albumin level was relatively lower in Cluster 2 (Fig. 4i), and a statistically significant difference was observed between Clusters 2 and 3. As for the levels of serum lipid, triglyceride (TG) was significantly higher in Cluster 1 than in the other subgroups (Fig. 4j), total cholesterol was significantly lower in Cluster 2 than in Clusters 1 and 3 (Fig. 4k), and HDL cholesterol was relatively higher in Clusters 3 and 4 than in Clusters 1 and 2 (Fig. 4l). In the scoring systems for predicting IVIG resistance (Table 2), all three scores in Gunma,20 Kurume,9 and Osaka.21 systems were significantly higher in Cluster 4 than in the other subgroups (Fig. 4m–o).

Fig. 4: Violin diagrams with swarm plots of 11 blood test items and 3 IVIG resistance prediction scores in four subgroups.
figure 4

a white blood cell count (WBC), b hemoglobin, c platelet count, d aspartate aminotransferase (AST), e alanine aminotransferase (ALT), f total bilirubin, g sodium, h C-reactive protein (CRP), i albumin, j triglyceride, k total cholesterol, l high density lipoprotein (HDL) cholesterol, m Gunma score, n Kurume score and o Osaka score. *p < 0.05, **p < 0.01, ***p < 0.001 in the Mann–Whitney U test.

In summary, Cluster 1 was characterized by leukocytosis and elevated serum TG levels in a relatively younger-aged group with a higher Z-score of coronary artery dilation before treatment and a relatively good response to initial IVIG treatment. Cluster 2 was characterized by anemia and lower serum total cholesterol level in a relatively younger-aged group with an average response to initial IVIG treatment. Cluster 3 was a relatively older-aged group lacking characteristic laboratory findings, with the highest Z-score of coronary artery dilation before treatment, but earlier regression of residual CAA. Cluster 4 was characterized by elevated levels of serum AST, ALT, and total bilirubin and decreased levels of sodium in a relatively older-aged group with poor response to initial IVIG treatment, despite the lowest Z-score of maximum coronary artery dilation before treatment.

Discussion

In this study, we performed clustering analysis in 103 KD cases who developed CAA within 30 days after disease onset, and identified four subgroups with different clinical characteristics and treatment responses. In the present study, we included serum lipid levels in the clustering analysis, since serum lipid levels have been identified as one of the predicting factors for IVIG-resistance in several previous studies, including ours.22,23,24 Dyslipidemia, including higher levels of TG and lower levels of total cholesterol, is associated with the severity of systemic inflammation in KD, although the underlying mechanism remains unclear. Indeed, each cluster showed a characteristic pattern in serum lipid levels. However, we could not confirm a consistent association of dyslipidemia with the development and regression of CAA as well as IVIG-resistance in each cluster, suggesting that further evaluation in a larger cohort is necessary for this issue.

Among the four subgroups, Cluster 4 was the most characteristic subgroup. Although all the cases in this cluster analysis developed CAA within 30 days after disease onset, coronary artery Z-scores before treatment were significantly lower in Cluster 4 compared to the other three subgroups. In other words, the cases in Cluster 4 most frequently developed CAA during treatment. Indeed, among the four subgroups, resistance to the first and second-line treatments was most frequent in Cluster 4, and, subsequently, plasma exchange was performed more frequently in Cluster 4 than in the other three subgroups. However, in Cluster 4, maximum coronary artery Z- scores within 30 days after disease onset were fairly similar to those in the other three groups, and the incidence of residual CAA on day 30 after disease onset was relatively lower among the four subgroups. Our observations suggested that, in Cluster 4, the second-line treatment and plasma exchange could be effective in preventing further progression of CAA, although we were unable to directly clarify the underlying mechanisms for this association. Cluster 4 was characterized by elevated levels of serum AST, ALT, and total bilirubin and decreased levels of serum sodium. Cluster 4 also showed significantly higher scores in the scoring systems for IVIG resistance prediction, despite the lower incidence of CAA at diagnosis. Thus, it is feasible to predict that the currently ongoing intensified initial IVIG treatment in combination with other anti-inflammatory agents.25,26,27,28,29 might be effective in preventing CAA development in cases characterized by clinical features observed in Cluster 4. In contrast to Cluster 4, Cluster 3 seemed to be the most severe in terms of pre-treatment coronary artery dilatation. However, regression of residual CAA was observed within six months in all three cases with residual CAA, suggesting that further intensification of initial treatment might not be required in this cluster.

Recently, accumulated evidence has shown that a higher Z-score from any coronary artery before initial treatment is associated with a higher risk of developing CAA.30,31,32,33,34,35,36 Indeed, in our consecutive 726 KD cases, the 103 cases with CAA in the acute phase showed significantly higher Z-scores of any coronary arteries before initial treatment than the 623 cases without it. Among the 101 cases with CAA clustered into 4 subgroups, 58 cases (57.4%) showed coronary artery dilation before treatment (Z-score was higher than 2.0). Of note, the majority of these cases with dilated coronary arteries before treatment (57/58 cases) were clustered into Clusters 1–3, and the incidence of residual CAA on day 30 after disease onset was relatively higher in Clusters 1 and 2, despite their younger age onset and lower scores in the scoring systems. In terms of regression of residual CAA, regression within six months from disease onset was common in Clusters 1 and 3, while it was uncommon in Clusters 2 and 4. Thus, when coronary artery dilation was confirmed before treatment in the low-risk cases for IVIG resistance, such as Clusters 1 and 2, intensification of initial treatment by a combination of other anti-inflammatory agents might be more crucial in Cluster 2, rather than in Cluster 1, to prevent CAA development. However, even in Cluster 1, one in five cases with residual CAA had persistent CAA over two years after diagnosis, suggesting a requirement for other biomarker(s) for the prediction of persistent residual CAA in Cluster 1.

Clustering analysis is a method of unsupervised machine learning, and its utility in clinical medicine has been increasingly confirmed.37,38,39,40 In the recent clustering analysis of a large series of KD cases by Wang H et al.7 four subgroups were identified: (1) liver subgroup characterized by hepatobiliary involvement with elevated levels of ALT; gamma-glutamyl transferase, and total bilirubin showing the lowest rate of coronary artery aneurysm and the highest rate of IVIG resistance; (2) band subgroup characterized by the highest band neutrophil count showing a high KD shock rate; (3) node subgroup characterized by cervical lymphadenopathy with higher markers of inflammation and the lowest age-adjusted hemoglobin Z-scores; and (4) young subgroup characterized by young aged onset showing the highest rate of coronary artery aneurysm but the lowest rate of intravenous immunoglobulin resistance. Among these four subgroups identified by Wang H et al.7 although not directly comparable to our observations due to different patient population and treatment, the characteristics in the liver subgroup were similar to those in Cluster 4 in the present study, suggesting the importance and prevalence of this clinical entity as a treatment-resistant subgroup in older children. These subgroups of the cases were commonly characterized by elevated serum ALT and total bilirubin levels in the older-age cases with the highest rate of IVIG resistance, suggesting the importance of these characteristics and laboratory findings in the clinical management of KD cases.

This study has several limitations. First, the majority of the cases in the present study were of Japanese ethnicity, and it was relatively small sample size. Second, although the efficacy of intensive initial IVIG treatment combined with other anti-inflammatory agents has been recently reported for high-risk KD patients,25,26,27,28,29 first-line treatment in the present study was a standardized IVIG protocol in combination with oral aspirin or its analogs. Third, this study was retrospectively performed based on the registration database of clinical records. Fourth, several biomarkers for IVIG-resistance, such as eosinophil count.41,42 were not included in this study, suggesting that such missing factors might somehow affect the pattern of clustering. Finally, the long-term outcome of CAA was not fully evaluated.

In conclusion, our cluster analysis of the KD cases that developed CAA in the acute phase identified four subgroups with distinct clinical characteristics and treatment responses, suggesting different backgrounds for disease progression in a single disease entity. In particular, common characteristics between our Cluster 4 and ‘liver subgroup’ identified by Wang H et al.8 indicate the importance and prevalence of this clinical entity. Clinically, our observations also suggest that intensification of initial treatment with additional anti-inflammatory agents might be more crucial in Clusters 2 and 4 rather than in Clusters 1 and 3. Although further validation is needed, consideration of possible heterogeneity in the progression of CAA may be helpful for a better understanding of pathophysiology and, subsequently, better treatment strategies to protect KD cases from CAA development.