Fig. 3: Prediction results of SM compound combinations based on multiple linear regression.

a Heatmap displaying adjusted R-squared values for predicting compound combinations using MLR. Because there were only two compound combinations in the top ranks of the MLR results, the heatmap selected and visualized only two compound combinations among the results in which the compound combinations were significant. Color-mapped combinations only used data with a p-value < 0.1 and coefficient > 0 in the multiset. The data used for color mapping were limited to those with a significant coefficient of determination (R2) ≥ 0.6. The red boxes represent single compounds or combinations of two compounds with R2 values greater than 0.8. b The top three optimal compound combinations, with adjusted R-squared scores > 0.8, best reproduced the expression patterns of the 21 SM-derived RAS genes. SM Samul-tang, MLR multi-linear regression model, RAS Rat sarcoma virus.