Fig. 4

The selection of characteristic genes of ALI via machine learning algorithm. (A, B) LASSO analysis of the combined dataset. (C) Biomarkers were screened based on SVM-RFE. (D) Based on RF algorithm to screen biomarkers. (E) A Venn diagram illustrated the overlap between diagnostic markers identified through machine learning algorithm. (F) Boxplot showed the expression of hub genes between ALI and control group in combined dataset. (G) Boxplot showed the expression of hub genes between ALI and control group in GSE216943. (H) The ROC curve of the diagnostic efficacy verification. (I) The ROC curve of the diagnostic efficacy verification in GSE216943. P-values were calculated as mean ± SD, P < 0.05 were considered statistically significant differences. *P < 0.05; **P < 0.01; ***P < 0.005. Two-tailed unpaired Student’s t-test for two groups or one-way ANOVA for three groups or more. ALI, Acute Lung Injury. LASSO, Least Absolute Shrinkage and Selection Operator. RF, Random Forest. SVM-RFE, Support Vector Machine-Recursive Feature Elimination. ROC, Receiver Operating Characteristic Curve.