Fig. 2: GDSC model interpretability. | Nature Communications

Fig. 2: GDSC model interpretability.

From: Learning and actioning general principles of cancer cell drug sensitivity

Fig. 2

A Target recovery for the top 25 ligand-target pairs. The length of the bars on the left indicates the fraction of recovery (# of times the drug-specific model identifies the putative gene as important out of 20 train/test splits). Red lines represent the 95th percentile of the Hit Fraction distribution across all genes for a given drug. The length of the bars on the right shows the median pearson correlation for each drug-specific model; B SHAP (teal) and correlation delta (orange) importances for the Venetoclax drug. Permutation importance reflects the decrease in the model’s prediction accuracy when a feature’s values are shuffled, indicating its importance (greater drops signify higher importance). SHAP importance represents a feature’s contribution to the model’s prediction, with larger absolute values indicating greater importance; C An integrated assessment of the Venetoclax model across various cell lines (X axis). The top plot (in black) shows the experimental IC50 z-scores, while the second plot (in gray) depicts the predicted IC50 values, providing a comparison of model performance against experimental data. The third and fourth plots (in teal and red) respectively represent the SHAP values and expression levels of BCL2. Overall, the figure shows how lower IC50 values (higher drug efficacy) are associated with higher BCL2 expression levels and correctly identified impact (negative SHAP value) of the gene on predicted IC50. Source data are provided as a Source Data file.

Back to article page