Extended Data Fig. 8: AlphaGenome performance on the CAGI5 MPRA challenge. | Nature

Extended Data Fig. 8: AlphaGenome performance on the CAGI5 MPRA challenge.

From: Advancing regulatory variant effect prediction with AlphaGenome

Extended Data Fig. 8

Comparison of models predicting the effects of regulatory variants on gene expression using CAGI5 MPRA data. (a) Pairwise comparisons of zero-shot, cell-type matched performance using raw DNase model outputs. Scatter plots show per-gene or per-context Pearson correlations. AlphaGenome shows comparable or slightly improved performance compared to Enformer, ChromBPNet, and Borzoi Ensemble based on overall correlations across compared loci (details in plot annotations; cell-type matching strategy adapted from Enformer/Borzoi papers, see Methods). Note that ChromBPNet only reports performance for TERT-HEK293T, thus TERT-GBM is excluded in the AlphaGenome vs ChromBPNet comparison. (b) Pairwise comparisons of cell-type matched (see a) and cell type agnostic performance using LASSO regression on spatially summed DNase features (Methods). Comparison of the original Borzoi scorer (upper row) and AlphaGenome recommended scoring strategy (bottom row). AlphaGenome outperforms Borzoi in all settings and shows that cell type matching is not required when applying LASSO regression. (c) Pairwise comparisons of cell-type agnostic performance using LASSO regression with different input features: left column shows the composite scorer described in Borzoi (DNase + Histone ChIP-seq + RNA), right column shows the composite scorer described in Enformer (DNase + CAGE). AlphaGenome generally outperforms Enformer and Borzoi for all input feature settings.

Back to article page