Extended Data Fig. 3: Comparison of sequence-to-accessibility and sequence-to-activity models plus controls. | Nature

Extended Data Fig. 3: Comparison of sequence-to-accessibility and sequence-to-activity models plus controls.

From: Targeted design of synthetic enhancers for selected tissues in the Drosophila embryo

Extended Data Fig. 3

a-e) Left: Comparison of predicted DNA accessibility [log2] and predicted enhancer activity [probability] in each tissue for all tested sequences in vivo (inactive in blue, active in red). Density plots show the respective distributions for both predictions for inactive and inactive sequences. Right: precision-recall curves for the sequence-to-accessibility and sequence-to-activity models on test data, plus two additional controls: models trained directly on the in vivo enhancer activity data starting from random initialization and models pre-trained on ATAC-seq data from an unrelated tissue (salivary gland). Respective areas under the precision-recall curve (AUC) are shown. Predictions for all models were computed for each sequence only using the respective cross-validation set where the sequence is held-out for testing.

Back to article page