Fig. 1: Identification of TFA-BT NCVs.
From: Widespread perturbation of ETS factor binding sites in cancer

a Overview of the TFA-BT approach. The number of observed NCVs across tumor samples that disrupt (or create) a binding site of TF A in promoter B is compared to the expected probability distribution to identify significant promoter-TF associations. b Number of TFA-BT NCVs with predicted gain and/or loss of TF binding per cancer type. c Scatter plot showing the number of different TFA-BT NCVs per gene in the PCAWG cohort versus the number of TFA-BT NCV events in the corresponding promoter in patients from PCAWG. Insert shows fraction of patients in PCAWG for each mutation in the TERT promoter. d Percentage of prognostic (i.e., genes whose expression levels are favorably or unfavorably associated with cancer), fitness-related, and essential genes within all protein-coding (n = 19,208), IntOGen (n = 561), Cancer Gene Census (CGC, n = 729), and TFA-BT genes (n = 746). Statistical significance determined by two-sided Fisher’s exact test compared to all protein-coding genes. Error bars indicate standard error of the proportion. e Biological process gene ontology fold enrichment associated with different terms for IntOGen and TFA-BT gene sets. Each dot represents a gene ontology term classified into general classes. Insert shows overlap between TFA-BT and IntoGen genes. Source data are provided as a Source Data file.