Fig. 1: BulkECexplorer online app display.

The image shows a snapshot of the output of the BulkECexplorer when queried for a gene of interest (for example, SRC). The blue section displays the gene detection rate and expression range. Top left box, stacked bar chart depicting the number of datasets with SRC >0 TPM, resolved by EC subtype. The percentage of datasets with SRC >0 TPM in each EC subtype is reported below each bar. The percentage of datasets with SRC >0 TPM across all datasets, independently of subtype, is reported above the bar graph. Top right box, boxplots of SRC TPM values for individual datasets, resolved by EC subtype, including the median (center line). The bottom boxes show the corresponding data with a default ‘>1 TPM’ expression threshold that can be customized. The red dashed line (bottom right box) indicates the 1 TPM gene expression threshold. The green section summarizes data obtained by predicting leaky versus active genes using GMMs. Left box, stacked bar chart depicting the number of datasets in which SRC expression was classified as active, leaky or undetermined, resolved by EC subtype. The percentage of datasets in which SRC expression was classified as active in each EC subtype is reported below each bar. The percentage of datasets in which SRC expression was classified as active versus leaky across all datasets is reported above the bar chart. Right box, GMM for a representative HUVEC dataset; expression values for three core EC genes are indicated. The cyan section summarizes data obtained by predicting leaky versus active genes using zTPM expression standardization for each dataset. Left box, stacked bar chart depicting the number of datasets in which SRC expression was above the −2.38 zTPM threshold, resolved by EC subtype. The percentage of datasets in which SRC was detected above the threshold in each EC subtype is reported below each bar. The percentage of datasets in which SRC was detected above the threshold across all datasets is reported above the bar chart. Right box, boxplots of SRC zTPM values for individual datasets, resolved by EC subtype, including the median (center line). The red dashed line indicates the −2.38 zTPM gene expression threshold. The orange section provides a summary by cell type for the number of datasets analyzed per EC subtype alongside outputs for the analysis of TPM values and the GMM versus zTPM predictions.