Fig. 2: Summary of the steps followed for the comorbidity analyses conducted using transcriptomic data.

a Generate differential gene expression profiles for each patient (comparing their expression with the expression detected in the control samples), selecting the top 500 up- and downregulated genes for the Fisher’s exact test analysis. Then, use the similarity of these profiles to generate the patient similarity network. Drugs are then associated to patients based on the similarities between patients’ differential expression profiles and the alterations of gene expression generated by a large collection of drugs on cell lines as recorded in the LINCS database (see “Methods”). b The number of observed patient−patient interactions connecting different diseases is used to calculate relative molecular similarities as shown, leading to the construction of the Disease Molecular Similarity Network (DMSN) and of the Stratified Comorbidity Networks (SCN). The obtained DMSN is compared to published epidemiological networks and all results are available in the Disease PERCEPTION portal.