Table 2 Biomedical databases commonly used in DDI prediction tasks and their detailed information

From: Machine learning models for drug-drug interaction prediction from computational discovery to clinical application

Database

Entity

Introduction

Sample Size

Positive Rate

Update Frequency

Known Bias

URL

DrugBank46,47,48

Drug, Target, Interaction

DrugBank is a database containing drugs, drug targets and drug interactions, providing detailed pharmacological and clinical data.

4563 FDA approved drugs, 6231 investigational drugs, 1,413,413 drug–drug interactions, 2475 drug-food interactions

TP_rate:0.96 FP_rate:0.32

Monthly

http://www.drugbank.ca

PubChem49,50,51,52

Structure, Chemical, Bioactivity

PubChem is a data platform containing chemical substances and their biological activities, covering the detailed information of various chemical molecules.

121,458,159 Compounds, 334,561,470 Substances, 1,768,452 BioAssays, 250,942 Pathways

Weekly

https://pubchem.ncbi.nlm.nih.gov

KEGG53,54,55

Gene, Pathway, Drug

KEGG provides data on biology, genomics and metabolic pathways, especially in drug discovery and disease research.

over 10 thousand complete genomes of cellular organisms, over 50 million genes

TP_rate:0.93 FP_rate:0.36

1–2 days

http://www.kegg.jp

ChEMBL236,237,238

Compound, Target, Bioactivity

ChEMBL is a data platform containing pharmaceutical compounds and their biological activities, focusing on the information in the process of drug discovery.

over 20.3 million bioactivity measurements and 2.4 million unique compounds

TP_rate:0.99

Regularly

differences between various input data sources for an individual drug or clinical candidate drug

https://www.ebi.ac.uk/chembl

PharmGKB239,240,241

Drug, Gene, Disease

PharmGKB provides related data about drugs and genes, which helps to study pharmacogenomics and individualized treatment.

1994 genes involved in drug response, 240 genes high-quality genotype variation data, 1671 literature entries

Regularly

https://www.pharmgkb.org

SIDER242

Drug, Side Effect

SIDER provides data on the relationship between drugs and their side effects, covering a large number of drugs and their side effects related to patients.

1430 drugs, 5880 ADRs, 140,064 Drug-ADR pairs

FP_rate:0.012

Irregularly

http://sideeffects.embl.de

TWOSIDES56

Drug, Side Effect, Interaction

TWOSIDES provides information about the correlation between drugs and side effects, and contains data of drug interaction, which is helpful to evaluate the safety of drugs.

868,221 significant associations between 59,220 pairs of drugs and 1301 adverse events,

TP_rate:0.99 FP_rate:0.7

Irregularly

https://tatonettilab.org/projects

DDInter57,58

Drug-Drug Interaction

DDInter recorded drug interactions, mechanisms, clinical risks and management suggestions to help avoid adverse drug reactions and interactions.

2310 drugs, 302,516 drug-drug interaction records, 857 DFIs, 8359 DDSIs, 6033 therapeutic duplication records

Irregularly

http://ddinter.scbdd.com

GDSC243

Drug Sensitivity, Cancer Genomic Data

The GDSC database is a resource that provides drug sensitivity data and genomic information from cancer cell lines, aimed at discovering new therapeutic biomarkers for cancer treatment.

73,169 cell line-drug interactions, 329–668 cell drug (mean = 525)

4 months

https://www.cancerrxgene.org

DrugComb244,245

Drug Combinations, Sensitivity, Synergy

DrugComb is a web-based portal website used for depositing and analyzing drug combination screening datasets, providing improved algorithms for drug synergy and sensitivity analysis.

751,498 drug combinations, 717,684 single drug screenings

Irregularly

93.2% of the data points lack replicates.

https://drugcomb.org

DrugCombDB246,247

Drug, Drug Combinations, cancer cell lines

DrugCombDB is a comprehensive database of cancer treatment drug combinations, which is crucial for both experimental and computational screening of synergistic drug combinations.

448,555 drug combinations, 2887 unique drugs, 124 human cancer cell lines

Irregularly

Synergy scores fluctuate with dosage concentrations

http://drugcombdb.denglab.org

RepurposeDrugs248

Drug

RepurposeDrugs is a web tool and database that offers drug-indication associations and predicted approval likelihoods for single drugs and combinations, with interactive visualizations.

4314 compounds, 161 drug combinations, 28,148 drug-disease pairs

Irregularly

Initial database organization may have overlooked some approved drugs and disease indications

https://repurposedrugs.org