Table 2 Statistics on edges in PrimeKG.

From: Building a knowledge graph to enable precision medicine

Relation type

Count

Percent (%)

Anatomy - Protein (present)

3,036,406

37.5

Drug - Drug

2,672,628

33.0

Protein - Protein

642,150

7.9

Disease - Phenotype (positive)

300,634

3.7

Biological process - Protein

289,610

3.6

Cellular component - Protein

166,804

2.1

Disease - Protein

160,822

2.0

Molecular function - Protein

139,060

1.7

Drug - Phenotype

129,568

1.6

Biological process - Biological process

105,772

1.3

Pathway - Protein

85,292

1.1

Disease - Disease

64,388

0.8

Drug - Disease (contraindication)

61,350

0.8

Drug - Protein

51,306

0.6

Anatomy - Protein (absent)

39,774

0.5

Phenotype - Phenotype

37,472

0.5

Anatomy - Anatomy

28,064

0.3

Molecular function - Molecular function

27,148

0.3

Drug - Disease (indication)

18,776

0.2

Cellular component - Cellular component

9,690

0.1

Phenotype - Protein

6,660

0.1

Drug - Disease (off-label use)

5,136

0.1

Pathway - Pathway

5,070

0.1

Exposure - Disease

4,608

0.1

Exposure - Exposure

4,140

0.1

Exposure - Biological process

3,250

<0.1

Exposure - Protein

2,424

<0.1

Disease - Phenotype (negative)

2,386

<0.1

Exposure - Molecular function

90

<0.1

Exposure - Cellular component

20

<0.1

Total

8,100,498

100.0

  1. Listed are the numbers of directed edges in PrimeKG.