Table 1 The information of the organisms used in the current study

From: Protein Localization Analysis of Essential Genes in Prokaryotes

Organism

Group a

RefSeq

Dataset of essential gene b

Dataset of non-essential gene b

Source c

Acinetobacter baylyi ADP1

Bacteria (−)

NC_005966

499, 90.38%, 2.81%

2594, 76.21%, 2.08%

I

Bacillus subtilis 168

Bacteria (+)

NC_000964

271, 94.83%, 1.48%

3955, 81.85%, 1.29%

II

Bacteroides thetaiotaomicron VPI-5482

Bacteria (−)

NC_004663

325, 84.62%, 5.23%

4453, 65.10%, 5.19%

II

Burkholderia thailandensis E264

Bacteria (−)

NC_007650

42, 84.73%, 4.93%

2314, 70.65%, 3.23%

II

 

Bacteria (−)

NC_007651

364, 84.73%, 4.93%

2912, 70.65%, 3.23%

II

Campylobacter jejuni NCTC 11168

Bacteria (−)

NC_002163

228, 76.75%, 3.07%

1395, 80.43%, 2.94%

II

Caulobacter crescentus NA1000

Bacteria (−)

NC_011916

480, 83.96%, 4.79%

3224, 63.31%, 3.97%

I

Escherichia coli MG1655

Bacteria (−)

NC_000913

609, 80.13%, 2.79%

2923, 82.18%, 2.39%

I

Francisella novicida U112

Bacteria (−)

NC_008601

392, 87.76%, 3.83%

1329, 76.98%, 3.01%

II

Haemophilus influenzae Rd KW20

Bacteria (−)

NC_000907

642, 87.07%, 1.25%

512, 87.11%, 1.95%

I

Helicobacter pylori 26695

Bacteria (−)

NC_000915

323, 76.47%, 2.48%

1135, 76.83%, 4.58%

I

Methanococcus maripaludis S2

Archaeon

NC_005791

519, 92.49%, 0.39%

1077, 86.72%, 0.09%

I

Mycobacterium tuberculosis H37Rv

Bacteria (−)

NC_000962

687, 85.01%, 3.78%

3070, 61.34%, 2.31%

I

Mycoplasma genitalium G37

Mycoplasmas

NC_000908

381, 78.48%, 0.52%

94, 67.02%, 0.00%

I

Mycoplasma pulmonis UAB CTIP

Mycoplasmas

NC_002771

310, 80.00%, 0.65%

322, 60.25%, 0.62%

I

Porphyromonas gingivalis ATCC 33277

Bacteria (−)

NC_010729

463, 87.26%, 1.94%

1627, 62.81%, 3.01%

II

Pseudomonas aeruginosa PAO1

Bacteria (−)

NC_002516

117, 85.47%, 1.71%

5454, 76.88%, 3.04%

II

Pseudomonas aeruginosa UCBPP-PA14

Bacteria (−)

NC_008463

335, 79.10%, 1.79%

960, 65.83%, 1.98%

I

Salmonella enterica serovar Typhi Ty2

Bacteria (−)

NC_004631

358, 90.78%, 2.79%

3906, 73.35%, 2.18%

I

Salmonella enterica serovar Typhimurium SL1344

Bacteria (−)

NC_016810

353, 88.67%, 2.55%

4035, 74.65%, 2.55%

I

Salmonella enterica serovar Typhimurium 14028S

Bacteria (−)

NC_016856

105, 90.48%, 1.90%

5210, 63.88%, 2.13%

II

Salmonella typhimurium LT2

Bacteria (−)

NC_003197

230, 87.83%, 3.91%

4228, 74.83%, 2.46%

II

Shewanella oneidensis MR-1

Bacteria (−)

NC_004347

402, 88.81%, 3.48%

1103, 86.49%, 3.45%

I

Sphingomonas wittichii RW1

Bacteria (−)

NC_009511

535, 76.26%, 2.80%

4315, 72.28%, 4.47%

II

Staphylococcus aureus N315

Bacteria (+)

NC_002745

302, 95.03%, 0.66%

2281, 80.45%, 1.18%

II

Staphylococcus aureus NCTC 8325

Bacteria (+)

NC_007795

351, 90.03%, 1.99%

2541, 76.62%, 0.87%

II

Streptococcus sanguinis SK36

Bacteria (+)

NC_009009

218, 94.50%, 0.46%

2052, 82.36%, 1.17%

I

Vibrio cholerae N16961

Bacteria (−)

NC_002505

565, 56.87%, 1.41%

2105, 78.05%, 2.89%

II

 

Bacteria (−)

NC_002506

214, 56.87%, 1.41%

838, 78.05%, 2.89%

II

  1. aBacteria (+),Gram-positive bacteria; Bacteria (−), Gram-negative bacteria.
  2. bThe dataset description usually contain three numbers: X, Y%, Z%. X is the number of essential (or non-essential) genes of the organism. Y% is the prediction coverage of essential (or non-essential) genes. Z% is the percentage of proteins which may have multiple localization sites among essential (or non-essential) genes.
  3. cSource of the non-essential genes. I, the non-essential genes are obtained based on the original literatures. II, the non-essential genes are the complementary set of essential genes.