Table 2 Top genes for Sequence Cluster prediction.

From: Lineage structure of Streptococcus pneumoniae may be driven by immune selection on the groEL heat-shock protein

| SPN23F locus | Group | Name | Type/Function |
|---|---|---|---|
| 00090 | | | Phosphoglycerate mutase |
| 00540 | | recO | DNA recombination and repair protein |
| 00660 | | vanZ | Teicoplanin resistance protein |
| 02370 | | | Transcriptional regulator |
| 03790 | | spi | Signal peptidase I |
| 04050 | | | Hypothetical protein |
| 04730 | | | Histidine triad nucleotide-binding protein |
| 06210 | | | ABC transporter, ATP-binding protein |
| 06880 | | sodA | Manganese superoxide dismutase |
| 07240 | | | Hypothetical protein |
| 07340 | | | Hydrolase, haloacid dehalogenase-like family |
| 07930 | | iscU | Putative iron-sulfur cluster assembly scaffold protein |
| 08320 | | | Putative membrane protein |
| 09040 | | | O-methyltransferase family protein C1 |
| 09280 | | lmb | Laminin-binding protein |
| 09460 | | | N-acetyltransferase GNAT family protein |
| 10040 | | | Cytosolic protein containing multiple CBS domains |
| 10480 | | | Hypothetical protein |
| 10670 | | pdhB | Acetoin dehydrogenase E1 component β-subunit |
| 11320 | | | Acetyltransferase GNAT family protein |
| 11630 | | licA | Choline kinase |
| 11660 | | carB | Membrane protein/O-antigen and teichoic acid |
| 13490 | | | Hypothetical protein |
| 14640 | | lta | Bacteriocin transport accessory protein |
| 15100 | | pclA | Putative NADPH-dependent FMN reductase |
| 16930 | | | Hypothetical protein |
| 17080 | | | Hypothetical protein |
| 18130 | | | Hypothetical protein |
| 19240 * | a | recX | Regulatory protein |
| 19250 * | a | | Cysteinyl-tRNA synthase related protein |
| 19300 * | b | groEL | Heat shock protein 60 family chaperone |
| 19310 * | b | groES | Heat shock protein 60 family co-chaperone |
| 19330 * | b | | Short-chain dehydrogenase |
| 19340 * | b | ytpR | Phenylalanyl-tRNA synthetase domain protein |
| 19360 * | b | | Hypothetical protein |
| 19370 * | b | | Hypothetical protein |
| 19380 * | b | | Membrane protein |
| 19390 * | b | | Response regulator of LytR/AlgR family |
| 20880 | c | | Hydrolase, haloacid dehalogenase-like family |
| 20900 | c | thrC | Threonine synthase |
| 22500 | | mreD | Rod shape-determining protein |

  1. Genes marked with * lie within 10 genes upstream or downstream of the groESL operon. Letters a to c denote groups of contiguous genes (minimum proximity of 2 genes).
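
The grouping rule in the footnote can be illustrated with a minimal sketch. It rests on two assumptions that the table does not state: that consecutive genes differ by 10 in their SPN23F locus number, and that "proximity of 2 genes" means two listed genes are at most two positions apart. The function name `contiguous_groups` and the constants are hypothetical, introduced only for this illustration; this is not the authors' procedure.

```python
from string import ascii_lowercase

# Locus numbers copied from the table (SPN23F prefix and leading zeros dropped).
LOCI = [
    90, 540, 660, 2370, 3790, 4050, 4730, 6210, 6880, 7240, 7340, 7930,
    8320, 9040, 9280, 9460, 10040, 10480, 10670, 11320, 11630, 11660,
    13490, 14640, 15100, 16930, 17080, 18130, 19240, 19250, 19300, 19310,
    19330, 19340, 19360, 19370, 19380, 19390, 20880, 20900, 22500,
]

TAG_STEP = 10       # assumption: adjacent genes differ by 10 in locus number
MAX_GAP_GENES = 2   # assumption: "proximity of 2 genes" = at most 2 positions apart


def contiguous_groups(loci, tag_step=TAG_STEP, max_gap_genes=MAX_GAP_GENES):
    """Label runs of two or more nearby loci with letters a, b, c, ..."""
    groups, current = [], []
    for locus in sorted(loci):
        # Close the current run when the next locus is too far away.
        if current and locus - current[-1] > tag_step * max_gap_genes:
            if len(current) > 1:
                groups.append(current)
            current = []
        current.append(locus)
    if len(current) > 1:
        groups.append(current)
    return dict(zip(ascii_lowercase, groups))


groups = contiguous_groups(LOCI)
for letter, members in groups.items():
    print(letter, [f"{m:05d}" for m in members])
```

Under these assumptions the sketch yields three groups whose members match the rows labelled a, b and c in the table; isolated genes such as 11630 and 11660, which are three positions apart, stay unlabelled.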