Table 1 Manual annotation results for the top ten most important classification features.

From: Quantitative prediction of disinfectant tolerance in Listeria monocytogenes using whole genome sequencing and machine learning

Pan-genome gene cluster

Protein description

Identity (%)

Coverage (%)

E-value

group_6155

TetR family transcriptional regulator

100

96

1.2E–95

group_11531

LPXTG cell wall anchor domain-containing protein

100

13

3.2E–07

group_8857

DUF6262 family protein (TnpC)

100

99

1.0E–75

qacC

Quaternary ammonium compound efflux SMR transporter QacH

100

99

6.0E–81

group_6910

HK97 gp10 family phage protein

100

100

1.4E–83

group_11002

Quaternary ammonium compound efflux SMR transporter QacH

92.6

97

6.5E–70

group_11916

DUF1642 domain-containing protein

60.6

57

2.1E–54

group_11003

TetR family transcriptional regulator

76.7

40

1.8E–69

ebrB

Quaternary ammonium compound efflux SMR transporter BcrC

100

100

3.6E–76

group_8424

Hypothetical protein (RepC)

100

99

3.9E–66