Table 1 Summary of pangenome analysis results.

From: Mash-based analyses of Escherichia coli genomes reveal 14 distinct phylogroups

Phylogroup

Core genome (97% strains)

Accessory genome

Unique

Total (Pangenome)

Core/pan (%)

No. of strains

Clusters

Proteins

Clusters

Proteins

Clusters

Proteins

Clusters

Proteins

Clusters

All

2663

28,566,052

82,821

22,783,754

50,499

51,099

135,983

51,400,905

1.96

10,667

A

3184

7,142,893

41,769

3,246,591

24,501

24,828

69,454

10,414,312

4.58

2232

B1

3141

9,365,646

44,019

4,887,086

24,590

24,844

71,750

14,277,576

4.38

2960

B2-1

3708

2,016,812

10,990

619,867

7048

7180

21,746

2,643,859

17.05

541

B2-2

3425

4,709,983

22,762

1,819,538

12,566

12,763

38,753

6,542,284

8.84

1367

C

3899

2,132,258

10,413

738,879

5242

5290

19,554

2,876,427

19.94

540

D1

3666

1,006,271

10,012

318,372

7659

7770

21,337

1,332,413

17.18

273

D2

3524

626,693

11,703

221,033

6765

7181

21,992

854,907

16.02

177

D3

3754

668,359

7252

201,292

4814

4936

15,820

874,587

23.73

177

E1

3151

885,018

14,883

471,354

7969

8088

26,003

1,364,460

12.12

279

E2(O157)

4060

3,080,073

6128

743,413

4442

4535

14,630

3,828,021

27.75

750

F

3486

698,031

9465

288,420

5381

5480

18,332

991,931

19.02

199

G

3783

365,756

5716

98,269

4016

4066

13,515

468,091

27.99

96

Shig1

3128

564,868

4903

256,426

2815

2883

10,846

824,177

28.84

177

Shig2

3732

3,383,814

6870

719,247

4751

4799

15,353

4,107,860

24.31

899

  1. Values obtained from the different pangenome analysis using the 14 phylogroups separately and the entire set of assembled genomes (10,667 genomes) using UCLUST47. The same parameters were used throughout all of the analysis.