Table 2 Taxonomic classification of Augustus-predicted proteins into superkingdoms, assigned by the Last Common Ancestor algorithm of DIAMOND, for each dataset.

From: Improvement of eukaryotic protein predictions from soil metagenomes

| Clade | Plant-ass. | Terrestrial1 | Terrestrial2 | Total | % of total |
| --- | ---: | ---: | ---: | ---: | ---: |
| Prokaryote | 12,271,986 | 11,564,201 | 20,560,428 | 44,396,615 | 47.6 |
| Eukaryote | 4,986,024 | 1,951,235 | 1,064,070 | 8,001,326 | 8.6 |
| Viruses | 23,743 | 25,409 | 70,942 | 120,094 | 0.1 |
| Undetermined | 4,511,252 | 29,664,147 | 6,655,739 | 40,831,138 | 43.7 |
| Total | 21,793,005 | 43,204,992 | 28,351,179 | 93,349,176 | 100 |
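
The percentage column of Table 2 appears to be each clade's total divided by the grand total of 93,349,176 predicted proteins, rounded to one decimal place. The sketch below recomputes it under that assumption. The names `clade_totals`, `grand_total`, and `share_percent` are illustrative and do not come from the paper.

```python
# Per-clade counts copied from the "Total" column of Table 2.
clade_totals = {
    "Prokaryote": 44_396_615,
    "Eukaryote": 8_001_326,
    "Viruses": 120_094,
    "Undetermined": 40_831_138,
}

# Grand total copied from the bottom "Total" row of Table 2.
grand_total = 93_349_176


def share_percent(count: int, total: int, digits: int = 1) -> float:
    """Return count as a percentage of total, rounded to `digits` decimals.

    Assumption: the table's percentages are simple shares of the grand total.
    """
    return round(100 * count / total, digits)


percentages = {
    clade: share_percent(count, grand_total)
    for clade, count in clade_totals.items()
}

for clade, pct in percentages.items():
    print(f"{clade:<13}{pct:>6.1f}")
```

With the values from the table, this reproduces the reported shares of 47.6, 8.6, 0.1, and 43.7 percent.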