Table 2 Parent profile identification from synthetic patterns.

From: SaccharomycesIDentifier, SID: strain-level analysis of Saccharomyces cerevisiae populations by using microsatellite meta-patterns

| Sample name | Expected | Identified | True positive rate | False positives | GLMerror* |
|---|---|---|---|---|---|
| A | 01_MF+02_MF | **01_MF**, **02_MF**, 03_MF | 100% | 1 | 0.61 |
| B | 02_MF+03_MF | **02_MF**, **03_MF** | 100% | 0 | 1.22 |
| C | 01_MF+02_MF+13_MF | **01_MF**, **02_MF**, 03_MF, **13_MF** | 100% | 1 | 0.30 |
| D | 01_MF+03_MF+13_MF | **01_MF**, **03_MF**, **13_MF** | 100% | 0 | 0.30 |
| E | 01_MF+02_MF+03_MF+13_MF | **01_MF**, **02_MF**, **03_MF**, **13_MF** | 100% | 0 | 0.30 |
| F | 01_MF+02_MF+03_MF+08_MF+13_MF | **01_MF**, **02_MF**, **03_MF**, **13_MF** | 80% | 0 | 0.30 |

In the “Identified” column, strain IDs in bold are the parental strains correctly identified by the GLM. True positive rate is the percentage of parental strains in the sample that the model identified. The “False positives” column gives the number of strains identified by the lasso analysis but not present in the query sample. *GLMerror was estimated as the percentage of alleles that differ between the query sample and the combined patterns of the identified strains, out of the total number of alleles (equation 3).
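
The true positive rate and false positive count defined in the note can be recomputed from the “Expected” and “Identified” columns. The following is a minimal sketch, not the SID implementation: the function name `score_identification` and the `samples` dictionary are hypothetical and introduced only for illustration. GLMerror is not reproduced here because it depends on the allele patterns and on equation 3, neither of which appears in this table.

```python
def score_identification(expected, identified):
    """Return (true positive rate in %, number of false positives).

    expected   -- strain IDs actually mixed into the synthetic sample
    identified -- strain IDs reported by the model for that sample
    """
    expected_set = set(expected)
    identified_set = set(identified)

    # Parental strains that the model recovered.
    true_positives = expected_set & identified_set
    # Strains reported by the model that are not in the sample.
    false_positives = identified_set - expected_set

    tpr = 100.0 * len(true_positives) / len(expected_set)
    return tpr, len(false_positives)


# Rows of Table 2, transcribed as (expected, identified) pairs.
samples = {
    "A": (["01_MF", "02_MF"], ["01_MF", "02_MF", "03_MF"]),
    "B": (["02_MF", "03_MF"], ["02_MF", "03_MF"]),
    "C": (["01_MF", "02_MF", "13_MF"], ["01_MF", "02_MF", "03_MF", "13_MF"]),
    "D": (["01_MF", "03_MF", "13_MF"], ["01_MF", "03_MF", "13_MF"]),
    "E": (["01_MF", "02_MF", "03_MF", "13_MF"],
          ["01_MF", "02_MF", "03_MF", "13_MF"]),
    "F": (["01_MF", "02_MF", "03_MF", "08_MF", "13_MF"],
          ["01_MF", "02_MF", "03_MF", "13_MF"]),
}

if __name__ == "__main__":
    for name, (expected, identified) in samples.items():
        tpr, n_fp = score_identification(expected, identified)
        print(f"{name}: true positive rate = {tpr:.0f}%, false positives = {n_fp}")
```

For sample F, four of the five parental strains are recovered (08_MF is missed), which gives the 80% reported in the table; for samples A and C, the extra strain 03_MF accounts for the single false positive.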