Table 1 Distribution of the relative frequencies of the 12 substitution classes in the development dataset for the 4,908 tumors with a somatic substitution prevalence of ≥1 sub/Mb.

From: Somatic substitution signature as an innovative tool in lung cancer diagnosis

 

All

Other location

Lung cancer

p-value

(n = 4908)

(n = 4206)

(n = 702)

%CA >12

13.47% (661)

7.01% (295)

52.14% (366)

<0.0001

%GT >13

15.14% (743)

9.49% (399)

49% (344)

<0.0001

%CG ≥1

82.46% (4047)

84.38% (3549)

70.94% (498)

<0.0001

%GC ≥1

81.32% (3991)

83.05% (3493)

70.94% (498)

<0.0001

%CT, median (IQR)

22 (15; 30)

24 (18; 31)

11 (0; 16)

<0.0001

%GA, median (IQR)

22 (15; 30)

25 (18; 32)

11 (0; 16)

<0.0001

%AC, median (IQR)

1 (0; 2)

1 (0; 3)

1 (0; 1)

<0.0001

%TG, median (IQR)

1 (0; 3)

1 (0; 3)

0 (0; 1)

<0.0001

%AG >5

43.44% (2132)

47.81% (2011)

17.24% (121)

<0.0001

%TC >5

42.97% (2109)

47.0% (1977)

18.8% (132)

<0.0001

%AT >3

26.67% (1309)

24.82% (1044)

37.75% (265)

<0.0001

%TA >3

26.55% (1303)

24.37% (1025)

39.6% (278)

<0.0001

  1. Results presented as percentage and frequency %(n) unless otherwise indicated. IQR: interquartile range; %CA = relative frequency of C > A substitutions (%); %GT = relative frequency of G > T substitutions (%); %CG = relative frequency of C > G substitutions (%); %GC = relative frequency of G > C substitutions (%); %CT = relative frequency of C > T substitutions (%); %GA = relative frequency of G > A substitutions (%); %AC = relative frequency of A > C substitutions (%); %TG = relative frequency of T > G substitutions (%); %AG = relative frequency of A > G substitutions (%); %TC = relative frequency of T > C substitutions (%); %AT = relative frequency of A > T substitutions (%); %TA = relative frequency of T > A substitutions (%).