Table 1 Sociodemographic characteristics of the commercial dataset compared with the SLaM dataset.

From: Transdiagnostic individualized clinically-based risk calculator for the automatic detection of individuals at-risk and the prediction of psychosis: external replication in 2,430,333 US patients

 

| | Commercial (external validation database) (n = 2,430,333) | SLaM (original development database) (n = 34,209) |
|---|---|---|
| | Mean (SD) | Mean (SD) |
| Age, years | 34.2 (16.88) | 34.43 (18.89) |
| Ethnicity(a) | | No. (%) |
| Black | 0.12 (0.10) | 7,055 (22.19) |
| White | 0.79 (0.11) | 18,768 (59.03) |
| Asian | 0.04 (0.04) | 1,149 (3.61) |
| Mixed | 0.03 (0.01) | 1,319 (4.15) |
| Other | 0.02 (0.03) | 3,502 (11.02) |
| Sex | No. (%) | No. (%) |
| Male | 995,262 (40.95) | 17,511 (51.20) |
| Female | 1,435,071 (59.05) | 16,688 (48.80) |
| Index diagnosis | No. (%) | No. (%) |
| CHR-P | - | 314 (0.92) |
| Acute and transient psychotic disorders | 1,316 (0.05) | 747 (2.18) |
| Substance use disorders | 153,401 (6.31) | 7,187 (21.01) |
| Bipolar mood disorders | 64,623 (2.66) | 980 (2.86) |
| Nonbipolar mood disorders | 543,854 (22.38) | 6,364 (18.60) |
| Anxiety disorders | 1,092,893 (44.97) | 8,279 (24.20) |
| Personality disorders | 11,572 (0.48) | 1,297 (3.79) |
| Developmental disorders | 74,072 (3.05) | 1,413 (4.13) |
| Childhood/adolescence onset disorders | 418,316 (17.21) | 4,201 (12.28) |
| Physiological syndromes | 68,476 (2.82) | 2,560 (7.48) |
| Mental retardation | 1,810 (0.07) | 867 (2.53) |

(a) Ethnicity data in Commercial were imputed, so they are not directly comparable with SLaM. The means and SDs shown for Commercial are the average proportions of each ethnicity in patients’ Metropolitan Statistical Areas (MSAs).
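
The footnote's summary statistic can be illustrated with a minimal sketch: each patient is assigned the ethnicity proportion of the MSA they live in, and the mean and SD are then taken across patients. Everything in the block is hypothetical: the MSA names, the proportions, the patient list, and the `summarize` helper are illustrative and not taken from the study, and the use of the sample SD is an assumption, since the source does not say which estimator was used.

```python
from statistics import mean, stdev

# Hypothetical MSA-level proportions for one ethnic group (illustrative values only).
msa_proportion = {"msa_a": 0.10, "msa_b": 0.20, "msa_c": 0.30}

# Hypothetical patients, each recorded by the MSA they live in.
patient_msa = ["msa_a", "msa_a", "msa_b", "msa_c"]


def summarize(patient_msa, msa_proportion):
    """Return (mean, sample SD) of the MSA-level proportion assigned to each patient."""
    values = [msa_proportion[m] for m in patient_msa]
    return mean(values), stdev(values)


avg, sd = summarize(patient_msa, msa_proportion)
print(f"mean={avg:.3f}, sd={sd:.3f}")
```

Because the statistic is computed over area-level proportions rather than individual records, it describes the neighbourhoods patients come from, which is why the Commercial column cannot be read as counts of patients per ethnicity.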