Table 1 Numbers of tokens per category corresponding to different frequency ranges obtained from Fig. 2.

From: Combining deep learning with token selection for patient phenotyping from electronic health records

Frequencies

1

2–5

5–10

10–25

25–50

50–100

100–200

200–300

300–20,000

>20,000

Numbers of tokens

20,293

12,807

4,103

4,065

2,362

1,947

1,280

554

1418

20