Table 1 Numbers of tokens per category corresponding to different frequency ranges obtained from Fig. 2.
From: Combining deep learning with token selection for patient phenotyping from electronic health records
Frequencies | 1 | 2–5 | 5–10 | 10–25 | 25–50 | 50–100 | 100–200 | 200–300 | 300–20,000 | >20,000 |
Numbers of tokens | 20,293 | 12,807 | 4,103 | 4,065 | 2,362 | 1,947 | 1,280 | 554 | 1418 | 20 |