Table 1 Coverage included in the filtered County Tweet Lexical Bank dataset from 2019 to 2020

From: Robust language-based mental health assessments in time and space through social media

CTLB Data Descriptives

 

Count

Word Instances

15,361,519,145

Posts

992,194,052

Unique Words

57,448,057

Users

2,198,980

Counties

1490

 

Mean (S.D.)

Posts per User

451.2 (749.9)

Posts per User/Week

10.2 (25.7)

Users per County

1249.4 (4,609.7)

  1. Filtering consisted of excluding non-English posts, reposts, posts containing a hyperlink, and duplicated posts from users. Standard deviations are included in parentheses next to mean measurements.