The All of Us Research Program’s wearables dataset

Patten, Theresa; Preble, Edward A.; Master, Hiral; Adjemian, Jennifer; Ramirez, Andrea; McClain, James; Price, Amy Rose

doi:10.1038/s41591-026-04352-3

Resource
Open access
Published: 27 April 2026

Nature Medicine (2026)

Abstract

Digital health technologies (DHTs) are revolutionizing medical research, offering unprecedented insights into health monitoring and disease detection through continuous, real-world data collection. Here we characterize the data in one of the largest and most demographically rich DHT datasets as part of the All of Us Research Program. Through a historic device distribution effort, the program reached a broad range of participants nationwide, yielding a DHT dataset with an expanded demographic scope. This dataset contains Fitbit data from more than 59,000 participants spanning 14 years with more than 39 million step observations and 31 million sleep observations. Nearly half (44%) of participants with Fitbit data also contributed electronic health records, physical measurements, genomics and survey data. This resource enables researchers to study relationships between digital health metrics and clinical outcomes, advancing DHT methodologies through its large size, broad representation and multi-modal data linkage.

Main

DHTs are fundamentally transforming biomedical research by enabling continuous, real-world health monitoring at unprecedented scale and granularity. Commercial wearable devices, now owned by 20–45% of people in the USA^1,2,3, generate rich streams of physiological and behavioral data that can inform clinical decision-making and improve patient outcomes^4,5. However, the potential impact of these technologies remains constrained by substantial demographic biases, with wearables ownership and much of DHT research disproportionately representing white individuals with higher educational and income levels^3,6,7. This gap in representation limits our understanding of how digital biomarkers manifest across demographic groups and creates barriers to developing health interventions based on DHT data that benefit all.

The National Institutes of Health’s All of Us Research Program is a historic initiative to collect health data from 1 million or more people living in the USA and to make these data available for research purposes to registered users. The program’s mission is to address historical gaps in research experience and advance precision health for all, particularly those from populations with unique life experiences and health needs (for example, older adults, rural populations, individuals with less access to healthcare)⁸.

Since November 2020, the program has made de-identified Fitbit data available to researchers through its Bring Your Own Device (BYOD) program, enabling novel investigations that integrate DHT measures with other rich data types such as electronic health records (EHR), physical measurements and surveys. However, like many DHT datasets, BYOD data lacked sufficient representation from broad demographic populations needed to advance health research and precision medicine⁹. To address this limitation, the All of Us Research Program created the Wearables Enhancing All of Us Research (WEAR) study—a device distribution effort that provided Fitbit devices to invited participants from across the USA at no cost¹⁰. This strategic approach substantially expanded the breadth of participants contributing wearables data to the All of Us Research Program, including from many populations that have been historically underrepresented in DHT research.

The most recent data release provides registered researchers with Fitbit data from more than 59,000 participants, spanning 14 years and including 39 million step observations and 31 million sleep observations. This Resource paper presents a characterization of the expanded WEAR study dataset, documenting its size and the breadth of participant representation. The combined WEAR and BYOD dataset represents one of the largest wearables datasets available for biomedical research, enabling investigators to examine patterns of physical activity, sleep and their relationship to health outcomes across numerous population groups. By linking wearables data with other rich data types such as genomics, EHRs and survey responses, this resource creates opportunities to advance our understanding of digital biomarkers and their clinical applications. We also provide methodological considerations and frameworks for responsible use to help researchers maximize the scientific and societal impact of this valuable resource.

Results

Cohort size and demographics

Enrollment in the All of Us Research Program and the WEAR study began in May 2017 and February 2021, respectively, as indicated by gray and blue vertical lines in Fig. 1a. The number of participants contributing data through BYOD has grown steadily and includes data from before enrollment began for the All of Us Research Program. This is possible because participants can donate all historical Fitbit data when they consent, including activity data recorded before they enrolled in the program. The WEAR study began as a limited pilot in 2021, with enrollment increasing gradually through protocol refinements in 2022. A major expansion from April to June 2023 broadened the eligibility criteria and initiated large-scale participant recruitment (Fig. 1a). Both the BYOD and WEAR programs show widespread participation across the country, with participants from all 50 states (Fig. 1b,c). Geographic distributions are similar between BYOD and WEAR, with the highest concentrations in states such as California, Wisconsin, Pennsylvania and Illinois, where the All of Us Research Program has large healthcare provider organization partners.

**Fig. 1: Temporal and geographic distribution of Fitbit data availability.**

To measure the success of the WEAR study in collecting Fitbit data from participants with varying health needs and research experiences, we compared the demographic characteristics of those who donated activity data through the BYOD program (n = 32,035) to WEAR study participants (n = 22,474) in our general activity analysis cohort (Table 1). As expected, the demographic profile of the WEAR study cohort differed from the BYOD cohort across a range of characteristics (percentages below are given as BYOD versus WEAR), including self-reported race and ethnicity (for example, 77.3% versus 55.1% of participants reported being white), age (for example, 22.3% versus 15.7% of participants reported an age between 55 and 64 years), income (for example, 6.1% versus 15.2% of participants reported a household annual income between US$10,000 and US$25,000), education (for example, 32.2% versus 27.2% of participants reported having a college degree), healthcare access and utilization (for example, 32.8% versus 43.4% of participants reported inadequate access to healthcare) and disability status (for example, 1.8% versus 4.1% reported blindness or difficulty seeing) (Table 1).

Table 1 Demographic characteristics of general activity cohort BYOD and WEAR participants

Cohort-level trends in daily steps and sleep duration

We quantified high-level wearables metrics in our general activity cohort (n = 54,509) and general sleep cohort (n = 34,378). Participants in these cohorts recorded a median of 6,454 daily steps (interquartile range (IQR) 4,432–8,958) and a median daily sleep duration of 6.8 h (IQR 6.2–7.2), respectively (Table 2; ‘Full cohort’). In addition, using the sleep duration categories defined previously¹¹, we found that 36.6% (n = 12,592) of participants in our general sleep cohort had median sleep durations in the normal range (7–9 h per night), while 61.1% (n = 21,020) had short sleep (5–7 h per night), 1.4% (n = 492) had very short sleep (<5 h per night) and 0.8% (n = 274) had long sleep (≥9 h per night)¹².
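The categorization above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the function name `sleep_category`, the sample values and the handling of exact boundary values (7 h is assigned to the normal range here, while 9 h is assigned to long sleep, following the '≥9 h' wording) are assumptions.

```python
from collections import Counter

def sleep_category(median_hours: float) -> str:
    """Map a participant's median nightly sleep (hours) to a duration category."""
    if median_hours < 5:
        return "very short"
    if median_hours < 7:
        return "short"
    if median_hours < 9:  # assumption: exactly 7 h counts as normal
        return "normal"
    return "long"         # >= 9 h, as stated in the text

# Hypothetical per-participant median sleep durations, in hours.
medians = [4.5, 6.8, 6.2, 7.4, 9.1, 5.0, 7.0]
counts = Counter(sleep_category(h) for h in medians)
shares = {cat: round(100 * n / len(medians), 1) for cat, n in counts.items()}
```

Applying the same function to each participant's median main sleep duration, then dividing category counts by the cohort size, would yield percentages of the kind reported above.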

Table 2 Baseline wearables outcomes by cohort with select demographic group comparisons

Next, we compared high-level wearables metrics between the BYOD and WEAR cohorts, and reported median daily steps and sleep durations across select demographic characteristics, including sex, age, race, ethnicity, and American Indian and Alaska Native status (Table 2). Both median daily steps and sleep duration were significantly higher in the BYOD cohort than in the WEAR cohort (Table 2). Specifically, the BYOD activity cohort recorded a median of 6,867 steps (n = 32,035) compared to 5,797 steps in the WEAR activity cohort (n = 22,474), and a median sleep duration of 6.8 h in the BYOD sleep cohort (n = 21,794) compared to 6.6 h in the WEAR sleep cohort (n = 12,584) (Mann–Whitney U tests; P < 2.2 × 10⁻¹⁶ for both comparisons). Stratified comparisons by additional demographic subgroups (for example, sex, age, self-reported race or ethnicity) were not conducted.
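For readers unfamiliar with the Mann–Whitney U test, the following standard-library sketch shows the rank-based computation using the normal approximation with a tie correction and no continuity correction. It is an illustration only: the function name and toy data are hypothetical, the software used for the reported tests is not stated here, and in practice an established statistics package would be preferable.

```python
import math
from statistics import NormalDist

def mann_whitney_u(x, y):
    """Two-sided Mann-Whitney U test (normal approximation, tie-corrected).

    Returns (U statistic for x, approximate P value). Assumes the pooled
    values are not all identical, otherwise the variance is zero.
    """
    n1, n2 = len(x), len(y)
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    ranks = [0.0] * len(pooled)
    tie_term = 0.0
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        midrank = (i + 1 + j) / 2      # average of 1-based ranks i+1 .. j
        for k in range(i, j):
            ranks[k] = midrank
        t = j - i                      # size of this tie group
        tie_term += t**3 - t
        i = j
    r1 = sum(r for r, (_, group) in zip(ranks, pooled) if group == 0)
    u1 = r1 - n1 * (n1 + 1) / 2
    n = n1 + n2
    mean_u = n1 * n2 / 2
    var_u = n1 * n2 / 12 * ((n + 1) - tie_term / (n * (n - 1)))
    z = (u1 - mean_u) / math.sqrt(var_u)
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return u1, p

# Toy example with hypothetical daily step medians for two small groups.
u, p = mann_whitney_u([5200, 5800, 6100], [6900, 7200, 7500])
```

With cohorts of tens of thousands of participants, even modest differences in medians produce extremely small P values, so the effect size (here, roughly 1,000 steps and 0.2 h) is the more informative quantity.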

Seasonal trends in daily steps and sleep duration

Next, to evaluate whether the All of Us Research Program’s Fitbit dataset reflects established seasonal patterns in physical activity and sleep^13,14, we calculated the normalized median (IQR) daily steps and sleep durations of eligible participants (per month) and plotted values between January 2018 and September 2023 in Fig. 2 (absolute values are available in Supplementary Fig. 1).

**Fig. 2: Seasonal variation in physical activity and sleep.**

We observed expected seasonal trends in physical activity and sleep, with more steps generally taken in spring and summer than in winter (Fig. 2a; n = 53,295) and longer sleep durations in winter (Fig. 2b; n = 33,471). However, seasonal variation was more pronounced for steps than sleep (approximately ±10% change versus approximately ±2%) (Fig. 2a,b). A notable exception in both trends occurred in 2020, likely due to the COVID-19 pandemic and associated lockdowns, as has been previously reported¹⁵. During this year, daily steps continued declining through March and April, while daily sleep durations increased, contrary to typical seasonal patterns.
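One way to express monthly values as relative deviations is sketched below. The normalization used for Fig. 2 is not detailed in this section, so the choice made here (dividing each monthly median by the mean of all monthly medians) is an assumption, as are the function name and the sample records.

```python
from collections import defaultdict
from statistics import mean, median

def normalized_monthly_medians(records):
    """records: iterable of (year, month, value) daily observations.

    Returns {(year, month): monthly median / mean of all monthly medians},
    so that 1.0 means 'typical' and 1.10 means 10% above typical.
    """
    by_month = defaultdict(list)
    for year, month, value in records:
        by_month[(year, month)].append(value)
    monthly = {ym: median(vals) for ym, vals in by_month.items()}
    reference = mean(list(monthly.values()))
    return {ym: m / reference for ym, m in sorted(monthly.items())}

# Hypothetical daily step counts for one winter and one summer month.
records = [
    (2019, 1, 5000), (2019, 1, 6000), (2019, 1, 7000),
    (2019, 7, 7000), (2019, 7, 8000), (2019, 7, 9000),
]
norm = normalized_monthly_medians(records)
```

Under this convention, a seasonal swing of approximately ±10% for steps corresponds to normalized values between about 0.90 and 1.10.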

Overlap of Fitbit data with other data types

A strength of the All of Us dataset is that it allows registered researchers to bring together many different data types in a secure cloud-computing environment. This report aims to highlight scientific opportunities available using the program’s extensive and demographically rich DHT dataset, rather than pursue novel discoveries. As such, the research questions and analyses described here remain purposefully high level. However, we expect external researchers will combine multiple data types to discover novel associations and risk factors. To highlight the potential power of the DHT dataset to address more complex research questions, we quantified the number of participants who donated multiple data types (Fig. 3). Of the participants who shared Fitbit data, 44% (25,877) also donated EHR data, physical measurements, genomics, responses to at least one of the Basics, Family History, Lifestyle, Personal Medical History, Overall Health, and Healthcare Access and Utilization surveys, and responses to a survey on social determinants of health (SDOH). Incorporating the environmental and social information captured by these surveys into DHT research is essential to ensure impactful advancements in the field using this expanded data resource¹⁶.

**Fig. 3: Available data types for those with Fitbit data.**

Case study with wearable and EHR data: recovery in daily steps following lower limb injury

To demonstrate the power of integrating wearables data with another data type in the same individuals, we conducted a case study examining the impact of lower limb fracture on daily step counts. Among 61 participants who sustained a lower limb fracture, the 30-day rolling average of normalized daily steps shows a sharp decline from baseline immediately after injury. This decline continued for ~33 days after injury, dropping to 40% below baseline before gradually recovering to near pre-injury levels by 120 days after injury (Fig. 4).
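The smoothing step can be illustrated with a short sketch. It assumes a trailing 30-day window and a baseline defined as the mean of all pre-injury days; neither detail is specified in this section, and the function names and synthetic step series are hypothetical.

```python
def percent_of_baseline(steps, injury_index):
    """Express each day's steps as a percentage of the mean pre-injury steps."""
    baseline = sum(steps[:injury_index]) / injury_index
    return [100 * s / baseline for s in steps]

def rolling_mean(values, window=30):
    """Trailing rolling mean; days with fewer than `window` points yield None."""
    out = []
    running = 0.0
    for i, v in enumerate(values):
        running += v
        if i >= window:
            running -= values[i - window]  # drop the value leaving the window
        out.append(running / window if i >= window - 1 else None)
    return out

# Synthetic participant: 180 pre-injury days, 30 low-activity days, then recovery.
steps = [8000] * 180 + [2000] * 30 + [8000] * 150
smoothed = rolling_mean(percent_of_baseline(steps, injury_index=180))
```

Because a trailing window averages over the preceding 30 days, changes in the underlying daily values appear later in the smoothed curve, which is worth keeping in mind when reading recovery times from a rolling average.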

**Fig. 4: Recovery of daily steps following lower limb fracture.**

Discussion

In this Resource paper, we highlight the value of the All of Us Research Program’s expanded wearables dataset. We examined how multiple DHT outcomes aligned with expected trends previously published in the literature. Specifically, we calculated baseline cohort activity and sleep outcomes in large cohorts of more than 30,000 participants, observed seasonal variations in physical activity and sleep, and presented a case study of the activity trajectory of participants following a lower limb fracture. Together, these analyses demonstrate the value and unique nature of the All of Us Fitbit data resource in terms of its scale, longitudinal observation period and integration with clinical outcomes, including those recorded in the EHR data.

The longitudinal nature of this dataset enables examination of temporal patterns. Although variation in seasonal activity and sleep is relatively well-established, few studies have measured oscillations directly via continuous activity monitoring over several years¹³. An advantage of commercial wearable device data (for example, Fitbit) in large cohort studies is potentially higher compliance and more continuous data.

Analysis of the All of Us dataset revealed expected seasonal variation in physical activity, as measured by median daily steps, including a deviation from this pattern in 2020 that likely reflects the COVID-19 pandemic. This deviation was observed in an earlier analysis of this dataset, but at that time, the sample size was much smaller (n = 5,443) and less demographically varied¹⁵. Interestingly, our analysis shows that median daily steps did not fully recover to pre-pandemic levels during the period analyzed (Fig. 2a, years 2021–2023). This incomplete recovery likely reflects two factors. First, 2021 marks the first year that WEAR participants’ step data were incorporated into the seasonal average. As shown in Table 2, WEAR participants have significantly lower step counts than their BYOD counterparts, suggesting this compositional shift in the All of Us cohort contributed to lower step counts beginning in 2021. Second, lingering pandemic-related behavioral changes, such as extended remote work policies, may have also reduced baseline activity. Future work is needed to disentangle these contributing factors. This expanded dataset will strengthen researchers’ ability to study typical physical activity patterns and the factors that influence deviations from these patterns¹⁷.

We also observed seasonal variation in sleep duration, consistent with other self-reported and objective measures in the literature, which generally show longer sleep in winter and shorter sleep in spring and summer^18,19,20,21. Notably, we observed increased sleep durations starting in winter 2020 that gradually returned to baseline by winter 2023 (Fig. 2b). Self-reported data have documented similar increases in sleep duration during this period^22,23, as have several studies using objective measures of sleep early in the COVID-19 pandemic²⁴. Our data provide additional confirmation of this pattern and extend the observation period through winter 2023, demonstrating a gradual return to baseline.

We observed a median baseline of 6,454 steps per day in our general activity cohort (Table 2). Published estimates from comparable cohorts (for example, UK Biobank and National Health and Nutrition Examination Survey (NHANES)) often report ~9,000–9,600 steps per day^25,26,27. However, these comparisons are sensitive to the step-count algorithm utilized²⁸.

In addition, All of Us ingests wearable data via the Fitbit Application Programming Interface, which provides summary tables and metrics derived from Fitbit’s proprietary algorithms. As a result, raw accelerometry data are not available to researchers. Although this standardization may improve comparability in All of Us studies, it complicates comparisons with other cohorts that do provide raw accelerometry data (for example, UK Biobank, NHANES). Furthermore, whereas NHANES and UK Biobank distribute devices for 1 week, All of Us participants donated data for extended periods. Finally, All of Us is a broad convenience sample and is not representative of the US population. Despite the WEAR study’s success in increasing the number of people from certain demographic groups (for example, lower income, less access to healthcare), the All of Us dataset is still older, more female and more highly educated than the general US population. Researchers making detailed comparisons to other cohorts or the general population should apply post-stratification or weighting methods to account for sampling and demographic differences.
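As a minimal sketch of post-stratification, each stratum can be weighted by the ratio of its population share to its sample share. All strata, counts, shares and step values below are invented for illustration and are not drawn from the All of Us data or from census figures.

```python
def poststratification_weights(sample_counts, population_shares):
    """Weight per stratum = population share / sample share."""
    n = sum(sample_counts.values())
    return {s: population_shares[s] / (sample_counts[s] / n) for s in sample_counts}

def weighted_mean(stratum_means, sample_counts, weights):
    """Mean over participants, with each participant carrying their stratum's weight."""
    total_weight = sum(sample_counts[s] * weights[s] for s in sample_counts)
    weighted_sum = sum(sample_counts[s] * weights[s] * stratum_means[s] for s in sample_counts)
    return weighted_sum / total_weight

# Hypothetical two-stratum example (values are illustrative only).
sample_counts = {"female": 700, "male": 300}
population_shares = {"female": 0.5, "male": 0.5}
stratum_means = {"female": 6000.0, "male": 7000.0}  # mean daily steps per stratum

weights = poststratification_weights(sample_counts, population_shares)
adjusted = weighted_mean(stratum_means, sample_counts, weights)
unadjusted = weighted_mean(stratum_means, sample_counts, {s: 1.0 for s in sample_counts})
```

In this toy case, reweighting shifts the estimate toward the under-sampled stratum. Real analyses would typically stratify on several variables at once (for example, age, sex and education) and would need to handle sparse or empty strata.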

The recommended daily sleep duration for adults is 7–9 h and self-reported estimates from US adults typically range from 6.5 to 7.5 h^29,30. We were interested in how these subjective sleep durations, from surveys such as NHANES, would compare to the objectively measured sleep durations in our cohort. In our cohort, the median (IQR) daily sleep duration was 6.8 h (6.2–7.2) (Table 2), which is comparable to the NHANES estimates. However, whereas NHANES data suggest that ~32% of US adults experience ‘short sleep’ (<7 h), we found a much larger percentage (62.5%, n = 21,512) of participants with a median main sleep duration classified as short or very short sleep (<7 h). Although these differences are interesting to note, our cohort is not nationally representative and uses device-measured rather than self-reported sleep, complicating direct comparisons. In addition, while research suggests self-reported sleep can lead to overestimations³¹, the magnitude of the difference (32% versus 62.5%) suggests additional factors may be involved.

Recent studies of large cohorts using consumer sleep trackers have generated estimates of global sleep patterns, perhaps providing more appropriate comparisons to our device-measured data. One such study reported a slightly longer average sleep duration in its US subset³²: 6.9 h versus 6.8 h in our cohort. That study measured sleep in ~50,000 Oura ring users who donated an average of ~242 nights of data from January 2021 to January 2022. By contrast, participants in our cohort donated a median of 159 nights of valid sleep data over a median data donation window of 464 days, spanning from 2009 to 2023 (Supplementary Table 1). The cohorts were similar in age and sex, but socioeconomic status—known to influence sleep duration³³—was not reported³². The WEAR program successfully enrolled individuals from lower socioeconomic statuses who are less likely to be included in wearables datasets that rely on independent device purchases (Table 1). The likely difference in socioeconomic composition between the studies may partially explain the lower sleep durations we observed.

Another study using an under-mattress sleep device, the Withings Sleep Analyzer, reported a significantly higher average sleep duration of ~7.5 h for US device users³⁴. Validation studies suggest that the Withings device significantly overestimates sleep duration when compared to polysomnography and may do so to a greater extent than Fitbit devices^35,36. In addition, the Withings Sleep Analyzer study assessed sleep over 9 months in adults who registered to use the device between July 2020 and March 2021, a period that overlapped significantly with the COVID-19 pandemic. Several reports suggest population-level sleep abnormalities during this time, including increased time in bed and total sleep duration^23,37. Although our dataset includes this pandemic period, it also includes data from many years before and after, which would have mitigated the impact of pandemic-related changes on our longitudinal median sleep duration.

A key strength of the All of Us data is the ability to examine individual-level changes in relation to clinical events. To demonstrate the value of integrating wearable outcomes with clinical events documented in EHR records, we examined daily step counts in participants who experienced a lower limb fracture. Among the 61 participants in this case study, we observed considerable variability in average daily steps both before and after injury. Nevertheless, the cohort showed a rapid decline in steps relative to baseline immediately following the injury, with recovery taking on average 90 days after injury (seen as 120 rolling average days in Fig. 4). Even at 180 days (6 months) after injury, the cohort had not fully returned to pre-injury activity levels. Given the range of injury severity represented in the ‘Fracture of Lower Limb’ concept ID (Supplementary Table 2) and published reports indicating that several of these injuries require recovery times exceeding 6 months—particularly in older adults—this incomplete recovery was expected^38,39. Our primary purpose in conducting this case study was to demonstrate how wearable data can be integrated with clinical outcomes to understand correlations between health events and changes in activity patterns. Although we chose a relatively straightforward case study with an expected result, future researchers can leverage these integrated data types to identify novel biomarkers and associations between health, physical activity and sleep outcomes.

Realizing the full potential of this dataset requires continued methodological advancement in several areas. For example, the impact of device type on wearable outcomes remains poorly understood, and there are currently no consensus methods for addressing the use of multiple Fitbit devices in a single study or by a single participant⁴⁰. Similarly, approaches for handling missing data in DHT datasets are not standardized. Missing data in these datasets are unlikely to be random and may reflect conscious or subconscious decisions to remove a device, which can correlate with participant characteristics or health states (for example, mood) and introduce bias⁴¹.

Although Fitbits have demonstrated reasonable reliability compared to gold-standard devices for certain activity and sleep metrics^42,43,44, their reliability varies across specific measures (for example, sleep stages, heart rate), populations and device types^40,45,46. For example, research suggests that Fitbits measure heart rate less reliably in people with darker skin tones because of differences in how sensors optically measure light absorption⁴⁰. In addition, Fitbit step estimation accuracy may be reduced in people with irregular gait patterns from neurological conditions such as Parkinson’s disease, with inaccuracies varying by device type⁴⁷. The effect of these limitations on study findings will depend on the specific research question, outcome measures and population being studied. Researchers should carefully consider these device-specific and population-specific reliability limitations when designing analyses and interpreting results from this dataset.

Another important consideration is that wearables data may be subject to measurement reactivity, where participants temporarily alter their behavior when first provided with activity and sleep trackers. However, the duration of this effect is likely short-lived and depends on the health-related behavior of interest (for example, daily steps versus exercise minutes)⁴⁸. Researchers should consider their research question and observation period carefully and may wish to exclude the first few days or weeks of data donated by participants to avoid bias^49,50. Given the large-scale and longitudinal nature of the analyses in this manuscript, we chose not to exclude any days of data.

Future research using the All of Us Fitbit dataset will benefit from methodological advancements that address current limitations; however, developing such approaches was beyond the scope of this paper. Instead, our goal was to present the dataset at a high level, with the expectation that the broader research community will leverage it for methodological developments. Encouragingly, the research community has already begun this work, including several reports that specifically evaluate and provide considerations for using the All of Us Fitbit dataset^40,50,51. Future All of Us wearables data users are encouraged to reference the program’s user support hub (https://support.researchallofus.org), which contains additional information and guidance, including multiple ‘featured workspaces’ with example code and support articles, such as one titled ‘Considerations while using Fitbit data in the All of Us Research Program’.

Finally, analyses of demographic variables and DHT outcomes (for example, daily steps and sleep duration) require careful consideration to avoid misleading conclusions. A strength of the All of Us dataset is that it integrates many data types, including EHR, genomics and extensive self-reported survey data. Specifically, 82% (48,487 out of 59,018) of participants with Fitbit data also responded to the program’s SDOH survey, which asks about social factors like neighborhood, social life and perceived stress. We urge researchers to plan their analyses carefully, consult experts and community members in their research design, and consider all the data the program collects to study factors underlying sleep and activity differences.

An important consideration for all real-world datasets, including the data presented here, is that many factors of data collection are beyond experimenter control, and some of these uncontrolled factors may introduce sources of error or bias. For example, participants in our cohort used 41 different Fitbit device models with various sensors and technologies (Supplementary Table 3). This device heterogeneity may affect measurement accuracy owing to device-specific limitations or user-selected settings. In addition, although 59,018 participants donated Fitbit data to All of Us, only 52,860 (89.6%) had device information available in the device table, and a small fraction of participants showed evidence of using five or more devices during their data donation window (Supplementary Fig. 2). Such data characteristics reflect the real-world nature of this dataset, in which participants use their own devices over extended periods under free-living conditions.

Because our goal was to provide a high-level overview of Fitbit data availability and trends as a Resource paper, and because there are currently no consensus methods in the field for addressing device heterogeneity in consumer-grade wearables research, we did not prescribe specific analytical approaches for handling these factors. Establishing such methods is an active area of research that extends beyond the scope of a resource description paper. As the field continues to evolve, researchers should carefully consider potential sources of error or bias when analyzing real-world data, and important findings obtained in observational real-world datasets should ideally be followed up with controlled interventional studies when feasible.

In sum, although potential errors and biases present challenges when working with real-world data, there are also crucial benefits that make real-world datasets a valuable resource for the research community. These include their massive scale, richness of longitudinal data, integration with multiple data types (for example, EHR, surveys, genomics), and ability to support a wide range of research objectives—benefits that are often difficult to obtain in more controlled, small-scale datasets. The All of Us Fitbit dataset, with its extended observation periods, large and diverse participant population, and linkage to clinical outcomes, offers opportunities for discovery that complement findings from traditional research-grade accelerometry and plethysmography studies.

The WEAR study was a strategic and innovative effort by the All of Us Research Program to expand the number and representativeness of individuals donating DHT data by distributing Fitbit devices to participants at no cost. WEAR’s success is evidenced by a larger proportion of participants from varying backgrounds donating activity data through the WEAR study relative to the BYOD program (Table 1). The All of Us Research Program is accelerating research in precision medicine, a field that initially focused on the potential for human genetics to enable individually tailored treatments and improve health outcomes, but that over time has broadened its scope to appreciate the role of additional data types, including DHT. By substantially increasing the amount of DHT data available from a broader range of individuals across the US population, the expanded All of Us Fitbit dataset offers a valuable resource to advance biomedical research. This resource can help researchers better understand the contributions of sleep, heart rate and physical activity to important health outcomes, and inform the development of more precise treatments and interventions.

Methods

This research complies with all relevant ethical regulations. Specifically, secondary use of All of Us Research Program data has been designated nonhuman participants research by the All of Us Institutional Review Board. Therefore, additional informed consent was not required. In addition, this study followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guideline for cohort studies⁵².

Data sources and participants

Using the Controlled Tier CDR v.8 (C2024Q3R5) on the All of Us Researcher Workbench, we analyzed demographic and Fitbit data from participants who live in the USA or its territories and were aged 18 or older who enrolled in the All of Us Research Program either at a healthcare provider site or by directly visiting the enrollment website between 31 May 2017 and 1 October 2023. There are two pathways by which a participant could donate Fitbit data to the All of Us Research Program: the BYOD program⁵³, in which a participant consents to share data from an existing device, or the WEAR study, in which participants were provided with a Fitbit device by the program at no cost. WEAR participants were given a choice between the Fitbit Charge and the Fitbit Versa. Over the course of the study, a variety of each model (for example, Charge 3, Charge 4) was distributed to WEAR participants. WEAR study participants in this data release enrolled between February 2021 and September 2023. We classified participants as WEAR if they consented to join the WEAR program and had data starting on or after 22 February 2021. Participants without WEAR consent or who began donating Fitbit data before 22 February 2021 were considered BYOD participants. Demographic details of all participants were obtained from the All of Us ‘the Basics’ survey.
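The WEAR versus BYOD classification rule described above reduces to a simple predicate. The sketch below uses hypothetical argument names; how WEAR consent and the first Fitbit data date are stored in the Researcher Workbench tables is not specified here.

```python
from datetime import date

WEAR_START = date(2021, 2, 22)  # cutoff date stated in the text

def classify_pathway(has_wear_consent: bool, first_fitbit_date: date) -> str:
    """Return 'WEAR' only for consented participants whose data start on or after the cutoff."""
    if has_wear_consent and first_fitbit_date >= WEAR_START:
        return "WEAR"
    return "BYOD"

# Hypothetical participants.
examples = [
    classify_pathway(True, date(2023, 5, 1)),    # consented, data after cutoff
    classify_pathway(True, date(2019, 8, 15)),   # consented, but data predate WEAR
    classify_pathway(False, date(2022, 1, 10)),  # no WEAR consent
]
```

Note that under this rule a WEAR-consented participant who had already been sharing data from a personal device before 22 February 2021 is counted as BYOD.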

C2024Q3R5 includes data from 59,018 participants who contributed Fitbit data spanning over 14 years, with data types such as daily and intraday or sequence level metrics for activity, steps, heart rate, sleep and device information (Supplementary Table 1). A subset of this Fitbit data and information from health surveys administered in English or Spanish were used in this study. Information about specific Fitbit tables and the survey questions used are given in Supplementary Table 4.

Definitions for eligibility criteria

Participants met general activity eligibility criteria (n = 54,509) if they completed the Basics survey, had a step count >0 in the activity summary table, were ≥18 years of age at the time of their earliest activity data point, and had at least 4 days of valid activity data, where a ‘valid activity day’ is defined as having ≥10 h of data per day, and ≥100 steps but <100,000 steps in a day (Supplementary Fig. 3 and Supplementary Table 5).
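These criteria can be written as two small predicates. This is a sketch under assumed inputs: each day is represented as a hypothetical `(hours_of_data, steps)` pair, and how hours of data are derived from the Fitbit tables is not covered here.

```python
def is_valid_activity_day(hours_of_data: float, steps: int) -> bool:
    """A valid day has >=10 h of data and at least 100 but fewer than 100,000 steps."""
    return hours_of_data >= 10 and 100 <= steps < 100_000

def meets_general_activity_criteria(days, completed_basics, age_at_first_data,
                                    min_valid_days=4):
    """days: iterable of (hours_of_data, steps) pairs for one participant."""
    valid_days = sum(1 for hours, steps in days if is_valid_activity_day(hours, steps))
    return completed_basics and age_at_first_data >= 18 and valid_days >= min_valid_days

# Hypothetical participant with five recorded days, four of them valid.
days = [(14, 7200), (9, 6500), (16, 8100), (12, 150), (24, 5400)]
eligible = meets_general_activity_criteria(days, completed_basics=True, age_at_first_data=42)
```

The requirement of a step count greater than zero in the activity summary table is implied here by the valid-day rule, since every valid day has at least 100 steps.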

Participants met seasonal activity eligibility criteria (n = 53,295) if they met the general activity eligibility criteria above and had at least 7 ‘valid activity days’ per month for any month during which they donated Fitbit data (Supplementary Fig. 3 and Supplementary Table 6).
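The monthly threshold can be sketched as follows. This Python example assumes one reading of the criterion: a month contributes to the seasonal analysis only if it contains at least 7 valid days, and a participant is eligible if at least one month qualifies. The function name and input format are hypothetical.

```python
from collections import Counter
from datetime import date


def qualifying_months(valid_days, min_days=7):
    """Return sorted (year, month) pairs with at least `min_days` valid days.

    valid_days: iterable of datetime.date objects that already passed the
    'valid activity day' (or 'valid sleep day') filter.
    """
    counts = Counter((d.year, d.month) for d in set(valid_days))
    return sorted(month for month, n in counts.items() if n >= min_days)


# Example: seven valid days in January 2020, three in February 2020.
example_days = [date(2020, 1, day) for day in range(1, 8)]
example_days += [date(2020, 2, day) for day in range(1, 4)]
eligible = len(qualifying_months(example_days)) > 0
```

The same pattern would apply to the seasonal sleep criteria, with valid sleep days as input.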

Participants met general sleep eligibility criteria (n = 34,378) if they completed the Basics survey, were ≥18 years of age at the time of their earliest sleep data point, and had at least 4 days of ‘valid sleep data’, where valid sleep data is defined as having slept ≥4 h on at least 70% of donated days^54,55 (Supplementary Fig. 4 and Supplementary Table 7).

Participants met seasonal sleep eligibility criteria (n = 33,471) if they met the general sleep eligibility criteria above and had 7 or more ‘valid sleep days’ per month for any month during which they donated Fitbit data (Supplementary Fig. 4 and Supplementary Table 8).

Participants met the lower limb fracture case study eligibility criteria (n = 61) if they had EHR records for the ‘Fracture of Lower Limb’ SNOMED concept ID 4187096 (see Supplementary Table 2 for participant counts and subcodes), had Fracture of Lower Limb EHR records on at least five separate days, indicating injuries serious enough to require multiple subsequent visits, and had at least 300 days of step data during the 360-day observation period (±180 days of the earliest recorded Fracture of Lower Limb EHR record). The cohort builder tool on the All of Us Researcher Workbench was used to identify the initial cohort of 2,476 participants with fracture EHR records and Fitbit data, which was then restricted based on the above criteria to a final cohort of 61 (Supplementary Fig. 5 and Supplementary Table 9).
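The two numeric restrictions on the fracture cohort can be expressed compactly. The Python sketch below is illustrative only: the function name and inputs are hypothetical, and the exact window boundaries are an assumption (here the 360 days from 180 days before the earliest fracture record up to, but not including, 180 days after it).

```python
from datetime import date, timedelta


def meets_fracture_criteria(fracture_record_dates, step_dates):
    """Check the record-day and step-day thresholds for one participant.

    fracture_record_dates: dates of 'Fracture of Lower Limb' EHR records.
    step_dates: dates on which the participant has step data.
    """
    distinct_record_days = set(fracture_record_dates)
    if len(distinct_record_days) < 5:  # records on at least five separate days
        return False
    index_date = min(distinct_record_days)  # earliest fracture record
    window_start = index_date - timedelta(days=180)
    window_end = index_date + timedelta(days=180)  # exclusive (assumption)
    days_in_window = {d for d in step_dates if window_start <= d < window_end}
    return len(days_in_window) >= 300  # at least 300 of the 360 days
```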

Enrollment and geographic analyses

Figure 1a shows cumulative start date data for all WEAR and BYOD participants (n = 59,018), where participants were added to the cumulative count of each cohort on the day of their earliest recorded Fitbit data. Stratification into WEAR versus BYOD cohorts is described in the section ‘Data sources and participants’. Figure 1b,c shows the state-level distributions of the WEAR and BYOD cohorts, by using each individual’s state of residence data.

Activity analyses

Participants who met the general activity eligibility criteria (n = 54,509) were included in a detailed demographic analysis (Table 1) and an analysis of baseline activity levels (Table 2). Median daily steps and IQRs were calculated using all available data from the entire observation window (3 October 2009 to 30 September 2023) (Table 2). The amount and duration of valid activity data donated by each participant varied (Supplementary Fig. 6 and Supplementary Table 5).

For the seasonal activity analysis, we first calculated the median daily step count per month for each eligible participant (n = 53,295). We then computed the overall monthly medians and IQRs across all participants for each month. Normalization was performed by dividing each individual’s monthly median step count by their overall median step count during the entire observation window (3 October 2009 to 30 September 2023). Although data are available for the whole of this date range, only data from 1 January 2018 to 30 September 2023 are shown in Fig. 2. This date range was selected to allow visualization of seasonal oscillations and to include the COVID-19 pandemic period, during which deviations from typical seasonal trends occurred.
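The per-participant normalization step can be illustrated with a small example. This Python sketch assumes that the denominator is the participant's median daily step count over all of their days in the observation window. The function name and input format are hypothetical.

```python
from collections import defaultdict
from datetime import date
from statistics import median


def normalized_monthly_medians(daily_steps):
    """daily_steps: dict mapping datetime.date -> step count (one participant).

    Returns a dict mapping (year, month) -> monthly median divided by the
    participant's overall median.
    """
    by_month = defaultdict(list)
    for day, steps in daily_steps.items():
        by_month[(day.year, day.month)].append(steps)
    overall = median(daily_steps.values())
    return {month: median(vals) / overall for month, vals in sorted(by_month.items())}


example = {
    date(2020, 1, 1): 4000, date(2020, 1, 2): 6000, date(2020, 1, 3): 8000,
    date(2020, 2, 1): 10000, date(2020, 2, 2): 12000, date(2020, 2, 3): 14000,
}
ratios = normalized_monthly_medians(example)
```

In this example the overall median is 9,000 steps, so January (median 6,000) has a ratio of about 0.67 and February (median 12,000) about 1.33. Values above 1 indicate months that are more active than is typical for that participant.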

Sleep analyses

Participants who met general sleep eligibility criteria (n = 34,378) were included in an analysis of baseline sleep duration (Table 2). Median daily main sleep durations and IQRs were computed using all available data, which ranged from 6 October 2009 to 30 September 2023. The amount and duration of valid sleep data donated by each participant varied (Supplementary Fig. 7 and Supplementary Table 7). Fitbit devices record all sleep events, differentiating between shorter periods of sleep, such as naps, and the longest sleep period, which is designated the ‘main sleep’. Next, using each participant’s median daily sleep duration, we calculated the percentage of the general sleep cohort that fell into the following categories¹¹: normal sleep (7–8.99 h per night), very short sleep (<5 h per night), short sleep (5–6.99 h per night) and long sleep (≥9 h per night).
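The sleep duration categories can be sketched as a simple binning function. This Python example is an illustration rather than the authors' implementation: it treats the category limits as continuous cut points at 5, 7 and 9 h (so that a median of, say, 6.995 h falls in ‘short’), which is an assumption, and the function names are hypothetical.

```python
from collections import Counter


def sleep_category(median_hours: float) -> str:
    """Assign a participant's median nightly main sleep to a category."""
    if median_hours < 5:
        return "very short"
    if median_hours < 7:
        return "short"
    if median_hours < 9:
        return "normal"
    return "long"


def category_percentages(median_hours_per_participant):
    """Percentage of participants in each category."""
    counts = Counter(sleep_category(h) for h in median_hours_per_participant)
    total = sum(counts.values())
    return {category: 100 * n / total for category, n in counts.items()}
```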

For the seasonal sleep analysis, we first calculated the median daily sleep duration per month for each eligible participant (n = 33,471). We then computed the overall monthly medians and IQRs across all participants for each month. Normalization for the seasonal data was completed by dividing each individual’s monthly median sleep duration by their overall median sleep duration during the entire observation window (6 October 2009 to 30 September 2023). Although data are available for the entire observation window, only data from 1 January 2018 to 30 September 2023 are shown in Fig. 2. This subset of the data was selected for the same reasons as described above in the ‘Activity analyses’ section.

Data type overlap analysis

All participants who donated any Fitbit data (n = 59,018) were assigned a flag for each data type of interest. The authors selected the highlighted data types based on their expectations of those with the broadest interest. The counts and overlap of participants donating each data type were visualized in a Venn diagram (Fig. 4). Supplementary Table 4 specifies the source data used to determine the number of participants who donated key data types in addition to any Fitbit data for this overlap analysis.
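The flag-and-count approach behind a Venn diagram can be sketched as below. This is a minimal Python illustration with hypothetical names: each participant is represented by the set of data types they donated, and the count for each exact combination corresponds to one region of the diagram. The data type labels in the example are placeholders, not the actual table names.

```python
from collections import Counter


def region_counts(flags):
    """flags: dict mapping participant ID -> set of donated data types.

    Returns the number of participants in each exact combination of types,
    that is, each non-overlapping region of a Venn diagram.
    """
    return Counter(frozenset(types) for types in flags.values())


def totals_per_type(flags):
    """Number of participants who donated each data type (regions combined)."""
    return Counter(t for types in flags.values() for t in types)


example_flags = {
    "p1": {"fitbit", "ehr"},
    "p2": {"fitbit", "ehr"},
    "p3": {"fitbit"},
    "p4": {"fitbit", "ehr", "genomics"},
}
regions = region_counts(example_flags)
totals = totals_per_type(example_flags)
```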

Lower limb fracture: case study analysis

Participants who met the lower limb fracture case study eligibility criteria (n = 61) were included in an analysis to track the decline and recovery in median daily steps around the time of an EHR event that indicated a broken leg (see Supplementary Fig. 5 and Supplementary Tables 2 and 9 for SNOMED subcodes and participant counts). For this analysis, we deviated from our standard practice of removing days with step counts <100 or >100,000 as ‘invalid’ (see the ‘valid activity day’ definition in the ‘Definitions for eligibility criteria’ section). Given the nature of the analysis, we were specifically interested in anomalously low step days, and the highest value seen in this cohort was 51,101 steps, well below the 100,000-step ceiling. As a result, no step days were excluded before analysis. To compare all participants on a uniform scale, daily step data were normalized by dividing each participant’s daily step count by their mean step count across all the days in the 360-day observation period. Plots show 30-day rolling averages of this normalized step count.
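The normalization and smoothing described above can be sketched for a single participant. This Python example makes two assumptions that the text does not specify: the rolling average uses a trailing window, and the input is a gap-free list of daily step counts in date order. The function name is hypothetical.

```python
from statistics import mean


def normalized_rolling_mean(daily_steps, window=30):
    """Normalize daily steps by the participant's mean, then smooth.

    daily_steps: list of daily step counts in date order (one participant).
    Returns one trailing rolling-mean value per day from day `window` onward.
    """
    participant_mean = mean(daily_steps)
    normalized = [steps / participant_mean for steps in daily_steps]
    return [
        mean(normalized[i - window + 1 : i + 1])
        for i in range(window - 1, len(normalized))
    ]


# A participant with constant activity has a flat normalized curve at 1.0.
flat = normalized_rolling_mean([100] * 40)
```

A value of 1 corresponds to the participant's own average over the observation period, so a dip below 1 around the fracture date reflects reduced activity relative to that participant's usual level.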

Device type analysis

There is significant heterogeneity in the devices used in the All of Us Fitbit dataset. Although we did not account for this heterogeneity directly in our methods, we did develop a detailed catalog of all Fitbit device models present in the dataset, including participant counts for each model, sensor specifications, estimated release years and device-specific considerations (Supplementary Table 3). We also analyzed the distribution of device count per participant (Supplementary Fig. 1). Together, these resources enable researchers to understand the range of devices used and their measurement capabilities for future research.

Statistical analyses

R and Python programming languages were used to conduct all the analyses on the All of Us Researcher Workbench. Summary demographic information is reported as count (n) and percentage for each cohort (WEAR, BYOD, total cohort). Summary data are reported as medians and IQRs because most of the step and sleep distributions are not normal (they are skewed, or bimodal if zero values are included), so means and standard deviations would misrepresent their shapes. With our large sample size, formal normality tests (Anderson–Darling, Shapiro–Wilk) reject normality even for small deviations, so we also inspected Q–Q plots, which consistently indicated skew. Many subsets also contained a substantial proportion of zero values, rendering the data bimodal. Participant counts <20 for any reporting category were obscured to protect the privacy of participants and in accordance with the All of Us Research Program’s Data and Statistics Dissemination Policy (https://www.researchallofus.org/faq/data-and-statistics-dissemination-policy).
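The summary statistics and small-count masking can be illustrated with the Python standard library. This is a sketch under assumptions: the quartile method (‘inclusive’) is a choice made here, because the text does not state which method was used, and both function names are hypothetical.

```python
from statistics import median, quantiles


def median_iqr(values):
    """Return (median, Q1, Q3) for a list of values.

    The 'inclusive' quartile method is an assumption for illustration.
    """
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    return median(values), q1, q3


def mask_small_count(n, threshold=20):
    """Obscure participant counts below the reporting threshold."""
    return f"<{threshold}" if n < threshold else str(n)
```

For example, `median_iqr([1, 2, 3, 4, 5])` gives a median of 3 with quartiles 2.0 and 4.0, and `mask_small_count(19)` returns the string `<20`.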

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

This study used data from the All of Us Research Program’s Controlled Tier Dataset version 8 (C2024Q3R5), available to registered users on the All of Us Researcher Workbench (https://workbench.researchallofus.org). The dataset is accessible only to registered researchers to protect patient privacy. Step-by-step instructions for how an institution and individual can gain access are available at https://support.researchallofus.org/hc/en-us/articles/9005549268756-How-to-Obtain-a-DURA-with-All-of-Us.

Code availability

Code used for this study is available to users of the All of Us Researcher Workbench platform in a featured workspace. In addition, a public-facing code repository (outside the Researcher Workbench) is available via GitHub at https://github.com/RTIInternational/allofus_NIH_wear.

References

Vogels, E. A. About one-in-five Americans use a smart watch or fitness tracker. Pew Research https://www.pewresearch.org/short-reads/2020/01/09/about-one-in-five-americans-use-a-smart-watch-or-fitness-tracker/ (2020).
Al-Alusi, M. A. et al. Trends in consumer wearable devices with cardiac sensors in a primary care cohort. Circ. Cardiovasc. Qual. Outcomes 15, e008833 (2022).
Nagappan, A., Krasniansky, A. & Knowles, M. Patterns of ownership and usage of wearable devices in the United States, 2020–2022: survey study. J. Med. Internet Res. 26, e56504 (2024).
Hughes, A., Shandhi, M. M. H., Master, H., Dunn, J. & Brittain, E. Wearable devices in cardiovascular medicine. Circ. Res. 132, 652–670 (2023).
Zahedani, A. D. et al. Digital health application integrating wearable data and behavioral patterns improves metabolic health. NPJ Digit. Med. 6, 216 (2023).
Holko, M. et al. Wearable fitness tracker use in federally qualified health center patients: strategies to improve the health of all of us using digital health devices. NPJ Digit. Med. 5, 53 (2022).
Kim, E. H. et al. Association of demographic and socioeconomic indicators with the use of wearable devices among children. JAMA Netw. Open 6, e235681 (2023).
Mapes, B. M. et al. Diversity and inclusion for the All of Us Research Program: a scoping review. PLoS ONE 15, e0234962 (2020).
Master, H., Kouame, A., Hollis, H., Marginean, K. & Rodriguez, K. 2022Q4R9 v7 Data Characterization Report (All of Us Research Program, 2024).
Through ‘All of Us’ program, Scripps Research launches wearable technology study to accelerate precision medicine. Scripps Research https://www.scripps.edu/news-and-events/press-room/2021/20210224-aou-fitbit-study.html (24 February 2021).
Nie, Q. et al. Analysis of sleep for the American population: result from NHANES database. J. Affect. Disord. 347, 134–143 (2024).
Mayo, K. R. et al. The All of Us Data and Research Center: creating a secure, scalable, and sustainable ecosystem for biomedical research. Annu. Rev. Biomed. Data Sci. 6, 443–464 (2023).
Garriga, A., Sempere-Rubio, N., Molina-Prados, M. J. & Faubel, R. Impact of seasonality on physical activity: a systematic review. Int. J. Environ. Res. Public Health 19, 2 (2021).
Tucker, P. & Gilliland, J. The effect of season and weather on physical activity: a systematic review. Public Health 121, 909–922 (2007).
Desine, S. et al. Daily step counts before and after the COVID-19 pandemic among All of Us research participants. JAMA Netw. Open 6, e233526 (2023).
Thomas Craig, K. J. et al. Leveraging data and digital health technologies to assess and impact social determinants of health (SDoH): a state-of-the-art literature review. Online J. Public Health Inform. 13, E14 (2021).
Jeong, H. et al. Data from the All of Us Research Program reinforces existence of activity inequality. NPJ Digit. Med. 8, 8 (2025).
Titova, O. E., Lindberg, E., Elmstahl, S., Lind, L. & Benedict, C. Seasonal variations in sleep duration and sleep complaints: a Swedish cohort study in middle-aged and older individuals. J. Sleep Res. 31, e13453 (2022).
Mattingly, S. M. et al. The effects of seasons and weather on sleep patterns measured through longitudinal multimodal sensing. NPJ Digit. Med. 4, 76 (2021).
Dunster, G. P. et al. Daytime light exposure is a strong predictor of seasonal variation in sleep and circadian timing of university students. J. Pineal Res. 74, e12843 (2023).
Scott, H. et al. Variations in sleep duration and timing: weekday and seasonal variations in sleep are common in an analysis of 73 million nights from an objective sleep tracker. Sleep 48, zsaf099 (2025).
Mandelkorn, U. et al. Escalation of sleep disturbances amid the COVID-19 pandemic: a cross-sectional international study. J. Clin. Sleep Med. 17, 45–53 (2021).
Batool-Anwar, S. et al. Examining changes in sleep duration associated with the onset of the COVID-19 pandemic: who is sleeping and who is not? Behav. Med. 49, 162–171 (2023).
Rezaei, N. & Grandner, M. A. Changes in sleep duration, timing, and variability during the COVID-19 pandemic: large-scale Fitbit data from 6 major US cities. Sleep Health 7, 303–313 (2021).
Tudor-Locke, C., Johnson, W. D. & Katzmarzyk, P. T. Accelerometer-determined steps per day in US adults. Med. Sci. Sports Exerc. 41, 1384–1391 (2009).
Saint-Maurice, P. F. et al. Association of daily step count and step intensity with mortality among US adults. JAMA 323, 1151–1160 (2020).
Small, S. R. et al. Self-supervised machine learning to characterize step counts from wrist-worn accelerometers in the UK Biobank. Med. Sci. Sports Exerc. 56, 1945–1953 (2024).
Koffman, L., Crainiceanu, C. & Muschelli, J. Comparing step counting algorithms for high-resolution wrist accelerometry data in NHANES 2011–2014. Med. Sci. Sports Exerc. 57, 746–755 (2025).
Cepeda, M. S., Stang, P., Blacketer, C., Kent, J. M. & Wittenberg, G. M. Clinical relevance of sleep duration: results from a cross-sectional analysis using NHANES. J. Clin. Sleep Med. 12, 813–819 (2016).
Ford, E. S., Cunningham, T. J. & Croft, J. B. Trends in self-reported sleep duration among US adults from 1985 to 2012. Sleep 38, 829–832 (2015).
Lauderdale, D. S., Knutson, K. L., Yan, L. L., Liu, K. & Rathouz, P. J. Self-reported and measured sleep duration: how similar are they? Epidemiology 19, 838–845 (2008).
Willoughby, A. R., Alikhani, I., Karsikas, M., Chua, X. Y. & Chee, M. W. L. Country differences in nocturnal sleep variability: observations from a large-scale, long-term sleep wearable study. Sleep Med. 110, 155–165 (2023).
Wetzel, S. & Bilal, U. Socioeconomic status and sleep duration among a representative, cross-sectional sample of US adults. BMC Public Health 24, 3410 (2024).
Scott, H. et al. Are we getting enough sleep? Frequent irregular sleep found in an analysis of over 11 million nights of objective in-home sleep data. Sleep Health 10, 91–97 (2024).
Kainec, K. A. et al. Evaluating accuracy in five commercial sleep-tracking devices compared to research-grade actigraphy and polysomnography. Sensors 24, 635 (2024).
Manners, J. et al. Performance evaluation of an under-mattress sleep sensor versus polysomnography in >400 nights with healthy and unhealthy sleep. J. Sleep Res. 34, e14480 (2025).
Yuan, R. K., Zitting, K. M., Maskati, L. & Huang, J. Increased sleep duration and delayed sleep timing during the COVID-19 pandemic. Sci. Rep. 12, 10937 (2022).
Eastwood, E. A. et al. Patients with hip fracture: subgroups and their outcomes. J. Am. Geriatr. Soc. 50, 1240–1249 (2002).
Vaartjes, T. P. et al. Most patients with tibial plateau fractures regain function for daily activities but do not return to their pre-injury level of sport: comparison of multicenter cohort of 1101 patients and age-related peers. Eur. J. Trauma Emerg. Surg. 51, 285 (2025).
Lederer, L. et al. The importance of data quality control in using Fitbit device data from the All of Us Research Program. JMIR Mhealth Uhealth 11, e45103 (2023).
Strain, T., Wijndaele, K., Pearce, M. & Brage, S. Considerations for the use of consumer-grade wearables and smartphones in population surveillance of physical activity. J. Meas. Phys. Behav. 5, 8–14 (2022).
Feehan, L. M. et al. Accuracy of Fitbit devices: systematic review and narrative syntheses of quantitative data. JMIR Mhealth Uhealth 6, e10527 (2018).
Haghayegh, S., Khoshnevis, S., Smolensky, M. H., Diller, K. R. & Castriotta, R. J. Accuracy of wristband Fitbit models in assessing sleep: systematic review and meta-analysis. J. Med. Internet Res. 21, e16273 (2019).
Germini, F. et al. Accuracy and acceptability of wrist-wearable activity-tracking devices: systematic review of the literature. J. Med. Internet Res. 24, e30791 (2022).
Robbins, R. et al. Accuracy of three commercial wearable devices for sleep tracking in healthy adults. Sensors 24, 6532 (2024).
Xie, J. et al. Evaluating the validity of current mainstream wearable devices in fitness tracking under various physical activities: comparative study. JMIR Mhealth Uhealth 6, e94 (2018).
Wendel, N. et al. Accuracy of activity trackers in Parkinson disease: should we prescribe them? Phys. Ther. 98, 705–714 (2018).
Konig, L. M., Allmeta, A., Christlein, N., Van Emmenis, M. & Sutton, S. A systematic review and meta-analysis of studies of reactivity to digital in-the-moment measurement of health behaviour. Health Psychol. Rev. 16, 551–575 (2022).
Clemes, S. A. & Deans, N. K. Presence and duration of reactivity to pedometers in adults. Med. Sci. Sports Exerc. 44, 1097–1101 (2012).
Bailey, C. P. Fitbit physical activity and sleep data in the All of Us Research Program: data exploration and processing considerations for research. Med. Sci. Sports Exerc. 57, 2946–2953 (2025).
Van Der Donckt, J. et al. Mitigating data quality challenges in ambulatory wrist-worn wearable monitoring through analytical and practical approaches. Sci. Rep. 14, 17545 (2024).
von Elm, E. et al. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. J. Clin. Epidemiol. 61, 344–349 (2008).
Master, H. et al. How Fitbit data are being made available to registered researchers in All of Us Research Program. Pac. Symp. Biocomput. 28, 19–30 (2023).
Zheng, N. S. et al. Sleep patterns and risk of chronic disease as measured by long-term monitoring with commercial wearable devices in the All of Us Research Program. Nat. Med. 30, 2648–2656 (2024).
McCauley, P. et al. A new mathematical model for the homeostatic effects of sleep loss on neurobehavioral performance. J. Theor. Biol. 256, 227–239 (2009).

Acknowledgements

We thank All of Us Research Program participants for their contributions, without whom this research would not have been possible. We also thank the National Institutes of Health’s All of Us Research Program for making available the participant data examined in this study. The All of Us Research Program is supported by the NIH, Office of the Director: Regional Medical Centers (1 OT2 OD026549, 1 OT2 OD026554, 1 OT2 OD026557, 1 OT2 OD026556, 1 OT2 OD026550, 1 OT2 OD 026552, 1 OT2 OD026553, 1 OT2 OD026548, 1 OT2 OD026551, 1 OT2 OD026555; IAA: AOD21037, AOD22003, AOD16037, AOD21041); Federally Qualified Health Centers (HHSN 263201600085U); Data and Research Center (5 U2C OD023196); Biobank (1 U24 OD023121); The Participant Center (U24 OD023176); Participant Technology Systems Center (1 U24 OD023163, 1 OT2 OD 030043); Communications and Engagement (3 OT2 OD023205, 3 OT2 OD023206); and Community Partners (1 OT2 OD025277, 3 OT2 OD025315, 1 OT2 OD025337, 1 OT2 OD025276). H.M. acknowledges additional funding from the NIH (1-K12-AR085544-01, UNC BIRWCH K12). This work was conducted as part of the official duties of T.P., J.A., A.R., J.M. and A.R.P. in their capacity as employees of the NIH. E.P. and H.M. contributed under support from OT2OD028395 and OT2OD035404 awarded by NIH to RTI and the All of Us Data and Research Center. The funders had no role in study design, data collection and analysis, decision to publish, or manuscript preparation beyond directing and funding the work as part of agency operations.

Author information

Authors and Affiliations

National Institutes of Health, Bethesda, MD, USA
Theresa Patten, Jennifer Adjemian, Andrea Ramirez, James McClain & Amy Rose Price
RTI International, Research Triangle Park, NC, USA
Edward A. Preble
Vanderbilt Institute of Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, TN, USA
Hiral Master
Department of Health Sciences, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Hiral Master

Authors

Theresa Patten
Edward A. Preble
Hiral Master
Jennifer Adjemian
Andrea Ramirez
James McClain
Amy Rose Price

Contributions

T.P., H.M., J.A., A.R., J.M. and A.R.P. were responsible for study conceptualization. T.P., E.A.P., H.M., J.A., J.M. and A.R.P. developed the study methodology. T.P., E.A.P. and A.R.P. undertook formal analysis of the data. T.P. and A.R.P. wrote the original draft. T.P., H.M., J.A., A.R., J.M. and A.R.P. reviewed and edited the original draft. T.P., E.A.P. and A.R.P. wrote the revised manuscript. T.P., E.A.P., J.A. and A.R.P. reviewed and edited the revised manuscript.

Corresponding author

Correspondence to Jennifer Adjemian.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Medicine thanks Sarah Kozey Keadle, Ambarish Pandey and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Liam Messin, in collaboration with the Nature Medicine team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Methods, Supplementary Tables 1–9 and Supplementary Figs. 1–7.

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Patten, T., Preble, E.A., Master, H. et al. The All of Us Research Program’s wearables dataset. Nat Med (2026). https://doi.org/10.1038/s41591-026-04352-3

Received: 01 June 2025
Accepted: 16 March 2026
Published: 27 April 2026
Version of record: 27 April 2026
DOI: https://doi.org/10.1038/s41591-026-04352-3