Orphanhood and caregiver death among children in the United States by all-cause mortality, 2000–2021

Villaveces, Andrés; Chen, Yu; Tucker, Sydney; Blenkinsop, Alexandra; Cluver, Lucie; Sherr, Lorraine; Losby, Jan L.; Graves, Linden; Noonan, Rita; Annor, Francis; Kojey-Merle, Victor; Wang, Douhan; Massetti, Greta; Rawlings, Laura; Nelson, Charles A.; Unwin, H. Juliette T.; Flaxman, Seth; Hillis, Susan; Ratmann, Oliver

doi:10.1038/s41591-024-03343-6

Download PDF

Article
Open access
Published: 10 January 2025

Orphanhood and caregiver death among children in the United States by all-cause mortality, 2000–2021

Nature Medicine volume 31, pages 672–683 (2025) Cite this article

11k Accesses
7 Citations
81 Altmetric
Metrics details

Subjects

Abstract

Deaths of parents and grandparent caregivers threaten child well-being owing to losses of care, financial support, safety and family stability, but are relatively unrecognized as a public health crisis. Here we used cause-specific vital statistics death registrations in a modeling approach to estimate the full magnitude of orphanhood incidence and prevalence among US children aged 0–17 years between 2000 and 2021 by cause, child age, race and ethnicity, sex of deceased parent and state, and also accounted for grandparent caregiver loss using population survey data. In 2021, we estimate that 2.91 million children (4.2% of children) had in their lifetime experienced prevalent orphanhood and caregiver death combined, with incidence increasing by 49.5% and prevalence by 7.9% since 2000. Populations disproportionately affected by orphanhood included 5.2% of all adolescents; 6.4% and 4.7%, respectively, of non-Hispanic American Indian or Alaska Native, and non-Hispanic Black children; and children in southern and eastern states. In 2021, drug overdose was the leading cause of orphanhood among non-Hispanic white children, but not among minoritized subgroups. Effective policies and programs to support nearly three million bereaved children are needed to reduce the acute and long-term negative effects of orphanhood.

Global and regional estimates of orphans attributed to maternal cancer mortality in 2020

Article Open access 20 November 2022

Disparities in childhood human capital investments in the United States

Article Open access 31 March 2026

Quality of life and associated factors among family caregivers of individuals with psychiatric illness at DRH, South Wollo, Ethiopia, 2020

Article Open access 03 November 2022

Main

Children’s lives are increasingly impacted by intersecting crises, including pandemics, organized crime, political and social conflict, and climate change. One of the most fundamental threats that children are facing through escalating crises is orphanhood, defined by United Nations Children’s Fund as the death of one or both parents^1,2, and more generally caregiver loss, including the death of co-residing grandparent caregivers who are responsible for most or some needs of their grandchildren^3,4. Nearly 15 years ago, the World Health Organization (WHO) identified parental death as an adverse childhood experience increasing adult mental health risks⁴. As an adverse childhood experience, parent or caregiver loss may have lifelong consequences^5,6,7,8, including increased risks of suicide, post-traumatic stress disorder, violence, insecure housing⁹, and chronic and infectious diseases^10,11. These consequences often lead to ongoing needs for health and mental health services; parenting, educational and economic support for affected children remaining with surviving parents or caregivers; and foster care or adoption services for children bereft of care¹¹. Yet, available data on caregiver loss are limited to specific causes such as human immunodeficiency virus (HIV) and acquired immunodeficiency syndrome (AIDS)¹², coronavirus disease (COVID-19)¹³, maternal cancers¹⁴ or drug overdose^15,16.

Despite serious risks for bereaved children, we know that timely, multifactored policies that address social determinants of health and promote safe, stable and nurturing relationships and environments are effective in restoring hope and building resilience for orphaned and vulnerable children, their families and communities^17,18. For example, during the COVID-19 pandemic, real-time COVID-19-associated caregiver-loss incidence estimates^19,20,21 led to policies that support bereaved children and families including recommendations in the US National COVID-19 Pandemic Preparedness Plan²²; investments in financial, bereavement and mental health support in some states²³; and a range of global financial, legal and educational support across Brazil, Colombia, Peru, Indonesia and Mexico City²⁴. The World Bank Rapid Social Response Program dedicated funding supporting 12 nations addressing pandemic-linked orphanhood and caregiver-loss agendas²⁵. Furthermore, since 2003, the multibillion dollar President’s Emergency Plan for AIDS Relief has been investing 10% of bilateral funding to provide nutritional, educational, psychosocial and livelihood support for 7.2 million children who were orphaned and vulnerable due to the HIV pandemic²⁶. Together, these responses and frameworks²⁷ raise the possibility that new standards of care can be extended for any child experiencing orphanhood or grandparent caregiver loss, regardless of cause¹⁴.

In the United States, the COVID-19 pandemic intersected with increasing challenges including substance use, economic crises and mental health distress, resulting in compounding deaths due to overdose, suicide, excessive alcohol use and contagion^28,29. Caregiver loss amidst intersecting social and health crises is profoundly exacerbated by inequities in access to social determinants of health, including financial support, education, housing, employment and access to health care. These inequities intensify child risks and vulnerability in particular populations, such as non-Hispanic American Indian or Alaska Native children²⁰.

Understanding the full extent of orphanhood and grandparent caregiver loss due to all causes in the United States, including incidence, prevalence trends and associated inequalities, is essential for informing evidence-based prevention and response strategies for affected children and families. In particular, relative to white children, non-Hispanic Black, non-Hispanic Asian and Hispanic children are twice as likely to live with a grandparent²⁰. From a policy perspective, estimates of orphanhood and grandparent-caregiver-loss prevalence are essential for informing the extent of essential investments needed for children under age 18, as the psychosocial, economic, housing and educational consequences of such losses continue to threaten well-being, health and mental health throughout childhood and adolescence. Just as vital statistics and death records inform public health prevention and response policies according to leading causes of death³⁰, this approach may widely inform support systems for children experiencing caregiver loss.

In this study, we have three aims. First, we leverage standard vital statistics records to establish a modeling approach for comprehensively estimating prevalence, incidence and trends in all-cause orphanhood and co-residing grandparent caregiver loss. Second, we aim to estimate the numbers, rates, time trends and disparities in all-cause caregiver loss among children in the United States between 2000 and 2021, accounting for orphanhood and co-residing grandparent caregiver loss among the ~6.6 million (9.1%) US children who live with a grandparent who owned or rented their housing and provided some or all of their basic needs^31,32. Third, we aim to characterize the leading and evolving drivers of orphanhood in terms of causes of parental death among children in the United States between 2000 and 2021. We characterize leading causes of orphanhood and identify the extent to which the compounding crises of the COVID-19 pandemic and drug overdose epidemic in 2020 and 2021 were associated with escalating orphanhood. We aim to identify populations disproportionately affected, by child age, race and ethnicity, orphanhood type (maternal or paternal) and state, to advance evidence-based strategies responsive to social determinants of health. Our findings may strengthen or expand policies and programs that address parental and caregiver loss and its consequences (Table 1).

Table 1 Policy summary

Full size table

Results

Estimating all-cause orphanhood from vital statistics data

We developed a broadly applicable approach for inferring incidence and prevalence of orphanhood among children aged 0–17 years from any cause of parental death based on widely reported age- and sex-specific vital statistics on individual live birth and death registrations, along with population size estimates (Extended Data Fig. 1). Our approach, adapted from a previous study²⁰, centers on attributing to each deceased woman aged 15–66 years and each deceased man aged 15–94 years the average number of children orphaned using subgroup-specific fertility rates in the previous 0–17 years (Methods). Line-list vital statistics released by the National Center for Health Statistics (NCHS)³³ in the United States (Extended Data Figs. 2–4) are linked to information on race and ethnicity of decedents and causes of death reported in the ninth or tenth revision of the International Classification of Diseases (ICD-10), which we mapped into one of 53 rankable caregiver-loss cause-of-death categories (or ‘other’ category; Supplementary Table 1). This enables the estimation of orphanhood stratified by age of child, and by age, sex, race and ethnicity, and cause of death of parent, based on population-level administrative registration data rather than sampling-based survey data (Methods). Incidence calculations adjust for double counting in case the opposite-sex parent died previously, and prevalence of orphanhood is calculated by cumulatively summing incidence estimates by 1 year age of child in the current and previous 17 years, excluding children who would have since turned 18. For the United States, we further extended orphanhood estimates to include loss of co-residing primary (both grandparents providing care in the absence of parents and primary grandparent caregivers providing most of their grandchildren’s basic needs) and secondary grandparent caregivers serving as head of household who own or rent the family’s housing based on data from the US Census American Community Survey (ACS)³⁴. To avoid double counting of children affected by loss of both a parent and grandparent caregiver, all caregiver-loss totals were de-duplicated (Methods).

Trends in US orphanhood and grandparent caregiver loss

In 2021, we estimate that 494,036 (95% uncertainty interval (UI): 457,957–533,274) children experienced incident orphanhood or grandparent caregiver death from any cause in the United States corresponding to 0.71% of children (Table 2). Most children (82.5%) lost a parent, while 6.6% lost a primary grandparent caregiver providing care in the absence of parents or providing most of their grandchildren’s basic needs and 11.7% a secondary grandparent caregiver providing housing but not most other basic needs (Extended Data Fig. 5). Incidence of orphanhood and grandparent caregiver death combined increased from 2000 to 2019 by 11.7%, and then rapidly through the COVID-19 pandemic years 2020–2021 by 33.9% (Table 2). Prevalence of orphanhood and grandparent caregiver loss combined decreased slightly by 1.3% from 2000 to 2019, then increased by 9.4% during the pandemic years 2020–2021. In 2021, we estimate that 2,912,817 (95% UI: 2,654,936–3,202,040) children experienced prevalent orphanhood or caregiver death, representing 4.2% of all children, of whom 81.6% lost a parent, 7.5% a primary grandparent caregiver and 11.7% a secondary grandparent caregiver in their lifetime. These estimates are an order of magnitude higher than previous cause-specific reports describing children orphaned by HIV or AIDS (1998)³⁵, maternal cancers (2020)¹⁴, COVID-19 (2020–2022)¹³, drug overdose (2011–2021)¹⁵ and firearms¹⁶.

Table 2 Trends in all-cause orphanhood and grandparent caregiver (primary and secondary) loss from 2000 to 2021, before and during the COVID-19 pandemic

Full size table

Leading causes of orphanhood

Increases in caregiver-loss incidence between 2000 and 2021 were highest among children losing one or both parents (~56%; Table 2). We next focused on characterizing the drivers and trends in orphanhood in terms of the leading parental causes of death as of 2021 (Extended Data Fig. 1 and Supplementary Table 2): COVID-19, drug overdose, the remaining top five causes of death overall (heart disease, cancer, unintentional injuries, cerebrovascular diseases, chronic lung disease) and any additional causes of death in the top five for men and women aged 15–44 (suicide and homicide), as these ages include those more likely to be parents. We used a hierarchical approach for classifying cause of death; therefore, we singled out drug overdose from the three categories suicide, homicide and unintentional injuries (for example, motor vehicle crashes) (Methods and Supplementary Table 2).

Since 2020, drug overdose has been the leading cause of orphanhood incidence and prevalence, surpassing COVID-19 (Fig. 1). In 2000, we estimate that only 0.02% of children experienced orphanhood caused by parental drug overdose, increasing to 0.04% in 2012 and then sharply to 0.07% in 2019 and 0.10% in 2021.

**Fig. 1: Magnitude of children experiencing orphanhood in the United States.**

More broadly, the incidence and prevalence of orphanhood due to fatal injuries—comprising drug overdose, suicide, homicide and unintentional injuries—exceeded orphanhood due to leading chronic disease (heart disease and cancers) causes of death in parents. Orphanhood due to parental suicide decreased until 2008 and subsequently increased; orphanhood due to parental homicide and unintentional injuries decreased until 2019, then increased until 2021. With the coronavirus pandemic, orphanhood due to COVID-19 emerged in 2020. In comparison, orphanhood due to parental death from malignant neoplasms consistently decreased until 2000; orphanhood due to death from cardiovascular diseases remained relatively unchanged until 2019, then increased in the COVID-19 pandemic. The disproportionate impact of parental fatal injuries on orphanhood was evident in 2021, with drug overdose contributing 17.4% to orphanhood incidence, yet only 3.1% to adult mortality (Fig. 1 and Supplementary Table 3); similarly, unintentional injuries, suicide and homicide in parents contributed 8.3%, 5.0% and 3.7% to orphanhood, versus 3.6%, 1.2% and 0.7%, respectively, to adult deaths. We provide data by maternal and paternal orphanhood, co-residing grandparent caregiver loss and all caregiver loss causes of death in Supplementary Tables 4 and 5.

Primary factors linked with variations in orphanhood prevalence

We expected extensive heterogeneity in orphanhood burden by age and race and ethnicity of child and sex of parent as seen during the COVID-19 pandemic²⁰, and differences in the drivers of orphanhood along these strata especially due to different causes of death by sex and race and ethnicity in parent age ranges (Extended Data Figs. 2–4). The race and Hispanic origin of decedents were typically reported by the next of kin and recorded in varying formats over time, which we standardized to five race and ethnicity categories (Supplementary Tables 6–8), and we assumed that the race and ethnicity of the child matched those of the parent. Decedents of more than one race were not coded consistently across the time period required for orphanhood estimation and excluded in this study, corresponding in 2021 to 0.47% of deaths. In 2021, 1,691,918 children aged 10–17 years (5.2% of children) experienced prevalent orphanhood, 5.0 times more likely than children aged 0–4 years (Fig. 2 and Supplementary Table 9). Among the orphaned children, 66.8% lost their father in their lifetime and 33.2% lost their mother (Supplementary Table 10). Disparities were even larger across race and ethnicity. In 2021, orphanhood affected 6.4% of non-Hispanic American Indian or Alaska Native children, 4.7% of non-Hispanic Black children, 3.9% of non-Hispanic white children, 2.1% of Hispanic children and 1.7% of non-Hispanic Asian children (Supplementary Table 11), with additional heterogeneity by age of child (Fig. 2).

**Fig. 2: Differences and time trends in orphanhood among US children by age, sex, standardized race and ethnicity, and cause.**

Impact of drug overdose on orphanhood

The causes underpinning these disparities in orphanhood differed primarily by parental race and ethnicity and sex (Fig. 3, Extended Data Fig. 6 and Supplementary Tables 12 and 13). Although drug overdose was among the top three causes of both maternal and paternal orphanhood for almost all race and ethnicity groups (except Asian mothers and fathers), we found that causes other than drug overdose were the leading cause of orphanhood in every minoritized subgroup. The top cause of orphanhood was COVID-19 in fathers and chronic liver disease in mothers of non-Hispanic American Indian or Alaska Native children, heart disease in fathers and COVID-19 in mothers of non-Hispanic Black children, COVID-19 in fathers and mothers of Hispanic children, and heart disease and COVID-19 (tied) in fathers and cancers in mothers of non-Hispanic Asian children. This suggests that new standards of care and services need to be contextualized for each of the most vulnerable child populations. Our time trend analyses further showed that, except for cancers in mothers of non-Hispanic Asian children, orphanhood incidence increased substantively in all leading causes since 2000 (Fig. 3 and Extended Data Fig. 6).

**Fig. 3: Leading causes of orphanhood incidence among US children in 2021 by race and ethnicity and sex of parent.**

Extent of orphanhood across US states

To provide a reference for policy development at the state level, we next generated state-level estimates of cause-specific incidence and prevalence of orphanhood and caregiver death for 2021. NCHS live births and mortality data did not include information on state of residence from 2005 onward, and we used live births and mortality data stratified by 5 year age bands, sex and state from the Centers for Disease Control and Prevention (CDC) WONDER (https://wonder.cdc.gov/) with counts below 10 suppressed^12,36, which we accounted for using imputation and correction factors (Methods and Extended Data Fig. 7). In 2021, we estimate that 30 states had >3% of all children experiencing prevalent orphanhood, pervasively covering almost all regions of the United States (Fig. 4 and Supplementary Table 5). California, Texas and Florida had the highest orphanhood incidence and prevalence in 2021. West Virginia and New Mexico had the highest incidence (0.8–0.9%) and prevalence rates (4.5–5.0%) of orphanhood in 2021. Injury-associated parental deaths—including overdose, suicide and/or unintentional injury—were among the top two causes of orphanhood prevalence in 47 states (Extended Data Fig. 8 and Supplementary Table 14). Drug overdose was the leading cause of orphanhood prevalence in 30 states (Fig. 4), with high rates of orphanhood prevalence due to overdose in white children and also minoritized subgroups, where further evaluation by state and race and ethnicity was possible (Extended Data Fig. 9 and Supplementary Table 15).

**Fig. 4: Spatial distribution of US children experiencing orphanhood in 2021.**

Discussion

We estimate that in 2021 over 2.91 million children—4.2% of all children in the US—had lost a parent or a primary or secondary grandparent caregiver in their lifetime. The lives of these children are permanently affected by the loss of their fathers, mothers and co-residing grandparents who provided their homes, needs and care^4,20,31,37 (Table 2). Populations disproportionately impacted by all-cause orphanhood in 2021 included over 1.69 million adolescents aged 10–17 years (1 of every 20 adolescents) and children of non-Hispanic American Indian or Alaska Native, and non-Hispanic Black, race and ethnicities (approximately 1 of 15 and 1 of 20 children, respectively). We observed the highest orphanhood burden among non-Hispanic American Indian or Alaska Native adolescents—approximately 1 of 10 children—on par with the 9% orphanhood prevalence among children in sub-Saharan Africa across 40 countries early in the HIV pandemic³⁸. Five states with the highest rates of orphanhood prevalence—West Virginia, New Mexico, Mississippi, Louisiana and Kentucky (approximately 1 of every 25 children)—also had the highest poverty ranking³⁴, indicating the wider implications of poverty to the premature death of parents spawning a hidden generation of orphanhood among their bereaved children^39,40.

Over the past two decades, the prevalence of all-cause orphanhood and caregiver death decreased slightly until 2012 and then increased from 2013 to 2021 to historic levels, with intersecting crises of the overdose epidemic and COVID-19 pandemics^41,42. Our data showed that orphanhood incidence rates due to drug overdose escalated during the pandemic, surpassing COVID-19 as the leading parental cause of death. These findings coupled with modeling and evidence suggest that both crises amplified each other syndemically. Modeling data show that the pandemic was associated with an increased drug overdose risk^43,44, and epidemiologic data demonstrate that substance misuse appeared to increase nonadherence to CDC COVID-19 mitigation guidelines⁴⁵.

We quantified the full scale of orphanhood and grandparent-caregiver-loss burden among children in the United States to inform the scope of action needed for an adequate public health response. Our findings show that all-cause orphanhood is over ten times greater than cause-specific orphanhood due to HIV and AIDS³⁵, maternal cancers¹⁴, COVID-19 (ref. ¹³), and overdose and firearms^15,16. A policy framework that sustains short- and long-term responses to orphanhood-linked threats for nearly three million children is necessary. This would require a comprehensive approach addressing needs shared across orphanhood causes, and the disparities affecting population subgroups, geographies and orphanhood causes. Our data suggest specific priorities for evidence-based action, from tackling drug overdose as a leading parental cause of death, addressing disparities by race and ethnicity, and prioritizing support in states with the greatest endemic poverty as they also have the highest orphanhood burden.

Effective services at the federal, state, municipal and community levels are key to supporting population-based approaches that address both all-cause and cause-specific orphanhood, and its consequences that linger over time and vary in impact. Several robust strategies can help guide policy responses: the WHO INSPIRE package for ending violence against children^46,47 proposes life-course approaches to guide individual, familial, community and societal interventions, including addressing legal strategies, norm changes, safe environments, parenting support, income strengthening, improving response services, and education and life skills^19,20,48,49. The ‘prevent–prepare–protect’ strategy⁴⁹ seeks to prioritize preventing the death of parents and caregivers by accelerating equitable access to health and social services, preparing families and caregivers to provide safe and nurturing family-based support, and protecting children using evidence-based strategies that address their poverty, childhood adversity and violence risks, and strengthen their recovery. The ‘prevent–prepare–protect’ strategy can strengthen health equity shared across parental death causes by addressing shared disparities such as poverty and racism. It may also address priorities across causes, including access to health and social services, supporting kinship and family-based care, and ensuring that each affected child is protected and has access to safety, nutrition, school and nurturing family care⁵⁰. To guide such priorities, promptly identifying, assessing and referring children experiencing orphanhood to services is paramount. For example, the state of Utah is piloting a program to support children who have lost caregivers, which engages schools to identify bereaved children⁶. Others have called for policies that include a checkbox on the death certificate to identify children living in the home of the deceased, so that bereaved children can be systematically linked in near real time to mutually reinforcing services that provide parenting, economic and education support⁵¹. Such interventions could also ensure that standards of care are age appropriate, inclusive and nurturing; acknowledge individual, familial, community and structural inequalities; and include children bereaved by the death of their undocumented parents or grandparents.

Disaggregated data on leading causes of orphanhood by age, sex, and racial and ethnic groups, and across geographies, may help tailor policies for affected families and communities^2,20,49. The evolving nature of drug supply, including the proliferation of illegally made fentanyl and the resurgence of stimulants such as methamphetamine¹⁶, has affected younger adults still caring for children in almost every state^52,53. We found that in 2021, fatal injuries—drug overdose, suicide, homicide and unintentional injuries—were among the top two causes of orphanhood incidence and prevalence in 48 states. In the United States, severe injuries cluster in structurally marginalized neighborhoods with higher unemployment, poverty, racial and ethnic minority residents, and lower education and income levels⁵⁴. For communities with high prevalences of non-Hispanic Black fathers for whom homicide was a leading orphanhood cause, contextualized policies to ‘prevent’ homicide-linked deaths might include violence prevention programs, whereas ‘preventing death’ among minoritized populations for whom chronic diseases were a leading cause might involve lifestyle interventions and smoking cessation programs. The need to ‘prepare’ guardianship options for children whose parents may have an elevated risk of premature death, as recognized for parents living with HIV, may be important for other subgroups, such as those who had survived opioid overdose. Finally, addressing child bereavement to ‘protect’ mental health may require more intensive services for losses associated with greater stigma among surviving children such as those whose parents died of homicide, suicide or overdose⁵⁵.

By addressing all-cause orphanhood and grandparent caregiver loss, our findings extend previous reports^15,16 of the scale of overdose-linked parental loss in the United States. In particular, while overdose remains a leading cause of premature parental death for every race and ethnicity group, the leading cause of both maternal and paternal orphanhood in 2021 for every minority subgroup was not overdose, but included COVID-19, liver diseases and cirrhosis, heart disease and cancers. Our findings also highlight the importance of targeted policies, so that they are responsive to differences between structurally marginalized groups in leading causes of parental death¹⁵.

Our study has several limitations. First, cause-specific estimates of children experiencing orphanhood and grandparent caregiver deaths are derived from cause-specific mortality statistics and may be underestimates⁵⁶ for causes associated with erroneous or incomplete reporting, uncertainty in the chain of events preceding death or coding limitations—such as for COVID-19, drug overdose or suicide⁵⁷. Furthermore, we attributed only one child per primary or secondary grandparent caregiver loss. The total numbers of children affected by caregiver loss are underestimated, particularly given our approach to de-duplicating numbers of children affected by both parent and grandparent caregiver loss. We also assumed that current mortality is unrelated to historic fertility for years before 1990, and sensitivity analyses suggest this may¹⁵ lead to inaccuracies in orphanhood prevalence estimates until 2007 (Extended Data Fig. 10). Similarly, we assumed no correlations between reported sex and race and ethnicity grandparent caregiver characteristics, and this may have added to inaccuracies. Publicly available state-specific data were also partly suppressed owing to small counts; therefore, we cannot exclude bias in state-specific estimates. We did not account for any of those 4.4 million US citizen–children living with an undocumented parent who died, leading to further underestimation of caregiver loss⁵⁸. Finally, to characterize caregiver-loss prevalence in 2000–2021, we had to consider vital statistics since 1983 and consolidate changes in cause of death and race and ethnicity coding. Our sensitivity analyses (Methods and Extended Data Fig. 10), including detailed comparisons of NCHS with CDC WONDER vital statistics for each of the 21 years’ calculations (Supplementary Table 16), suggest that our incidence and prevalence estimates are robust minimum estimates of orphanhood and primary or secondary grandparent caregiver loss.

In conclusion, we estimate that at least 2.91 million children and adolescents in the United States have experienced orphanhood or the loss of a primary or secondary grandparent caregiver. These children require evidence-based responses that ensure housing stability and provide healing and support. Given unprecedented rates of drug overdose and the lasting mental health and economic impacts of the COVID-19 pandemic, it is essential to prepare for the possibility that children living with a parent negatively affected by substance misuse may experience the loss of that parent^47,59. It is also essential to recognize that causes other than overdose resulted in the highest rates of orphanhood among all minoritized subgroups, highlighting the need for prevention strategies addressing disparities. The burden and policy relevance of orphanhood and caregiver loss has global ramifications, such as in Africa, where approximately 10% of all children had been orphaned by all-cause orphanhood in 2021, or in Latin America, where orphanhood linked to COVID-19 has been disproportionately high^2,60. The ‘prevent–prepare–protect’ policy framework is relevant for any setting in which structural inequalities modulate health outcomes of specific subgroups. Given the scales of US national and global burden of orphanhood and grandparent caregiver loss among children, implementing policies to build their recovery and resilience is a public health and moral imperative.

Methods

To estimate the magnitude, time trends and inequities in all-cause orphanhood and co-residing grandparent-caregiver-loss incidence and prevalence among US children, we extended a modeling methodology of COVID-19-associated orphanhood and caregiver death^20,48 according to the Guidelines for Accurate and Transparent Health Estimates Reporting. The following sections summarize our methods.

Study populations

The United Nations Children’s Fund defines orphanhood as children experiencing the death of one or both parents^1,2. As previously²⁰ we considered mothers of ages 15–66 years and fathers of ages 15–94 years, so the maximum ages of parents at the birth of a child were, respectively, 49 and 77years. Mortality data were recorded among US residents, and for this reason, orphanhood estimates are restricted to children of US residents. Grandparents play indispensable roles as caregivers for children^{31,37,61,62,63,64}; therefore, we include as previously²⁰ minimum estimates of children who lost a primary grandparent caregiver, defined as a co-residing, custodial grandparent aged 30 years or older and providing care in the absence of a parent, or providing for most of their basic needs in the presence of a parent, and children who lost a secondary grandparent caregiver defined as a co-residing grandparent aged 30 years or older serving as head of household who owns or rents the family’s housing and provides for some but not most of the basic needs of their grandchildren^32,64. Mortality data were recorded among US residents, and so grandparent caregiver death estimates are also restricted to children of US-resident grandparent caregivers.

National-level NCHS mortality data by rankable causes of death, 1983–2021

We obtained line-list mortality data on US residents from the NCHS Vital Statistics portal for each year from 1983 to 2021 (https://www.cdc.gov/nchs/data_access/vitalstatsonline.htm). Data were collected from 1983 onward because the corresponding children who lost a caregiver in 1983 at age 0 were of age 17 in 2000, and so entered our estimation of orphanhood prevalence in 2000. For each mortality record, we retained year of death, the corresponding codes of the underlying cause of death (ICD-9 code and 282 cause recode before 1999; ICD-10 code and 113 cause recode after 1999) and demographic data of the decedent including sex, age at death and information on race and Hispanic origin (https://www.cdc.gov/nchs/nvss/mortality_public_use_data.htm). Data on Hispanic origin were not available for 1983. Individuals of other races in 1984–1991 and individuals of more than one race in 2021 were not coded consistently and not included in this study (less than 0.019% (17,454) of line-list records were removed).

Information about the race and Hispanic origin of decedents in death certificates is typically self-reported by the surviving next of kin, or on the basis of observation in the absence of an informant⁶⁵. Race and Hispanic origin were reported in different formats across the study period⁶⁶, which we mapped to standardized race and Hispanic origin categories as described in Supplementary Table 6. Specifically, we grouped individuals of Hispanic origin and all individuals of non-Hispanic origin by their race, that is, ‘Hispanic’, ‘non-Hispanic American Indian or Alaska Native’, ‘non-Hispanic Asian or Pacific Islander’, ‘non-Hispanic Black’ and ‘non-Hispanic white’, and we refer to the resulting categories as ‘standardized race and ethnicity’ for simplicity. This approach to harmonizing race reporting over the study period did not necessarily make the primary data fully comparable. Earlier research shows inaccuracies are limited⁶⁷, which indicates that the incremental implementation of multiple race reporting in the United States is unlikely to have introduced notable bias in orphanhood estimates.

The underlying cause of death is defined by the WHO as “the disease or injury which initiated the train of events leading directly to death, or the circumstances of the accident or violence which produced the fatal injury”⁶⁸. For 1983–1998, underlying causes of death were in the data classified with the ICD-9 (ref. ⁶⁹) and grouped further into 282 selected causes of death, termed ‘282 cause recode’⁷⁰. For 1999–2021, underlying causes of death were in the data classified with the ICD-10 and grouped further into ‘113 Selected Causes of Death’⁷¹. We defined 53 non-overlapping rankable underlying causes of death that we termed ‘caregiver-loss causes of death’ and that apart from the following small modifications are identical to the 52 rankable underlying causes of death from the NCHS 113 Selected Causes of Death list^71,72 (Supplementary Table 1). Specifically, we re-categorized drug-induced causes of death into ‘drug overdose’, joining the ICD-9 and ICD-10 causes of death ‘intentional self-poisoning by and exposure to nonopioid analgesics, antipyretics and antirheumatics’, ‘intentional self-poisoning by and exposure to antiepileptic, sedative–hypnotic, antiparkinsonism and psychotropic drugs, not elsewhere classified’, ‘intentional self-poisoning by and exposure to narcotics and psychodysleptics (hallucinogens), not elsewhere classified’, ‘intentional self-poisoning by and exposure to other drugs acting on the autonomic nervous system’ and ‘intentional self-poisoning by and exposure to other and unspecified drugs, medicaments and biological substances’ (E950.0–E950.5; X60–X64); ‘assault by drugs, medicaments and biological substances’ (E962.0; X85); ‘accidental poisoning by and exposure to nonopioid analgesics, antipyretics and antirheumatics’, ‘accidental poisoning by and exposure to antiepileptic, sedative–hypnotic, antiparkinsonism and psychotropic drugs, not elsewhere classified’, ‘accidental poisoning by and exposure to narcotics and psychodysleptics (hallucinogens), not elsewhere classified’, ‘accidental poisoning by and exposure to other drugs acting on the autonomic nervous system’ and ‘accidental poisoning by and exposure to other and unspecified drugs, medicaments and biological substances’ (E850–E858; X40–X44); and ‘poisoning by and exposure to nonopioid analgesics, antipyretics and antirheumatics, undetermined intent’, ‘poisoning by and exposure to antiepileptic, sedative–hypnotic, antiparkinsonism and psychotropic drugs, not elsewhere classified, undetermined intent’, ‘poisoning by and exposure to narcotics and psychodysleptics (hallucinogens), not elsewhere classified, undetermined intent’, ‘poisoning by and exposure to other drugs acting on the autonomic nervous system, undetermined intent’ and ‘poisoning by and exposure to other and unspecified drugs, medicaments and biological substances, undetermined intent’ (E980.0–E980.5; Y10–Y14). We retained these four drug overdose causes of death sub-categories as separate drug overdose subgroups for the purpose of data harmonization (see below). Correspondingly, we removed the related three drug overdose causes of death sub-categories, respectively, from ‘intentional self-harm’, ‘assault’ and ‘accidents’. We renamed these resulting three categories as ‘suicide excluding drug overdose’, ‘homicide excluding drug overdose’ and ‘unintentional injuries excluding drug overdose’. Supplementary Table 2 summarizes our aggregation of the 53 rankable causes of death into the leading parental cause-of-death groups and ‘other’ parental causes of death that we refer to in the main text and are shown in Fig. 1. The ‘other’ parental causes of death comprise the remaining 46 rankable caregiver-loss causes of death and any other causes of death that are not included in the NCHS 52 rankable causes of death.

We then mapped and aggregated line-list mortality records to the 53 rankable caregiver-loss causes of death. This was done by mapping the 1983–1998 line-list data using the 282 recodes to the 52 NCHS rankable causes of death as described in ref. ⁷³. For drug-induced causes, we used Table 2 in ref. ⁷⁴. For 1999–2021, we mapped the ICD-10 113 cause recodes to the NCHS 52 rankable causes of death based on Table A in ref. ⁷⁵, and subsequently mapped the NCHS 52 rankable causes of death to the 53 rankable caregiver-loss causes of death based on the descriptions in Supplementary Table 2.

Next, we aggregated line-list death records to annualized death counts by the 53 rankable caregiver-loss causes of death for each year in 1983–2021, as well as by sex, age band (15–19, 20–24, 25–29, 30–34, 35–39, 40–44, 45–49, 50–54, 55–59, 60–64, 65–69, 70–74, 75–79, 80–84 and 85+ years) and standardized race and ethnicity of the decedent. Data on race and ethnicity were not available for 1983, and for this year, we attributed mortality counts to standardized race categories according to the age-, sex- and cause-of-death-specific standardized race compositions in 1984. To harmonize cause-of-death data from 1983 to 1999 to the ICD-10 cause-of-death classifications and avoid discontinuities in our orphanhood estimates from 1998 to 1999, we used where available the comparability ratios in Table 1 of ref. ⁷². Thirteen comparability ratios of rankable causes of death were not provided when underlying estimations were considered imprecise⁷², and in these cases, we set the comparability ratios to 1.

Extended Data Fig. 2 illustrates the aggregated annual mortality data among US residents by standardized race categories and Extended Data Fig. 3 by leading parental causes of death as relevant for caregiver loss and orphanhood.

National-level live birth data, 1969–2021

We required live birth data before 1983 to calculate fertility rates and attribute children experiencing orphanhood to descendants between 1983 and 2021. This is because the children who lost a caregiver in 1983 at age 1–17 years were born between 1966 and 1982. However, due to limitations in publicly available population size data, we considered only live birth data from 1990 in the central analysis and assumed constant fertility rates before 1990. We investigated the sensitivity of our orphanhood estimates to this assumption using live birth records since 1980 together with corresponding population size estimates, and found that orphanhood prevalence estimates since 2008 were not affected by our assumptions on historic fertility, while in the sensitivity analysis, orphanhood prevalence estimates for 2000–2007 were slightly lower due to overall lower fertility rates in the 1980s (Extended Data Fig. 10a). Line-list live births with demographic data on both mothers and fathers were available and downloaded from the NCHS Vital Statistics portal (https://www.cdc.gov/nchs/data_access/vitalstatsonline.htm) for each year between 1969 and 2021. We considered only live birth records to US-resident mothers for consistency with the mortality data available. Information on residency status of fathers was not available, and we assumed fathers were also US residents in the retained birth records associated with US-resident mothers. For each live birth, we retained the year of birth, the age of mothers and fathers, and information on race and Hispanic ethnicity (https://www.cdc.gov/nchs/data_access/vitalstatsonline.htm). The age of mothers and fathers was reported by single year of age. Following ref. ²⁰, we considered live births to women aged 15–49 years and men aged 15–77 years, to match the mortality data of women aged 15–66 years and men aged 15–94 years. In total, less than 0.012% (19,367) of line-list records were removed from further analysis because parents were outside of these age ranges or demographic information was unreported or not stated. Age information was mapped to 5 year age bands 15–19 years, …, 45–49 years for mothers and age bands 15–19 years, …, 50–54 years and 55–77 years for fathers. Information about the race and Hispanic origin of mothers and fathers was between 1969 and 2021 self-reported in different standards due to the revision of the US certificates of live births on race in 1989 and 2003⁷⁶. Information on the race of mothers and fathers has been publicly available since 1969, and information on their ethnicity since 1978. For 1978–2021, we mapped available information on race and ethnicity to the same standardized race and ethnicity categories used to stratify the mortality data, according to Supplementary Table 7. Individuals of multiple races were excluded (less than 0.23% (398,424) of line-list natality records were removed).

Next, we aggregated line-list live birth records to annualized live birth counts to mothers (in age bands 15–19, …, 45–49 years) and fathers (in age bands 15–19, …, 55–77 years) for each year in 1969–1977, and stratified further by standardized race and ethnicity categories for each year in 1978–2021. Before 1985, data reporting was incomplete for some US states and we used NCHS sampling weights reported in each year (see Appendix A of ref. ⁷⁷ to extrapolate reported live birth counts to state populations.

National-level population size data, 1990–2021

We further required population size data of US residents in the age, sex and standardized race and ethnicity categories of the live birth data to calculate fertility rates. We obtained CDC WONDER Vintage bridged-race postcensal and ethnicity population size estimates for each year in 1990–1999 and 2000–2020 (https://wonder.cdc.gov/wonder/help/bridged-race.html#About%201990-2020) and single race population size estimates for 2021 (https://wonder.cdc.gov/wonder/help/single-race.html#About%202020-2021). Data were extracted by 5 year age bands (15–19 years, …, 85+ years) and single years of age from 75 to 77 years, and data for individuals aged 55–77 years were summed into a single age band as required for the purposes of our analyses. Information about race and Hispanic origin was self-reported during the US Census. Race-specific population size estimates were aggregated to the standardized race and ethnicity categories described in Supplementary Table 8.

For sensitivity analyses (see below), we also obtained national-level population size estimates by 5 year age bands, sex and US states without race and ethnicity stratification for each year in 1969–1989 from the US National Cancer Institute Surveillance, Epidemiology, and End Results Program (https://population.un.org/wpp/Download/Standard/Mortality/).

Statistical analysis

Estimating national-level fertility rates, 1990–2021

We calculated age-, sex- and standardized race and ethnicity-specific fertility rates in each year in 1990–2021 for women in one of the age bands a ∈ {15–19, …, 45–49} years and men in one of the age bands a ∈ {15–19, …, 55–77} years according to

$${\rm{F{R}}}_{y,a,s,r}=\frac{{B}_{y,a,s,r}}{{P}_{y,a,s,r}},$$

(1)

where the number of live births and population sizes in each strata are denoted by B_y,a,s,r and P_y,a,s,r, respectively. Calculations were done for each of the five standardized race categories ‘Hispanic’, ‘non-Hispanic American Indian or Alaska Native’, ‘non-Hispanic Asian or Pacific Islander’, ‘non-Hispanic Black’ and ‘non-Hispanic white’. In the central analysis, we assumed the same fertility rates in each year in 1966–1989 as in 1990 as shown in Extended Data Fig. 4, and considered alternative assumptions in several sensitivity analyses (see below). We calculated mortality rates in analogy to equation (1) and found correlations by standardized race and ethnicity between fertility and mortality rates that changed primarily by age of mothers and fathers, and less so over calendar years. These correlations prompted us to estimate national-level incidence and prevalence of orphanhood by standardized race and ethnicity and then sum the standardized race and ethnicity-specific estimates to obtain national-level estimates.

Estimating national-level orphanhood, 1983–2021

We estimated the number of children who newly experienced orphanhood in year y, y = 1983, …, 2021, from the population-level mortality records of US residents in year y and the number of children each decedent was expected to leave behind. We obtained the expected number of children per US resident of age a years, sex s and standardized race and ethnicity r in year y who are of age b = 0, 1, …, 17 years (denoted by C_y,a,s,r,b) by multiplying the corresponding fertility rates of equation (1) with pediatric survival probabilities of children born in year y − b and surviving until age b + 1 (denoted with ${p}_{y-b,b+1}^{{\rm{survive}}}$). Specifically

$${C}_{y,a,s,r,b}={\rm{F{R}}}_{y-b,a-b,s,r}\times {p}_{y-b,b+1}^{{\rm{survive}}},$$

(2)

where y ranges from 1983 to 2021, the single year of age of mothers ranges from a ∈ {15, …, 66} years, the single year of age of fathers ranges from a ∈ {15, …, 94} years, the standardized race and ethnicity categories are as described before, and b = 0, 1, …, 17 years. We obtained the pediatric survival probabilities from child mortality data (https://population.un.org/wpp/Download/Standard/Mortality/) and use in equation (2) the fertility rates of the age band that includes age a − b. We then estimated the number of children aged b who newly experienced in year y the death of a parent s of age specified in 5 year age bands ${a}^{{\prime} }\in {\mathcal{A}}$ = {15–19, …, 80–84, 85+ years}, and standardized race and ethnicity r who died of caregiver-loss cause of death c by

$${O}_{y,{a}^{{\prime} },s,r,b,c}^{{\rm{death}}\,{\rm{of}}\,{\rm{parent}}}={C}_{y,{a}^{{\prime} },s,r,b}\times {D}_{y,{a}^{{\prime} },s,r,c},$$

(3)

assuming that parents and their children have the same standardized race and ethnicity and where the expected number of children of parents in age bracket ${a}^{{\prime} }$, ${C}_{y,{a}^{{\prime} },s,r,b}$, is calculated as the mean over the expected number of children of parents aged $a\in {a}^{{\prime} }$ in equation (2). Due to the correlations between standardized race and ethnicity-specific fertility and mortality rates, we calculated equation (3) for each standardized race and ethnicity. In equation (3), we assumed that population-level fertility rates are not correlated with population-level mortality rates, which may lead to upward or downward bias in orphanhood estimates that we explored in several sensitivity analyses (see below and Extended Data Fig. 10a–c).

As orphanhood considers children who experienced the death of their mother, father or both, we are interested in the sum of equation (3) for both mothers and fathers, but need to subtract children who lost their other parent in the previous b − 1 years, or who lost their other parent in the same year y. We assumed that the other parent 1 − s is in the same age band ${a}^{{\prime} }$ and of the same standardized race and ethnicity as parent s. The probability that the other biological parent of the children in equation (3) died in the same year y is based on standard life table calculations⁷⁸, specifically the probability that an individual of sex 1 − s and standardized race and ethnicity r died in year y between (continuous) age a and a + n conditional on survival up to age a, where n = 5 corresponds to the width of the 5 year age bands considered. This mortality hazard is approximated using midpoints x = (a + a + 5)/2 in each age interval, through

$${\scriptstyle{\atop {n}}}{h}_{x}={\frac{{{\scriptstyle{\atop n}}}f(x)}{S(x)}}={\frac{1}{n}}{\frac{{{\scriptstyle{\atop n}}}{q}_{a}S(a)}{{\frac{1}{2}}{\left(S(a)+S(a+n)\right)}}},$$

(4a)

$$=\frac{1}{n}\frac{{\scriptstyle{\atop n}}{q}_{a}S(a)}{\frac{1}{2}(S(a)+(1-_{n}{q}_{a})S(a))}=\frac{1}{n}\frac{{2}_{n}{q}_{a}}{2-_{n}{q}_{a}}=\frac{1}{n}\frac{{\scriptstyle{\atop n}}{D}_{a}}{{\scriptstyle{\atop n}}{P}_{a}},$$

(4b)

where for ease of readability we have suppressed y, s and r, and _nP_a and _nD_a are respectively the estimated population sizes and observed death counts by the end of the corresponding calendar year in each 5 year age band ${a}^{{\prime} }=\left[a,a+5\right)$. The intermediate quantities in equation (4) are the approximated (unknown) mortality probability density function _nf(x) at age midpoint x and (unknown) survival function S(a) up to age a, which can be expressed in terms of the age-specific mortality rate _nq_a defined as the proportion of individuals alive at age a and who die before reaching age a + n in the corresponding calendar year. It is standard to estimate _nq_a with $({\scriptstyle{\atop n}}{D}_{a})/({\scriptstyle{\atop n}}{P}_{a}+\frac{1}{2}{\scriptstyle{\atop n}}{D}_{a})$, from which equation (4) follows⁷⁸. Using equation (4), we estimated the number of children aged b who newly experienced in year y the death of one parent due to cause-of-death c and the death of the other parent due to any cause with

$${O}_{y,{a}^{{\prime} },s,r,b,c}^{{\rm{new}}\,{\rm{double}}}=_{5}{h}_{y,x,1-s,r}\times {C}_{y,{a}^{{\prime} },s,r,b}\times {D}_{y,{a}^{{\prime} },s,r,c},$$

(5)

where x is the midpoint age in age band ${a}^{{\prime} }$. In equation (5), we assumed that deaths among parents occurred independently of each other and ignored correlations of deaths among parents by the same cause of death such as COVID-19 (ref. ²⁰), as well as correlations of deaths among parents who died of different causes of death. Following the same rationale, we estimated the number of children aged b who newly experienced in year y the death of one parent s due to cause-of-death c and the death of their other parent 1 − s due to any cause in any of the previous i = 1, …, b − 1 years with

$${O}_{y,{a}^{{\prime}},s,r,b,c}^{{\rm{previous}}}=\left(\frac{1}{5}\sum _{x-i\in {a}^{{\prime} }}\mathop{\sum }\limits_{x=1}^{b-1}{\scriptstyle{\atop 5}}{h}_{y-i,x-i,1-s,r}\right)\times {C}_{y,{a}^{{\prime} },s,r,b}\times {D}_{y,{a}^{{\prime} },s,r,c}.$$

(6)

With these considerations, we estimated the number of children aged b who newly experienced orphanhood in year y = 1983, …, 2021, by the death of one or both parents of age ${a}^{{\prime} }$ and standardized race and ethnicity r who died of cause-of-death c with

$${O}_{y,{a}^{{\prime} },r,b,c}^{{\rm{new}}}=\quad {O}_{y,{a}^{{\prime} },s,r,b,c}^{{\rm{death}}\,{\rm{of}}\,{\rm{parent}}}+{O}_{y,{a}^{{\prime} },1-s,r,b,c}^{{\rm{death}}\,{\rm{of}}\,{\rm{parent}}}$$

(7a)

$$-\left({O}_{y,{a}^{{\prime} },s,r,b,c}^{{\rm{new}\,{double}}}+{O}_{y,{a}^{{\prime} },1-s,r,b,c}^{{\rm{new}\,{double}}}\right)/2$$

(7b)

$$-{O}_{y,{a}^{{\prime} },s,r,b,c}^{{\rm{previous}}}-{O}_{y,{a}^{{\prime} },1-s,r,b,c}^{{\rm{previous}}}.$$

(7c)

Equation (7b) subtracts the children who lost in year y a parent owing to cause c and the other parent owing to any cause, which are counted twice in equation (7a). Without line-list family data and working from individual-level live birth and death statistics, the two terms ${O}_{y,{a}^{{\prime} },s,r,b,c}^{{\rm{new}\,{double}}}$ and ${O}_{y,{a}^{{\prime} },1-s,r,b,c}^{{\rm{new}\,{double}}}$ are not identical, and for this reason, we subtracted the average of both. Equation (7c) subtracts the children who already lost the other parent in previous years. In previous studies on COVID-19-associated orphanhood²⁰, we did not consider the possibility of reinfection with COVID-19 and for this reason did not subtract children who already lost the other parent owing to COVID-19 in previous years in these studies. Further arguments show that across ages, standardized race and ethnic groups, and caregiver-loss causes of death, the values in equations (7b) and (7c) are approximately equal to orphanhood prevalence divided by four, and in the US context remain below 1% of the average values in equation (7a).

To estimate the prevalence of orphanhood in year y = 2000, …, 2021, we accrued the number of children who newly experienced orphanhood in the previous 17 years and current year y, while accounting for aging. Specifically, for each calendar year since 2000, we estimated the total number of children aged b = 0, …, 17 years in calendar year y and of race and ethnicity r who experienced orphanhood and survived in their lifetime by cause-of-death c in one or both parents with

$${O}_{y,r,b,c}^{{\rm{lifetime}}}=\mathop{\sum }\limits_{i=0}^{b}\left(\sum _{{a}^{{\prime} }}{O}_{y-i,{a}^{{\prime} },r,b-i,c}^{{\rm{new}}}\times \mathop{\prod }\limits_{j=1}^{i}(1-_{1}{h}_{y-j,r,b-j})\right),$$

(8)

which sums over children who newly experienced orphanhood at younger ages in previous years conditional on survival up to the time point y + 1, where ₁h_y,r,b is as described in equation (4). In equation (8), j does not start at 0 because we already conditioned on survival up to the current year in the incidence calculations via equation (2). For 2021, the downward adjustments in equation (8) accounting for survival amounted to less than 1.5% of the prevalence count.

Following equations (7) and (8), we derived additional key quantities such as the number of children aged b in calendar year y = 1983, …, 2021 and standardized race and ethnicity r who newly experienced orphanhood in year y by parental sex s and cause-of-death c

$${O}_{y,s,r,b,c}^{{\rm{new}}}=\sum _{{a}^{{\prime} }}{O}_{y,{a}^{{\prime} },s,r,b,c}^{{\rm{death}}\,{\rm{of}}\,{\rm{parent}}}-{O}_{y,{a}^{{\prime} },s,r,b,c}^{{\rm{previous}}};$$

(9)

so maternal and paternal orphanhood incidence estimates each include children who experience the death of both parents^20,79. Furthermore, we derived the number of children aged b in calendar year y = 2000, …, 2021 and standardized race and ethnicity r who experienced orphanhood in their lifetime by parent sex s and cause-of-death c by

$${O}_{y,s,r,b,c}^{{\rm{lifetime}}}=\mathop{\sum }\limits_{i=0}^{b}\left({O}_{y-i,s,r,b-i,c}^{{\rm{new}}}\times \mathop{\prod }\limits_{j=1}^{i}(1-_{1}{h}_{y-j,r,b-j})\right);$$

(10)

the number of children of standardized race and ethnicity r who experienced orphanhood in calendar year y = 2000, …, 2021 in their lifetime by cause-of-death c in one or both parents by

$${O}_{y,r,c}^{{\rm{lifetime}}}=\sum _{{a}^{{\prime} }}\mathop{\sum }\limits_{i=0}^{17}\mathop{\sum }\limits_{b=0}^{17-i}\left({O}_{y-i,{a}^{{\prime} },r,b,c}^{{\rm{new}}}\times \mathop{\prod }\limits_{j=1}^{i}(1-{}_{1}{h}_{y-j,r,b-j})\right).$$

(11)

All other total numbers reported in this paper are aggregations of equations (7)–(11).

Estimating national-level grandparent caregiver loss, 1983–2021

We estimated the number of children who newly experienced grandparent caregiver death in year y, y = 1983, …, 2021, from the population-level mortality records in year y of US residents aged 30 years and above (30+). Starting from 2010, ACS³² collected data on the proportion ${\gamma }_{y}^{{\text{co-reside}}}$ of adults aged 30+ years living with their grandchildren of age 17 or under in the United States, and proportions of these by sex, and separately by bridged-race and Hispanic origin (https://data.census.gov/cedsci/table?tid=ACSST5Y2019.S1002). Information about the bridged-race and Hispanic origin was self-reported. In addition, the ACS derive data on the proportion ${p}_{y}^{{\text{most}\,\text{responsible}}}$ of those who provide most of the care to any of their grandchildren through the question, ‘Is this grandparent currently responsible for providing most of the basic needs of any children under the age of 18 years and living in this house or apartment?’, and further derive information on the proportion ${q}_{y}^{{\text{skip gen}}}$ of those who are responsible for children in the absence of parents through column ‘Householder or spouse responsible for grandchildren with no parent of grandchildren present’ in Table S1002. From these data, we estimated the sex- and race and ethnicity-specific proportions of adults aged 30+ years who respectively are most responsible for the basic needs of grandchildren in the absence of a parent, who are most responsible for the basic needs of grandchildren in the presence of a parent and, finally, who serve as head of household who own or rent the family’s housing and provide for some but not most of the basic needs of their grandchildren^31,32 by

$${\gamma }_{y,s,r}^{{\text{skip gen}}}={\gamma }_{y,s}^{{\text{co-reside}}}\times {p}_{y,r}^{{\text{co-reside}}}\times {p}_{y}^{{{\text{most}\,\text{responsible}}}}\times {q}_{y}^{{\text{skip gen}}}$$

(12a)

$${\gamma }_{y,s,r}^{{{\text{most responsible not sg}}}}={\gamma }_{y,s}^{{\text{co-reside}}}\times {p}_{y,r}^{{\text{co-reside}}}\times {p}_{y}^{{\text{most responsible}}}\times \left(1-{q}_{y}^{{\text{skip gen}}}\right)$$

(12b)

$${\gamma }_{y,s,r}^{{\text{co-reside not mr}}}={\gamma }_{y,s}^{{\text{co-reside}}}\times {p}_{y,r}^{{\text{co-reside}}}\times \left(1-{p}_{y}^{{\text{most responsible}}}\right).$$

(12c)

From 2010 to 2021, the proportions of grandparent caregivers providing for most of the basic needs of their grandchildren with or without a parent present (respectively ${\gamma }_{y,s,r}^{\text{skip gen}}$ and ${\gamma }_{y,s,r}^{{\text{most responsible not sg}}}$) declined over time, whereas the proportions of grandparent caregivers providing housing and for some but not most of the needs of their grandchildren (${\gamma }_{y,s,r}^{{\text{co-reside not mr}}}$) increased, and so we expected different trends in grandparent caregiver loss across these categories. We then assumed that each grandparent caregiver leaves upon death a minimum of one child behind (corresponding to equation (2)) and estimated the minimum number of grandchildren who newly experienced grandparent caregiver death in year y = 1983, …, 2021, with a US-resident grandparent caregiver aged 30+ years, sex s and standardized race and ethnicity category r who died of leading cause c by

$${G}_{y,s,r,c}^{x}=1\times {\gamma }_{y,s,r}^{\,x}\times \sum _{{a}^{{\prime} }\ge 30}{D}_{y,{a}^{{\prime} },s,r,c},$$

(13)

where x represents the three types of grandparent caregivers in equation (12), and assuming that ${\gamma }_{y,s,r}^{x}$ is for y = 1983, …, 2009 the same as in 2010. As illustrated in Extended Data Fig. 1, we then estimated the number of grandchildren who newly experienced the death of respectively a primary or secondary grandparent caregiver of sex s and race and ethnicity r due to caregiver-loss cause-of-death c in year y = 1983, …, 2021 by

$${G}_{y,s,r,c}^{{\rm{primary}}}={G}_{y,s,r,c}^{{\text{skip gen}}}+{G}_{y,s,r,c}^{{\text{most responsible not sg}}}$$

(14a)

$${G}_{y,s,r,c}^{{\rm{secondary}}}={G}_{y,s,r,c}^{{\text{co-reside not mr}}}.$$

(14b)

ACS did not collect data on whether both grandparents are alive and live with their grandchildren of age 17 or under, and for this reason, we did not adjust equation (14) further for loss of other grandparents. We investigated in sensitivity analyses our assumption that ${\gamma }_{y,s,r}={\sum }_{x}{\gamma }_{y,s,r}^{\,x}$ was approximately constant from 1983 to 2010 using longitudinal United Nations Population Division data on Households and Living Arrangements of Older Persons for the United States, which suggested that the proportion of older persons who live with children or who are the primary caregivers of children has in the United States remained fairly constant since 1990 (see below). To estimate the minimum number of children who experienced grandparent caregiver death in their lifetime, we additionally need disaggregations of equation (14) by single year of age b = 0, …, 17 years. For the central analysis, we assumed that the age composition of ${G}_{y,s,r,c}^{x}$ is the same as the age composition of children who lost parents older than 30 years

$${G}_{y,s,r,b,c}^{x}={G}_{y,s,r,c}^{x}\times \frac{{\sum }_{{a}^{{\prime} }\ge 30}{O}_{{a}^{{\prime} },s,r,b,c}^{{\rm{new}}}}{\mathop{\sum }\nolimits_{b = 0}^{17}{\sum }_{{a}^{{\prime} }\ge 30}{O}_{{a}^{{\prime} },s,r,b,c}^{{\rm{new}}}},$$

(15)

where x represents primary or secondary grandparent caregiver loss and ${O}_{{a}^{{\prime} },s,r,b,c}^{{\rm{new}}}$ are obtained from equation (7) by summing over the years y = 2000, …, 2021, and c is one of the leading parental causes of death; these age compositions differ across causes of death while they are relatively more stable across standardized race and ethnicity. It is plausible that the age composition of ${G}_{y,s,r,c}^{x}$ may differ from the age composition of children who lost parents older than 30 years, and we explored alternative approaches to equation (15) in sensitivity analyses; none of these had a considerable impact on our overall estimates.

Estimating national-level caregiver loss, 1983–2021

To estimate the total number of children experiencing caregiver loss defined as either orphanhood or grandparent caregiver loss, we finally sought to subtract from equation (14) those grandchildren who previously experienced orphanhood or who experienced the death of their mother or father in the same year. For the proportion p^{both parents present} of grandchildren who co-resided with their grandparent and both parents at the start of year y, we assumed that either or both of the parents may have died in the remainder of the year after the ACS survey. For the latter, we considered age-, sex- and race and ethnicity-specific mortality rates and aggregated these using parent age compositions as weights to obtain the mortality rate ${h}_{y,s,r}^{{\rm{parent}}}$ of a parent of sex s and race and ethnicity r in year y. Then, we subtracted from grandchildren who co-resided with their grandparent and both parents at the start of year y the proportion $({h}_{y,M,r}^{{\rm{parent}}}+{h}_{y,F,r}^{{\rm{parent}}}-{h}_{y,M,r}^{{\rm{parent}}}{h}_{y,F,r}^{{\rm{parent}}})\times 6/12$ that we expected to additionally experience the loss of one or both of their parents in the 6 months on average after the ACS survey. For the proportion 1 − p^{both parents present} of grandchildren who co-resided with their grandparent and one parent at the start of year y, we assumed that in a proportion p^{other parent died} the other parent had died previously and removed these children experiencing grandparent caregiver loss from the caregiver loss count. In the remaining proportion, we assumed again that either or both of the parents may have died in the remainder of the year after the ACS survey. Finally, for grandchildren who lived in skip generation households at the start of year y, we assumed a proportion p^{skip gen parent died} previously lost either their mother, their father or both, and removed these children experiencing grandparent caregiver loss from the caregiver loss count. In the remaining proportion, we assumed again that either or both of the parents may have died in the remainder of the year after the ACS survey. We thus obtained for each year y = 1983, …, 2021 the estimated number of US children aged 0–17 years newly experiencing the loss of a caregiver of sex s and race and ethnicity r due to caregiver-loss cause-of-death c through

$${L}_{y,s,r,c}^{{\rm{new}}}={O}_{y,s,r,c}^{{\rm{new}}}+{G}_{y,s,r,c}^{{\text{de-dup}}}$$

(16a)

$${G}_{y,s,r,c}^{\text{de-dup}}={G}_{y,s,r,c}^{{\text{skip gen}}}\times (1-{p}^{{\text{skip gen parent died}}})\times$$

(16b)

$$\left(1-\left({h}_{y,M,r}^{{\rm{parent}}}+{h}_{y,F,r}^{{\rm{parent}}}-{h}_{y,M,r}^{{\rm{parent}}}{h}_{y,F,r}^{{\rm{parent}}}\right)\frac{6}{12}\right)$$

(16c)

$$+\left[{G}_{y,s,r,c}^{{\text{most responsible not sg}}}+{G}_{y,s,r,c}^{{\text{co-reside not mr}}}\right]\times$$

(16d)

$$\left({p}^{{\text{both parents present}}}+(1-{p}^{{\text{both parents present}}})\left(1-{p}^{{\text{other parent died}}}\right)\right)\times$$

(16e)

$$\left(1-\left({h}_{y,M,r}^{{\rm{parent}}}+{h}_{y,F,r}^{{\rm{parent}}}-{h}_{y,M,r}^{{\rm{parent}}}{h}_{y,F,r}^{{\rm{parent}}}\right)\frac{6}{12}\right).$$

(16f)

In equation (16), we specified p^{skip gen parent died} = 11% and p^{other parent died} = 11% based on US data indicating that the large majority of grandparent caregivers provide care owing to child maltreatment, and/or parents experiencing substance misuse or incarceration^31,37. We set p^{both parents present} = 70% based on UN data on US household composition and living arrangements of persons aged 60 or over (https://www.un.org/development/desa/pd/data/living-arrangements-older-persons), and further supported by ref. ³¹.

To estimate caregiver-loss prevalence, we noted that the grandparent caregiver-loss estimates are derived from cross-sectional data, and we therefore summed the orphanhood prevalence estimates in equation (8) and the de-duplicated annual grandparent-caregiver-loss contributions in equation (16d)–(16f) conditional of survival of the children

$${L}_{y,s,r,b,c}^{{\rm{lifetime}}}={O}_{y,s,r,b,c}^{{\rm{lifetime}}}+\sum_{i = 0}^{b}\left({G}_{y-i,s,r,b-i,c}^{{\text{de-dup}}}\times \mathop{\prod }\limits_{j=0}^{i}(1-_{1}{h}_{y-j,r,b-j})\right),$$

(17)

where the age-specific, de-duplicated annual grandparent-caregiver-loss contributions are calculated analogously to equation (15).

Uncertainty quantification in national estimates

To capture uncertainty, we followed a similar phenomenological approach as in ref. ²⁰ and added Poisson noise around mortality, natality and population size data for all year, sex, age, standardized race and ethnicity, and cause-of-death strata that was co-monotonized across years to maximize uncertainty due to temporal autocorrelations⁸⁰. We generated 1,000 Poisson noise random variables around each live birth, death and population size count by year, sex, age, race and ethnicity and then ranked the Poisson counts by size. For uncertainty in mortality data, we additionally considered uncertainty related to the harmonization of cause-of-death counts from 1983 to 1999 to the ICD-10 classification to avoid discontinuities in orphanhood estimates between 1998 and 1999. Specifically, we resampled the comparability ratios 1,000 times, assuming these were normally distributed with the standard deviations reported in Table 1 of ref. ⁷² where available, and multiplied these ratios with the Poisson noise mortality data. Then, we repeated orphanhood incidence and prevalence estimates using the randomized live birth, deaths and population size data, and calculated 2.5% and 97.5% quantiles to generate 95% UIs around median estimates.

For grandparent caregiver loss, we accounted for uncertainty in the estimated number of adults aged 30+ years living with grandchildren and in the attribution to sex, and standardized race and ethnicity groups using uncertainty ranges published by ACS. ACS provide for each of their published estimates margins of error that correspond to 90% confidence intervals. We converted the 90% margins of error into standard deviations and bootstrap re-sampled the available totals and separate proportions 1,000 times assuming normal distributions around their median estimates and the corresponding standard deviations, multiplied the bootstrap resampled totals and proportions, and then divided by the corresponding population sizes. Across years, we found that these sources of uncertainty amounted on average to 95% bootstrap intervals on the order of ±0.81% of the median estimate of the number of Hispanic women aged 30+ years who live with grandchildren, ±8.23% among non-Hispanic American Indian or Alaska Native women, ±1.92% among non-Hispanic Asian women, ±0.95% among non-Hispanic Black women and ±0.68% among non-Hispanic white women, and similarly for men. We similarly generated 1,000 resampled data sets, repeated grandparent-caregiver-loss incidence and prevalence estimates on each data set, and then calculated 2.5% and 97.5% quantiles to generate 95% uncertainty intervals around median estimates.

Estimating state-level orphanhood, 2021

To guide policy at the state level, we further estimated orphanhood and grandparent caregiver loss in each of the 50 US states and District of Columbia. We focused on estimating orphanhood incidence and prevalence in each state in 2021, by the leading parental causes of death only (COVID-19, heart disease, drug overdose, homicide excluding drug overdose, malignant neoplasms, suicide excluding drug overdose, and unintentional injuries excluding drug overdose (Supplementary Table 2) and one leading cause of orphanhood in South Dakota: chronic liver disease and cirrhosis). To obtain state-level prevalence estimates for 2021, we had as before to estimate state-level incidence from 2004, for which state-level fertility rates were required from 1987. Overall, we proceeded analogously as for the national estimations, but obtained data from CDC WONDER (https://wonder.cdc.gov/Deaths-by-Underlying-Cause.html) as state-level mortality data were not publicly available from NCHS from 2005 onward (https://www.cdc.gov/nchs/data_access/vitalstatsonline.htm). Counts below 10 were suppressed in CDC WONDER, and for this reason, we performed all estimations without stratification by standardized race and ethnicity. We adjusted for data suppression and known biases in this approach that are due to correlations in mortality and fertility rates across standardized race and ethnicity. We note these are modeled adjustments and we cannot exclude that this resulted in bias in state-specific orphanhood estimates.

We extracted annual death counts by state, sex, age bands (15–19 years, …, 95–99 years, ≥100 years) and cause of death (ICD-10 113 Selected Causes of Death) from the CDC WONDER mortality portal (https://wonder.cdc.gov/Deaths-by-Underlying-Cause.html) from 2005 to 2021, and combined these data with the previously described, cause-specific NCHS mortality data sets from 2004 for which information on the state of residence of the decedent was publicly available. For the leading parental causes of death, the average discrepancy in the state-, sex-, age- and cause-of-death-specific data from CDC WONDER was, when aggregated to the national level and compared with the corresponding NCHS mortality data, 22.9% among women and 13.5% among men. As a first step, we imputed suppressed entries with a value of 2, which reduced average discrepancies to 6.80% among women and 2.53% among men. To account for these discrepancies, as a second step, we adjusted the state-level mortality counts according to

$${D}_{y,{a}^{{\prime} },s,l,c}^{* }={\eta }_{y,{a}^{{\prime} },s,c}\times {D}_{y,{a}^{{\prime} },s,l,c}$$

(18a)

$${\eta }_{y,{a}^{{\prime} },s,c}={D}_{y,{a}^{{\prime} },s,c} \Big/ \sum _{l}{D}_{y,{a}^{{\prime} },s,l,c},$$

(18b)

where y is the period 2005 to 2021 and ${a}^{{\prime} }$ denotes the age band of the parent, s the sex of the parent, c the leading parental cause-of-death of the parent, ${D}_{y,{a}^{{\prime} },s,c}$ the national-level deaths derived from the line-list NCHS data in equation (3), and ${D}_{y,{a}^{{\prime} },s,l,c}$ the state-specific deaths derived from CDC WONDER for each of the 50 US states and District of Columbia, indexed by l. The suppression-adjusted, cause-specific, state-level mortality counts matched the cause-specific, national-level mortality counts well, except when all deaths ${D}_{y,{a}^{{\prime} },s,c,l}$ were entirely suppressed across all states, which was the case for deaths due to homicide in women aged 55 years or older (Extended Data Fig. 7). Overall, these suppression adjustments did not noticeably change the contribution of causes of death to mortality, because the majority of death counts were not suppressed when the data were aggregated to leading parental causes of death.

We next extracted live birth counts for mothers by state and age bands (15–19, …, 44–49 years) from the CDC WONDER natality portal (https://wonder.cdc.gov/natality.html) from 2005 to 2021, and combined these data with the previously described NCHS live birth data sets from 1987 to 2004 for which information on the state of residence of mothers was available. We proceeded analogously for fathers using the age bands 15–19, …, 44–49, 50–54, 55+ years from 2016 to 2021, as natality records were not available by demographic characteristics of fathers from 2005 to 2015. Data suppression was not an issue for the stratifications required to estimate orphanhood: for 2005–2021, the average discrepancy in the CDC WONDER and NCHS natality data sets was 0.026% for mothers and 0.032% for fathers, compared with male NCHS live birth data with ages up to 77 years. To compute fertility rates at the state level, we further extracted age- and sex-stratified annual population size estimates for each state from 1987 to 1989 from the National Cancer Institute Surveillance, Epidemiology, and End Results database (https://population.un.org/wpp/Download/Standard/Mortality/), and CDC WONDER from 1990 to 2021 (https://wonder.cdc.gov/wonder/help/bridged-race.html#About%201990-2020, https://wonder.cdc.gov/single-race-population.html). State-level fertility rates were calculated as in equation (1) when live birth counts were available, but without stratification by race and ethnicity. For mothers, live birth counts were not publicly available for women aged 44–49 years in several years and 8 states (Alaska, Delaware, Montana, North Dakota, South Dakota, Vermont, West Virginia and Wyoming), and in these cases, we interpolated state-specific fertility rates with locally estimated scatterplot smoothing as implemented in the R stat package version 4.2.3 with span argument 0.85. For Wyoming, no data were publicly available after 2019, and so interpolation was not possible and we used 2019 values. For fathers, live birth counts were not publicly available from 2005 to 2015, and for these years, we estimated state-specific fertility rates with locally estimated scatterplot smoothing with span argument 0.85, and using NCHS data from 2000 to 2004 and CDC WONDER data from 2016 to 2020.

We then estimated state-specific incidence and prevalence of orphanhood from the suppression-adjusted state-level, cause-specific mortality counts and partially imputed fertility rates as outlined in equations (7)–(11), with one modification. As described above, the state-level mortality and fertility data were not disaggregated by race and ethnicity. To account for potential bias arising from correlations in mortality and fertility rates across race and ethnicity, we compared the national-level estimates of the number of children who newly experienced orphanhood in equation (9) to the state-specific estimates with the correction factors

$${\nu }_{y,s,c}=\sum _{r,{a}^{{\prime} },b}{O}_{y,{a}^{{\prime} },s,r,b,c}^{{\rm{new}}}\left/\sum _{l,{a}^{{\prime} },b}{O}_{y,{a}^{{\prime} },s,l,b,c}^{{\rm{new}}}.\right.$$

(19)

Overall, the correction factors ν_y,s,c tended to be close to one, except for ‘homicide excluding drug overdose’ in women and ‘other’ caregiver-loss causes of death (Extended Data Fig. 7). We then adjusted the state-specific estimates as follows:

$${O}_{y,{a}^{{\prime} },s,l,b,c}^{{\rm{new}}* }={\nu }_{y,s,c}\times {O}_{y,{a}^{{\prime} },s,l,b,c}^{{\rm{new}}},$$

(20)

and used equation (20) to calculate the state-specific analogs to equations (8)–(11). The resulting state-level orphanhood incidence estimates matched the national-level estimates with discrepancies of up to 0.5%. Exactly matching estimates could have been obtained with age-specific correction factors, but we felt this would result in a false sense of accuracy in the state-level orphanhood incidence estimates. Our correction factors ensure only that our state-level estimates sum to a total close to our national-level estimates, and it is possible that the true state-level orphanhood counts could differ from our estimates.

Estimating state-level grandparent caregiver loss, 2021

We estimated the number of children who newly experienced grandparent caregiver death in year y, y = 2004, …, 2021 from the state-level mortality records in year y of US residents aged 30 years and above (30+), and assuming that each decedent who is estimated to have been a grandparent caregiver in each state leaves a minimum of one grandchild behind (corresponding to equation (2)). Overall, we proceeded as for the national-level estimation of grandparent caregiver loss, as ACS also published state-specific data on adults aged 30+ years who live with grandchildren of age 17 or under (https://data.census.gov/cedsci/table?tid=ACSST5Y2019.S1002). For each year y = 2010, …, 2021, we calculated the expected number of adults aged 30+ years who live with their grandchildren of age 17 or under by multiplying the corresponding total co-resident numbers with the sex-specific proportions for each state, and then divided these with corresponding population sizes to obtain the proportions ${\gamma }_{y,s,l}^{\,x}$ corresponding to equation (12). Overall, these proportions were considerably more uncertain than in the national-level analysis owing to smaller sample sizes. For y = 2004, …, 2009, we assumed that ${\gamma }_{y,s,l}^{\,x}$ is the same as in 2010. We estimated the minimum number of grandchildren who newly experienced primary or secondary grandparent caregiver death in year y = 2004, …, 2021, with a US-resident grandparent caregiver aged 30+ years, sex s and state category l who died of leading parental cause-of-death c by

$${G}_{y,s,l,c}^{{\rm{primary}}}={G}_{y,s,l,c}^{{\text{skip gen}}}+{G}_{y,s,l,c}^{{\text{most responsible not sg}}}$$

(21a)

$${G}_{y,s,l,c}^{{\rm{secondary}}}={G}_{y,s,l,c}^{{\text{co-reside not mr}}}.$$

(21b)

To estimate the minimum number of children who experienced grandparent caregiver death in their lifetime in each state, we additionally need disaggregations of equation (21) by single year of age b = 0, …, 17 years. For the central analysis, we assumed that the age composition of ${G}_{y,s,l,c}^{x}$ is the same as the age composition of children who lost parents older than 30 years in the same state

$${G}_{y,s,l,b,c}^{x}={G}_{y,s,l,c}^{x}\times \frac{{\sum }_{{a}^{{\prime} }\ge 30}{O}_{{a}^{{\prime} },s,l,b,c}^{{\rm{new}}* }}{\mathop{\sum }\nolimits_{b = 0}^{17}{\sum }_{{a}^{{\prime} }\ge 30}{O}_{{a}^{{\prime} },s,l,b,c}^{{\rm{new}}* }},$$

(22)

where ${O}_{{a}^{{\prime} },s,l,b,c}^{{\rm{new}}* }$ are obtained from equation (20) by summing over the years y = 2004, …, 2021, and c is one of the leading parental causes of death.

Uncertainty quantification in state-level estimates

As for uncertainty quantification at the national level, we added co-monotonized Poisson noise around state-specific mortality, natality and population size data for all year, sex, age and cause-of-death strata. We generated 1,000 Poisson noise random variables around each live birth, death and population size count by year, state, sex, age, race and ethnicity and then ranked the Poisson counts by size. We then repeated orphanhood incidence and prevalence estimates using the randomized data, and then calculated 2.5% and 97.5% quantiles to generate 95% uncertainty intervals around median estimates.

To estimate grandparent caregiver loss, we accounted for uncertainty in the estimated number of adults aged 30+ years living with grandchildren by US state using uncertainty ranges published by ACS and as described above for national-level uncertainty quantification. As expected from smaller sample sizes at the state level across years, we found that uncertainty in the proportions ${\gamma }_{y,s,l}={\sum }_{x}{\gamma }_{y,s,l}^{\,x}$ was in some states considerably larger than for the national analysis, with 95% bootstrap intervals of up to ±13.6% of the median estimates. We similarly generated 1,000 resampled data sets, repeated grandparent-caregiver-loss incidence and prevalence estimates on each data set, and then calculated 2.5% and 97.5% quantiles to generate 95% uncertainty intervals around median estimates.

Estimating racial and ethnic groups impacted by leading causes of caregiver loss in US states, 2021

Characterizing the racial and ethnic groups of children that are impacted by orphanhood and grandparent caregiver loss at the state level is challenging owing to limits in publicly available data. We focused on estimating and comparing 2021 orphanhood and grandparent-caregiver -loss prevalence rates by standardized race and ethnicity in the 10 US states with the highest orphanhood prevalence rates and for the primary (first-ranked) parental cause of death only: West Virginia (‘drug overdose’), New Mexico (‘drug overdose’), Mississippi (‘unintentional injuries excluding drug overdose’), Louisiana (‘drug overdose’), Kentucky (‘drug overdose’), Tennessee (‘drug overdose’), Alabama (‘heart disease’), Oklahoma (‘unintentional injuries excluding drug overdose’), Ohio (‘drug overdose’) and Florida (‘drug overdose’). Throughout, we obtained publicly available data from CDC WONDER (https://wonder.cdc.gov/Deaths-by-Underlying-Cause.html) from 2005 onward (https://www.cdc.gov/nchs/data_access/vitalstatsonline.htm). We adjusted for data suppression, but note that these are modeled adjustments and we cannot exclude that this resulted in bias in state-, race- and ethnicity-specific orphanhood estimates.

Annualized live birth counts were extracted as described for the state-level analysis for 2005 to 2019, but now also stratified by bridged-race and Hispanic origins, and then combined with NCHS live birth data from 1987 to 2004. For 2020–2021, live birth counts to women were not available by bridged-race at the state level. Analogously, annual death counts for the leading parental causes of death were compiled as described above for 2005 to 2021, stratified into the standardized race and ethnicity categories in Supplementary Table 6, and then combined with the NCHS mortality data sets from 2004. Suppressed values were imputed by 1 and adjusted further for suppression as in equation (18). We followed previous criteria on data reliability and excluded from consideration for each state those race and ethnicity groups that had more than two age bands with fewer than 20 live birth counts⁸¹. Supplementary Table 15 lists the corresponding strata as ‘small populations’ and also describes the average completeness of live birth records for the years 1995–2004 when directly comparable line-list live birth records were also available from NCHS. For the year, state, sex, race and ethnicity strata that were not excluded, we considered the corresponding mortality data for the leading caregiver-loss cause of death in each state. Again, we excluded from consideration those standardized race and ethnicity groups that had more than 2 age bands with fewer than 20 death counts⁸¹. Supplementary Table 15 lists the corresponding strata as ‘small death counts’ and also describes the average completeness of death records for 1999–2004 when directly comparable line-list death records were also available from NCHS. Extended Data Fig. 9a,b illustrates these data completeness evaluations on data from New Mexico for women. Population size estimates for each state were obtained from the CDC WONDER population size portal for 1990 to 2021 (https://wonder.cdc.gov/Bridged-Race-v2020.HTML, https://wonder.cdc.gov/single-race-population.html) by bridged race and converted to the standardized race and ethnicity categories described in Supplementary Table 8. We then estimated state-, race- and ethnicity-specific incidence and prevalence of orphanhood as outlined in equations (7)–(11), except that female state-, race- and ethnicity-specific fertility rates in 2020–2021 were assumed to be as in 2019 owing to limitations in publicly available data. To investigate estimation accuracy, we compared the resulting sum of the state-, race- and ethnicity-specific orphanhood incidence estimates to the previous state-specific orphanhood incidence estimates (Extended Data Fig. 9c). For Alaska and Oklahoma, the sum of the race and ethnicity- and state-specific orphanhood incidence estimates was more than 20% below the state-specific orphanhood incidence estimates attributable to the leading parental cause of death; Supplementary Table 15 lists these states as ‘large discrepancy in estimates’. The table then reports, for all remaining states, 2021 state-, race- and ethnicity-specific orphanhood prevalence rate estimates that are attributable to the leading parental cause of death.

Sensitivity analyses

Sensitivity in mortality counts to mortality data processing

In the central analysis, we derived mortality counts by year, sex, age band, standardized race and ethnicity, and caregiver-loss causes of death from NCHS line-list mortality records from 1983 to 2021. Aggregate mortality counts are also available from CDC WONDER (https://wonder.cdc.gov/Deaths-by-Underlying-Cause.html) from 1999 to 2021. We extracted data from the NCHS Vital Statistics portal because this allowed us to use the same data source across all years, bypass data suppression and incorporate uncertainties in standardized race and ethnicity reporting. The CDC WONDER mortality counts allowed us to check our in-house data aggregations, if we obtain data from CDC WONDER at coarser population strata. For this purpose, we extracted annual death counts by sex and age bands (15–19 years, …, 95–99 years, ≥100 years) from the CDC WONDER mortality portal (https://wonder.cdc.gov/Deaths-by-Underlying-Cause.html) from 2000 to 2021. The overall mortality counts that we aggregated from the NCHS line-list data were identical to the CDC WONDER mortality data without cause-of-death stratification. Secondly, we extracted annual death counts by sex, age bands (15–19 years, …, 95–99 years, ≥100 years) and cause of death (ICD-10 113 Selected Causes of Death) from the CDC WONDER mortality portal (https://wonder.cdc.gov/Deaths-by-Underlying-Cause.html) from 2000 to 2021, without stratification by race and ethnicity. We then compared the mortality counts from the two data sources for each year and the leading parental causes of death. We found that across years, the maximum discrepancy in the CDC WONDER counts relative to the NCHS mortality counts was 0.0068% in women and 0.0057% in men, which reflected data suppression especially due to homicide excluding drug overdose in women, and overall indicated consistency of our data with that from CDC WONDER (Supplementary Table 16).

Sensitivity in live birth counts to live birth data processing

In the central analysis, we derived live birth counts by year, sex, age band, and standardized race and ethnicity from NCHS line-list natality records from 1969 to 2021. Aggregate live birth counts are also available from CDC WONDER (https://wonder.cdc.gov/natality.html) from 1995 to 2021. We chose to extract data from the NCHS Vital Statistics portal because this allowed us to use the same data source across all years. The CDC WONDER live birth counts allowed us to check our in-house data aggregations. For this purpose, we extracted annual live birth counts from the CDC WONDER natality portal for women by age bands 15–19 years, …, 45–49 years from 2000 to 2021, and separately for men by age bands 15–19 years, …, 55+ years from 2016 to 2021 without stratification by race and ethnicity, which avoids data suppression. We then compared the live birth counts from the two data sources for each year and found that both data sets matched across all years.

Sensitivity in national-level orphanhood estimates to assumptions on historic fertility rates

In the central analysis, we assumed that male and female fertility rates by age and standardized race and ethnicity were constant in 1966–1989 and in 1990 (Extended Data Fig. 4). This assumption was made because population size data stratified by more than three race categories were not publicly available before 1990. Considering that age- and sex-specific live birth data were available by race and ethnicity since 1978, we estimated in this sensitivity analysis the historic composition of population sizes by race and ethnicity in 1980–1989, and then updated the corresponding fertility rates and national-level orphanhood incidence and prevalence estimates. More specifically, we considered time trends in the proportion of each race and ethnicity in each 5 year age band and both sexes in the US intercensal population size estimates in 1990 to 1995, and extrapolated these with linear models backward in time to 1980–1989. We do not think that this estimation approach resulted in accurate estimates of population sizes, but rather that this approach conveys possible sensitivities in our orphanhood estimates. We identified minor sensitivities to national orphanhood prevalence estimates up to 2007, with no impact on incidence and prevalence estimates for recent years (Extended Data Fig. 10a).

Sensitivity in national-level orphanhood estimates to potentially correlated fertility rates

In the central analysis, we assumed that population-level fertility rates were not correlated with population-level mortality rates. We considered in sensitivity analyses possible deviations from this assumption that might lead to upward bias in our central estimates. For example, a person who died in year y may have experienced poor health in the preceding years y − 1, y − 2, … and in this case may also have been less likely to mother or father a child in years y, y − 1, y − 2, …. We modeled this possibility through four scenarios of dampened fertility rates close to individual death events. Specifically, we considered cumulative logistic adjustment factors that dampened fertility rates to either zero or half of the corresponding population-level-, year-, age-, sex- and standardized race and ethnicity-specific fertility rates in the year of death y. We also considered sensitivity analyses so that the onset of lower fertility rates preceded the year of death y by 1 or 3 years (Extended Data Fig. 10a–c). We found that orphanhood incidence estimates were up to 8.4% lower than in the central analysis, and orphanhood prevalence estimates were up to 15% lower than in the central analysis (Extended Data Fig. 10a–c). It is also possible that our central orphanhood estimates might be biased downward. For example, women experiencing premature death are more likely economically disadvantaged and in turn are more likely to have had reduced access to contraceptives and corresponding higher fertility rates⁸², or earlier sexual debut resulting in higher cumulative fertility until death. It is also possible that overall, at the population level, there is no strong correlation between fertility and mortality rates especially as the lag between birth and death events is typically relatively large, and in the absence of data, we have opted for this middle approach in the central analysis.

Sensitivity in national-level grandparent-caregiver-loss estimates to assumptions on the age of children experiencing loss of a grandparent caregiver

In the central analysis, we considered the age composition of children experiencing orphanhood in equation (15), as a proxy to the age composition of children experiencing grandparent caregiver loss. Calculations were done independently for each leading parental cause of death. We performed two analyses to characterize the sensitivity of this approach to national-level grandparent-caregiver-loss estimates. First, we repeated calculations using as proxy the age composition of children experiencing orphanhood across all causes of caregiver loss, that is

$${G}_{y,s,r,b,c}^{{\rm{new}}}={G}_{y,s,r,c}^{{\rm{new}}}\times \frac{{\sum }_{c,{a}^{{\prime} }\ge 30}{O}_{{a}^{{\prime} },s,r,b,c}^{{\rm{new}}* }}{\mathop{\sum }\nolimits_{c,b = 0}^{17}{\sum }_{{a}^{{\prime} }\ge 30}{O}_{{a}^{{\prime} },s,r,b,c}^{{\rm{new}}* }}.$$

(23)

Second, we used data from the ‘Topical survey’ of the National Survey of Children’s Health (https://www.census.gov/data/tables/time-series/demo/families/children.html), filtered to respondents reporting to be grandparents and living with at least one child in the household. Respondents were asked to report on demographic characteristics including the age and race and ethnicity of one randomly selected child. We pooled these data across 6 years, from 2016 to 2021, to characterize the age composition of children who live with grandparents due to small sample sizes in each survey after filtering (approximately 2,500 grandparents per survey round). In the sensitivity analyses, we found that age-specific grandparent caregiver incidence estimates deviated by up to ±26.8% and ±44.4% of the central estimate across years; however, as parental death contributes more to caregiver loss, age-specific caregiver-loss incidence estimates deviated by up to ±6.3% and ±10.0% of the central estimate across years (Extended Data Fig. 10d–e).

Sensitivity in national-level caregiver-loss estimates to assumptions on historic numbers of grandparent caregivers

In the central analysis, we assumed that the proportion of adults aged 30+ years who live with their grandchildren of age 17 or under (denoted γ_y,s,r) was the same in the years y = 1983, …, 2009 as in 2010. This assumption was made because data were not available before 2010. To investigate the implications of this assumption, we considered longitudinal United Nations Population Division data on Households and Living Arrangements of Older Persons for the United States since 1960 (https://www.un.org/development/desa/pd/data/living-arrangements-older-persons). These data indicate that the proportion of older persons who live with children or who are the primary caregivers of children has in the United States remained fairly constant since 1980 and, for this reason, suggests that our assumptions on the historic number of grandparent caregivers are unlikely to have a substantive impact on caregiver-loss estimates.

Ethical approval

This study used publicly available data and was deemed exempt from institutional board review by the CDC.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All data used to calculate estimates of mortality and orphanhood are publicly available via Zenodo at https://doi.org/10.5281/zenodo.11423744 (ref. ⁸³). Mortality and natality data were sourced from the NCHS (1983–2021 and 1968–2021, respectively) (https://www.cdc.gov/nchs/data_access/vitalstatsonline.htm) with the utils (of version 4.2.3) package in R. Mortality and natality data after 2005 at the state level were sourced from CDC WONDER (mortality data 2005–2021: https://wonder.cdc.gov/Deaths-by-Underlying-Cause.html; natality data 2005–2021: https://wonder.cdc.gov/natality.html). Population data from 1969 to 1989 were sourced from https://seer.cancer.gov/popdata/singleages.html. Population data from 1990 to 2020 and 2021 were sourced from CDC WONDER (https://wonder.cdc.gov/bridged-race-population.html; https://wonder.cdc.gov/single-race-population.html, respectively). Child mortality data were sourced from United Nations (https://population.un.org/wpp/Download/Standard/Mortality/). Household data from 2010 to 2021 were sourced from American Community Survey (for example, year 2019: https://data.census.gov/table/ACSST5Y2019.S1002).

Code availability

Code and instructions to reproduce all analyses are freely available via Github version 1.1.2 under the GNU General Public License version 3.0 at https://github.com/MLGlobalHealth/orphanhood-caregiver-death-in-US-from-all-causes-of-mortality.

References

Sherr, L. et al. A systematic review on the meaning of the concept ‘AIDS Orphan’: confusion over definitions and implications for care. AIDS Care 20, 527–536 (2008).
Article PubMed Google Scholar
Unwin, H. et al. Global, regional, and national minimum estimates of children affected by COVID-19-associated orphanhood and caregiver death, by age and family circumstance up to Oct 31, 2021: an updated modelling study. Lancet Child Adolesc. Health 6, 249–259 (2022).
Article CAS PubMed PubMed Central Google Scholar
California Surgeon General’s Clinical Advisory Committee. Adverse Childhood Experience Questionnaire For Adults (Aces Aware, 2020; accessed 23 November 2023); https://www.acesaware.org/wp-content/uploads/2022/07/ACE-Questionnaire-for-Adults-Identified-English-rev.7.26.22.pdf
Kessler, R. et al. Childhood adversities and adult psychopathology in the WHO World Mental Health Surveys. Br. J. Psychiatry 197, 378–385 (2010).
Article PubMed PubMed Central Google Scholar
Atwoli, L. et al. Posttraumatic stress disorder associated with unexpected death of a loved one: cross-national findings from the world mental health surveys. Depress. Anxiety 34, 315–326 (2017).
Article PubMed Google Scholar
Children’s Collaborative. Utah Children’s Collaborative. https://www.childrenscollaborative.us/utah (2023).
Farella Guzzo, M. & Gobbi, G. Parental death during adolescence: a review of the literature. Omega 87, 1207–1237 (2023).
Article PubMed Google Scholar
Kentor, R. & Kaplow, J. Supporting children and adolescents following parental bereavement: guidance for health-care professionals. Lancet Child Adolesc. Health 4, 889–898 (2020).
Article PubMed Google Scholar
Gibbs, D., Villodas, M., Kainz, K. & Francis, A. Insecure housing, substance abuse, and incarceration among emerging adults aging out of foster care: examining associations with legal orphan status. Child. Youth Serv. Rev. 145, 106805 (2023).
Article Google Scholar
Black, R. et al. Health and development from preconception to 20 years of age and human capital. Lancet 399, 1730–1740 (2022).
Article PubMed PubMed Central Google Scholar
Alderman, H. No Small Matter: The Impact of Poverty, Shocks, and Human Capital Investments in Early Childhood Development (World Bank Publications, 2011).
US Centers for Disease Control and Prevention, CDC WONDER. Underlying Cause of Death, 1999–2020 (accessed 16 June 2023); https://wonder.cdc.gov/ucd-icd10.html
Imperial College London (on behalf of its COVID-19 Response Team). Global Orphanhood Estimates Real Time Calculator (2021; accessed 6 May 2022); https://imperialcollegelondon.github.io/orphanhood_calculator/#/country/Global
Guida, F. et al. Global and regional estimates of orphans attributed to maternal cancer mortality in 2020. Nat. Med. 28, 2563–2572 (2022).
Article CAS PubMed PubMed Central Google Scholar
Jones, C. M. et al. Estimated number of children who lost a parent to drug overdose in the US from 2011 to 2021. JAMA Psychiatry 81, 789–796 (2024).
Article PubMed Google Scholar
Schluter, B. S., Alburez-Gutierrez, D., Bibbins-Domingo, K., Alexander, M. J. & Kiang, M. V. Youth experiencing parental death due to drug poisoning and firearm violence in the US, 1999–2020. JAMA 331, 1741–1747 (2024).
Article PubMed PubMed Central Google Scholar
Camacho, S. & Clark Henderson, S. The social determinants of adverse childhood experiences: an intersectional analysis of place, access to resources, and compounding effects. Int. J. Environ. Res. Public Health 19, 10670 (2022).
Article PubMed PubMed Central Google Scholar
Karatekin, C. et al. Adverse childhood experiences: a scoping review of measures and methods. Child. Youth Serv. Rev. 136, 106425 (2022).
Article Google Scholar
Ahmad, F. et al. Provisional mortality data—United States, 2021. MMWR Morb. Mortal. Wkly. Rep. 71, 597–600 (2022).
Hillis, S. et al. COVID-19-associated orphanhood and caregiver death in the United States. Pediatrics 148, e2021053760 (2021).
Article PubMed Google Scholar
US Department of Health and Human Services, Office of the Assistant Secretary for Health. Services and Supports for Longer-Term Impacts of COVID-19 (2022); https://www.covid.gov/assets/files/Services-and-Supports-for-Longer-Term-Impacts-of-COVID-19-08012022.pdf
The White House. National COVID-19 Preparedness Plan (2023; accessed 20 September 2023); https://www.whitehouse.gov/covidplan/
State of California. Hope, Opportunity, Perseverance, and Empowerment (HOPE) for Children Act of 2022 (Senate of California, 2022).
US Centers for Disease Control and Prevention. Global Orphanhood Associated with COVID-19 (2023; accessed 20 September 2023); https://archive.cdc.gov/#/details?, https://www.cdc.gov/globalhealth/covid-19/orphanhood/index.html
World Bank Group. RSR Multiplying Impact at a Glance (2023; accessed 19 June 2023); https://documents1.worldbank.org/curated/en/892081579159887574/pdf/Rapid-Social-Response-Program-Building-Adaptive-Social-Protection-Systems-to-Protect-the-Poor-and-Vulnerable.pdf
Policy brief: The importance of the OVC set aside within PEPFAR. First Focus Campaign for Children https://campaignforchildren.org/resource/the-importance-of-the-ovc-set-aside-within-pepfar/#:~:text=The%2010%20percent%20set%20aside,the%202023%20Reauthorization%20of%20PEPFAR (2023).
USAID. Advancing Protection and Care for Children in Adversity: A U.S. Government Strategy for International Assistance (2023; accessed 20 September 2023); https://www.usaid.gov/sites/default/files/2023-07/apcca-strategy-final-web.pdf
COVID-19 Mental Disorders Collaborators. Global prevalence and burden of depressive and anxiety disorders in 204 countries and territories in 2020 due to the COVID-19 pandemic. Lancet 398, 1700–1712 (2021).
Article Google Scholar
Triggs, A. & Kharas, H. The triple economic shock of COVID-19 and priorities for an emergency G-20 leaders meeting. Brookings Institution https://www.brookings.edu/articles/the-triple-economic-shock-of-covid-19-and-priorities-for-an-emergency-g-20-leaders-meeting/ (2020).
Ahmad, F., Cisewski, J., Xu, J. & Anderson, R. Provisional mortality data—United States, 2022. MMWR Morb. Mortal. Wkly. Rep. 72, 488–492 (2023).
Article PubMed PubMed Central Google Scholar
Pew Research Center. Children Living with or Being Cared for by a Grandparent (2013); https://www.pewresearch.org/social-trends/2013/09/04/children-living-with-or-being-cared-for-by-a-grandparent/
US Census Bureau. American Community Survey and Puerto Rico Community Survey Design and Methodology (US Census Bureau, 2022).
National Center for Health Statistics. Data Access—Vital Statistics Online (2023; accessed 3 August 2023); https://www.cdc.gov/nchs/data_access/vitalstatsonline.htm
US Census Bureau. Explore Census Data (2023; accessed 16 June 2023); https://data.census.gov/
Lee, L. & Fleming, P. Estimated number of children left motherless by AIDS in the United States, 1978–1998. J. Acquir. Immune Defic. Syndr. 34, 231–6 (2003).
Article PubMed Google Scholar
US Centers for Disease Control and Prevention, CDC WONDER. Provisional Mortality Statistics, 2018 Through Last Month Request Form [16/06/2023] (2023; accessed 16 June 2023); https://wonder.cdc.gov/controller/datarequest/D176
Sadruddin, A. F. et al. How do grandparents influence child health and development? A systematic review. Soc. Sci. Med. 239, 112476 (2019).
Article PubMed Google Scholar
Monasch, R. & Boerma, J. T. Orphanhood and childcare patterns in sub-Saharan Africa: an analysis of national surveys from 40 countries. AIDS 18, S55–S65 (2004).
Article PubMed Google Scholar
US Centers for Disease Control and Prevention. Social determinants of health (SDOH). https://www.cdc.gov/about/priorities/why-is-addressing-sdoh-important.html#:~:text=Addressing%20differences%20in%20SDOH%20accelerates,or%20access%20to%20healthcare%20services (2024).
US Census Bureau. Poverty Status in the Past 12 Months (2024; accessed 15 May 2024); https://data.census.gov/table?q=S1701:+POVERTY+STATUS+IN+THE+PAST+12+MONTHS
Osborne, J. C. & Chosewood, L. C. NIOSH responds to the U.S. drug overdose epidemic. New Solut. 31, 307–314 (2021).
US Centers for Disease Control and Prevention. Increase in Fatal Drug Overdoses Across the United States Driven by Synthetic Opioids Before and During the COVID-19 Pandemic (2020; accessed 20 September 2023); https://emergency.cdc.gov/han/2020/han00438.asp
Cartus, A. et al. Forecasted and observed drug overdose deaths in the US during the COVID-19 pandemic in 2020. JAMA Netw. Open 5, e223418 (2022).
Article PubMed PubMed Central Google Scholar
US Centers for Disease Control and Prevention. Overdose Deaths Accelerating During COVID-19 (2023; accessed 20 September 2023); https://archive.cdc.gov/#/details?url=https://www.cdc.gov/media/releases/2020/p1218-overdose-deaths-covid-19.html
Monnig, M. A. et al. Association of substance use with behavioral adherence to Centers for Disease Control and Prevention guidelines for COVID-19 mitigation: cross-sectional web-based survey. JMIR Public Health Surveill. 7, e29319 (2021).
Article PubMed PubMed Central Google Scholar
World Health Organization. INSPIRE: Seven Strategies for Ending Violence Against Children (2016; accessed 9 September 2020); https://www.who.int/publications/i/item/inspire-seven-strategies-for-ending-violence-against-children
US Centers for Disease Control and Prevention. Adverse Childhood Experiences Prevention: Resource for Action. A Compilation of the Best Available Evidence (National Center for Injury Prevention and Control, US Centers for Disease Control and Prevention, 2019).
Hillis, S. et al. Global minimum estimates of children affected by COVID-19-associated orphanhood and deaths of caregivers: a modelling study. Lancet 398, 391–402 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hillis, S. et al. Children: The Hidden Pandemic 2021: A Joint Report of COVID-19-Associated Orphanhood and a Strategy for Action (US Centers for Disease Control and Prevention, 2021; accessed 19 July 2024); https://stacks.cdc.gov/view/cdc/108199
Rutter, M. Protective factors in children’s responses to stress and disadvantage. Ann. Acad. Med. 8, 324–338 (1979).
CAS Google Scholar
Flaxman, S. et al. List child dependents on death certificates. Science 380, 467–467 (2023).
Article CAS PubMed Google Scholar
US Centers for Disease Control and Prevention. CDC Health Advisory: Rising Numbers of Deaths Involving Fentanyl and Fentanyl Analogs, Including Carfentanil, and Increased Usage and Mixing with Non-opioids (Health Alert Network Advisory, 11 July 2023); https://emergency.cdc.gov/han/han00413.asp
US Centers for Disease Control and Prevention. SUDORS Dashboard: Fatal Overdose Data (2023; accessed 23 October 2023); https://www.cdc.gov/drugoverdose/fatal/dashboard/index.html
Newgard, C. D. et al. Trauma in the neighborhood: a geospatial analysis and assessment of social determinants of major injury in North America. Am. J. Public Health 101, 669–677 (2011).
Article PubMed PubMed Central Google Scholar
Alisic, E., Vaughan, C., Morrice, H., Joy, K. & Chavez, K. M. A double taboo? Children bereaved by domestic homicide. Lancet 7, e647 (2022).
Google Scholar
Wang, H. Estimation of total mortality due to COVID-19. Institute for Health Metrics and Evaluation http://www.healthdata.org/special-analysis/estimation-excess-mortality-due-covid-19-and-scalars-reported-covid-19-deaths (2021; accessed 24 June 2021).
Peppin, J. F., Coleman, J. J., Paladini, A. & Varrassi, G. What your death certificate says about you may be wrong: a narrative review on CDC’s efforts to quantify prescription opioid overdose deaths. Cureus 13, e18012 (2021).
PubMed PubMed Central Google Scholar
Ragsdale, S. et al. Immigrants in the United States of America. Adv. Hist. Stud. 2, 167–174 (2013).
Article Google Scholar
Winstanley, E. L. & Stover, A. N. The impact of the opioid epidemic on children and adolescents. Clin. Ther. 41, 1655–1662 (2019).
Article PubMed PubMed Central Google Scholar
N’konzi, J. P. N. et al. Age-specific female and male fertility rate estimates for African countries and implications for all-cause-, AIDS- and COVID-19-associated orphanhood. In Int. Conf. on STIs and AIDs in Africa 2023, 173 (2023); https://www.saafrica.org/pages/wp-content/uploads/2023/10/ICASA-2023-Programme-Booklet-26102023.pdf
Pew Research Center's Social & Demographic Trends Project. Grandparents Living with or Serving as Primary Caregivers for Their Grandchildren 15–19 (Pew Research Center, 2013; accessed 30 June 2024); https://www.pewresearch.org/social-trends/wp-content/uploads/sites/3/2013/09/grandparents_report_final_2013.pdf
US Census Bureau. Presence of Parents for Children Living in the Home of a Grandparent 2014 (2023; accessed 13 August 2023); https://www.census.gov/content/dam/Census/library/visualizations/time-series/demo/families-and-households/ch-7.pdf
Sowden, R., Borgstrom, E. & Selman, L. E. It’s like being in a war with an invisible enemy: a document analysis of bereavement due to COVID-19 in UK newspapers. PLoS ONE 16, e0247904 (2021).
Article CAS PubMed PubMed Central Google Scholar
Ellis, R. R. & Simmons, T. Coresident Grandparents and Their Grandchildren: 2012 (US Census Bureau, 2014); https://www.census.gov/content/dam/Census/library/publications/2014/demo/p20-576.pdf
Rosenberg, H. M. Quality of Death Rates by Race and Hispanic-Origin: A Summary of Current Research, 1999 Vol. 128 (National Center for Health Statistics, 1999).
Ingram, D. D. et al. United States Census 2000 population with bridged race categories. Vital Health Stat. 2 135, 1–55 (2003).
Google Scholar
Heron, M. Comparability of race-specific mortality data based on 1977 versus 1997 reporting standards. Natl Vital Stat. Rep. 70, 1–31 (2021).
PubMed Google Scholar
International Statistical Classification of Diseases and Related Health Problems, Tenth Revision Vol. II, Instruction Manual, 5th edn. (World Health Organization, 2016).
Manual of the International Statistical Classification of Diseases, Injuries, and Causes of Death, Ninth Revision (World Health Organization, 1977).
Instruction Manual Part 9: ICD-9 Cause of Death Lists for Tabulating Mortality Statistics, Effective 1979 (National Center for Health Statistics, 1979).
Instruction Manual Part 9: ICD-10 Cause of Death Lists for Tabulating Mortality Statistics (Updated October 2020 to Include WHO Updates to ICD-10 for Data Year 2020) (National Center for Health Statistics, 2020).
Anderson, R. N., Miniño, A. M., Hoyert, D. L. & Rosenberg, H. M. Comparability of cause of death between ICD-9 and ICD-10: preliminary estimates. Natl Vital Stat. Rep. 49, 1–32 (2001).
PubMed Google Scholar
Public Use Data Tape Documentation. Multiple Cause of Death for ICD-9 1986 Data (US Department of Health and Human Services, 1988); https://data.nber.org/mortality/1986/dt86icd9.pdf
National Center for Injury Prevention and Control (US), Division of Unintentional Injury Prevention, Health Systems and Trauma Systems Branch, Prescription Drug Overdose Team. Prescription Drug Overdose Data & Statistics: Guide to ICD-9-CM and ICD-10 Codes Related to Poisoning and Pain (US Centers for Disease Control and Prevention, 2013); https://stacks.cdc.gov/view/cdc/59394
Curtin, S. C., Tejada-Vera, B., Bastian, B. A. Deaths: Leading Causes for 2020 (National Center for Health Statistics, 2023); https://www.cdc.gov/nchs/data/nvsr/nvsr72/nvsr72-13.pdf
User Guide to the 2006 Natality Public Use File (US Centers for Disease Control and Prevention, 2006); https://data.nber.org/nvss/natality/inputs/pdf/2006/UserGuide2006.pdf
1978 Detail Natality, Standard Micro-data Tape Transcript,tape Contents Documentation (National Center for Health Statistics, 1978); https://data.nber.org/natality/1978/natl1978.pdf
Arias, E. et al. United States Life Tables, 2020 NCHS National Vital Statistics Report Vol. 71 No. 1, 1–64 (2022).
Unwin, H. J. T. et al. Global, regional, and national minimum estimates of children affected by COVID-19-associated orphanhood and caregiver death, by age and family circumstance up to Oct 31, 2021: an updated modelling study. Lancet Child Adolesc. Health 6, 249–259 (2022).
Article CAS PubMed PubMed Central Google Scholar
Dhaene, J., Denuit, M., Goovaerts, M. J., Kaas, R. & Vyncke, D. The concept of comonotonicity in actuarial science and finance: applications. Insur. Math. Econ. 31, 133–161 (2002).
Article Google Scholar
Mathews, T. & Hamilton, B. E. Total rertility rates by state and race and Hispanic origin: United States, 2017. Natl Vital Stat. Rep. 68, 1–11 (2019).
Google Scholar
Price, J. H. et al. Racial/ethnic disparities in chronic diseases of youths and access to health care in the United States. BioMed Res. Int. 2013, 1–12 (2013).
Article Google Scholar
Villaveces, A. et al. Orphanhood and caregiver death among children in the United States by all-cause mortality, 2000–2021. Zenodo https://zenodo.org/records/11423744 (2024).

Download references

Acknowledgements

We thank the Global Reference Group for Children in Crisis, especially C. Desmond, for comments on early versions of this work, and reviewers at the CDC and NCHS, especially R. Anderson, for helpful suggestions. We thank the Imperial College Research Computing Service (https://doi.org/10.14469/hpc/2232) for providing the computational resources needed to perform this study, and Zulip for sponsoring team communications through the Zulip Cloud Standard chat app. This study was supported by the Oak Foundation (to L.C. and L.S.), the UKRI Global Challenges Research Fund (to L.C.), the Moderna Charitable Foundation (to H.J.T.U. and O.R.), the World Health Organization (to S.F.), the Engineering and Physical Sciences Research Council (EPSRC; EP/V002910/2 to S.F.), the EPSRC through the EPSRC Centre for Doctoral Training in Modern Statistics and Statistical Machine Learning at Imperial College London and Oxford University (EP/S023151/1 to A. Gandy) and the Imperial College London President’s PhD Scholarship fund (to Y.C.), Imperial College London Undergraduate Research Bursaries (to L.G. and V.K.-M.) and London Mathematical Society Undergraduate Research Bursary (URB-2023-86 to D.W.). The funders had no role in the study design, the data collection and analysis, the decision to publish or the preparation of the paper. The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the CDC.

Author information

These authors contributed equally: Andrés Villaveces, Yu Chen, Susan Hillis, Oliver Ratmann.

Authors and Affiliations

National Center for Injury Prevention and Control, Centers for Disease Control and Prevention, Atlanta, GA, USA
Andrés Villaveces, Jan L. Losby, Rita Noonan, Francis Annor & Greta Massetti
Department of Mathematics, Imperial College London, London, UK
Yu Chen, Sydney Tucker, Alexandra Blenkinsop, Linden Graves, Victor Kojey-Merle, Douhan Wang, Susan Hillis & Oliver Ratmann
Centre for Evidence-Based Social Intervention, Department of Social Policy and Intervention, University of Oxford, Oxford, UK
Sydney Tucker, Lucie Cluver & Susan Hillis
Department of Psychiatry and Mental Health, University of Cape Town, Cape Town, South Africa
Lucie Cluver
Institute of Global Health, University College London, London, UK
Lorraine Sherr
Gender Group, World Bank, Washington DC, USA
Laura Rawlings
Harvard Medical School and Boston Children’s Hospital, Harvard University, Boston, MA, USA
Charles A. Nelson
School of Mathematics, University of Bristol, Bristol, UK
H. Juliette T. Unwin
Department of Computer Science, University of Oxford, Oxford, UK
Seth Flaxman
Global Reference Group for Children Affected by Crisis, University of Oxford, Oxford, UK
Susan Hillis

Authors

Andrés Villaveces
View author publications
Search author on:PubMed Google Scholar
Yu Chen
View author publications
Search author on:PubMed Google Scholar
Sydney Tucker
View author publications
Search author on:PubMed Google Scholar
Alexandra Blenkinsop
View author publications
Search author on:PubMed Google Scholar
Lucie Cluver
View author publications
Search author on:PubMed Google Scholar
Lorraine Sherr
View author publications
Search author on:PubMed Google Scholar
Jan L. Losby
View author publications
Search author on:PubMed Google Scholar
Linden Graves
View author publications
Search author on:PubMed Google Scholar
Rita Noonan
View author publications
Search author on:PubMed Google Scholar
Francis Annor
View author publications
Search author on:PubMed Google Scholar
Victor Kojey-Merle
View author publications
Search author on:PubMed Google Scholar
Douhan Wang
View author publications
Search author on:PubMed Google Scholar
Greta Massetti
View author publications
Search author on:PubMed Google Scholar
Laura Rawlings
View author publications
Search author on:PubMed Google Scholar
Charles A. Nelson
View author publications
Search author on:PubMed Google Scholar
H. Juliette T. Unwin
View author publications
Search author on:PubMed Google Scholar
Seth Flaxman
View author publications
Search author on:PubMed Google Scholar
Susan Hillis
View author publications
Search author on:PubMed Google Scholar
Oliver Ratmann
View author publications
Search author on:PubMed Google Scholar

Contributions

A.V., S.H. and O.R. designed the study. Y.C., A.V., S.T., A.B., L.G., V.K.-M., D.W., S.H. and O.R. performed the analysis. Y.C. led the coding. L.G., V.K.-M., D.W., H.J.T.U. and S.F. checked the analysis and code. A.V., L.C., L.S., F.A., G.M., J.L.L., L.R., R.N., C.A.N., S.F., H.J.T.U., S.H. and O.R. oversaw the data interpretation. A.V., Y.C., S.T., S.H. and O.R. wrote the first draft. All authors discussed the first draft and contributed to the final paper.

Corresponding authors

Correspondence to Andrés Villaveces or Oliver Ratmann.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Medicine thanks Florence Guida, Rachel Kentor and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Ming Yang, in collaboration with the Nature Medicine team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Definition of orphanhood and grandparent caregiver loss.

To quantify caregiver loss from all causes, we considered orphanhood defined as children experiencing the death of one or both parents, as well as in²⁰ and following American Community Cohort definitions^61,62,63 the loss of primary grandparent caregivers aged 30 years or older who live in the same household and provide care in the absence of a parent, or who provide for most of the basic needs for children, and separately the loss of secondary grandparent caregivers aged 30 years or older serving as head of household who own or rent the family’s housing and provide for some but not most of the basic needs of their grandchildren^32,63.

Extended Data Fig. 2 Aggregated age- and sex-specific U.S. mortality counts in 1983-2021, by standardized race and ethnicity.

Line-list mortality data on US residents were retrieved from the NCHS Vital Statistics portal for each year in 1983-2021 (https://www.cdc.gov/nchs/data_access/vitalstatsonline.htm) for orphanhood and grandparent caregiver loss estimation. Data were collected from 1983 onwards because the corresponding children who lost a caregiver in 1983 at age 0 were of age 17 in 2000, and so entered our estimation of orphanhood prevalence in 2000. Individuals of other races in 1984-1991 and individuals of more than one race in 2021 were not coded consistently and not included in this study (see Methods). In total, n=93,921,920 death records were included in national orphanhood and grandparent caregiver loss estimation.

Extended Data Fig. 3 Aggregated age- and sex-specific US mortality counts in 1983-2021, by leading parental causes of death.

Line-list mortality data on US residents were retrieved from the NCHS Vital Statistics portal for each year in 1983-2021 (https://www.cdc.gov/nchs/data_access/vitalstatsonline.htm) for orphanhood and grandparent caregiver loss estimation. Causes of death were reported in the Ninth or Tenth Revision of the International Classification of Disease, which we mapped into one of 53 caregiver loss causes-of-death categories (or ‘Other’ category, Supplementary Table 1), and analysed in terms of the leading parental causes of death in 2021 (Methods). In total, n=93,921,920 death records were included in national orphanhood and grandparent caregiver loss estimation.

Extended Data Fig. 4 Age- and sex-specific fertility rates, by standardised race & ethnicity.

Line-list live births data for US mothers aged 15-49 years and corresponding fathers aged 15-77 years were retrieved from the NCHS Vital Statistics portal for each year between 1969 to 2021 (https://www.cdc.gov/nchs/data_access/vitalstatsonline.htm) for orphanhood estimation. Standardized race and ethnicity categories were attributed as described in Supplementary Table 7 (colour). Prior to 1990, we assumed the same fertility rates in each year in 1966-1989 as in 1990. For comparison, the black lines show national-level fertility rates, which we calculated directly in 1969-1989, and as weighted average of the standardized race and ethnicity-specific fertility rates in 1990-2021. The dashed vertical line represents 1990. In total, n=127,463,725 live birth records from 1990 were included in national orphanhood estimates.

Extended Data Fig. 5 Magnitude of children experiencing primary and secondary grandparent caregiver loss in the US.

(a) Estimated number of US children newly experiencing primary grandparent caregiver death (incidence) by leading parental causes of death (colour). (b) Estimated number of US children experiencing primary grandparent caregiver death in their lifetime (prevalence) by leading parental causes of death (colour). (c–d) Incidence and prevalence rates of primary grandparent caregiver death by leading parental causes of death among US children. Median estimates are shown (points) with uncertainty ranges in Supplementary Table 18. (e–h) Corresponding figures for secondary grandparent caregiver loss. Median estimates are shown (points) with uncertainty ranges in Supplementary Table 19. 2021 incidence estimates are based on an average of n=5,791 sex-, race and ethnicity- and cause-specific mortality records, and an average of n=610,229 race and ethnicity-specific natality records.

Extended Data Fig. 6 Leading causes of orphanhood incidence among US children in 2019 by race and ethnicity and sex of parent.

(a) Estimates of 2019 paternal orphanhood incidence (y-axis) by cause-of-death of fathers (point, number, colour) and race and ethnicity (panels) versus differences in incidence rates in 2019 minus those in 2000 (x-axis), with positive differences indicating increasing incidence rates and negative differences indicating decreasing incidence rates. The size of points indicates the contribution of each cause to new cases of orphanhood among US children in 2019. (b) The same for 2019 maternal orphanhood incidence by cause-of-death of mothers. Median estimates are shown (points) and uncertainty ranges are detailed in Supplementary Table 13. 2019 incidence estimates are based on an average of n=5,644 sex-, race and ethnicity- and cause-specific mortality records, and an average of n=748,936 race and ethnicity-specific natality records.

Extended Data Fig. 7 Reliability evaluation of orphanhood incidence estimates by state and leading parental cause of death.

(a) Comparison of state-, sex-, age- and leading causes-specific mortality counts for 2005-2021 that were extracted from CDC WONDER to the corresponding suppression-adjusted counts, and to the national-level sex-, age- and leading parental causes-specific counts obtained from NCHS (see Methods). (b) Comparison of state-level orphanhood incidence estimates to national-level estimates by sex and leading parental causes of death for 2005-2021 (see colour, symbols). Incidence estimates were very similar except for ‘Homicide excluding drug overdose’ in women, ‘Chronic liver disease and cirrhosis’ in women and ‘Other’ causes of death (see Methods). Only median estimates are shown.

Extended Data Fig. 8 Leading parental causes of orphanhood incidence and prevalence in 2021 by US state.

(a) Estimated number of US children newly experiencing orphanhood, by state and leading parental causes of death. (b) Estimated number of US children experiencing orphanhood in their lifetime (over ages 0-17 years), by state and leading parental causes of death. Throughout, the leading parental cause-of-death in each state is indicated with hatch marks. Throughout, median estimates (bars) are shown and uncertainty intervals are provided in Supplementary Table 14. Incidence estimates for 2021 are based on an average of n=5,071 state- and cause-specific mortality records and n=71,791 state-specific natality records, and prevalence estimates are based on aggregating incidence estimates among children over the previous 17 years while accounting for ageing.

Extended Data Fig. 9 Reliability evaluation of orphanhood incidence estimates by state, race and ethnicity and leading parental cause-of-death.

(a) Example of completeness evaluation for state-, sex-, age- and race and ethnicity-specific live birth counts in New Mexico (see Methods). (b) Example of completeness evaluation for state-, sex-, age-, race and ethnicity- and leading-causes-specific mortality counts in New Mexico (see Methods). Only data shown in bold were retained for further analysis. (c) To assess the reliability of state- and race and ethnicity-specific orphanhood estimates, we considered weighted averages of the orphanhood incidence estimates across race and ethnicity, and then compared the weighted averages against state-level estimates (see colour, symbols). Triangles and circles are shown in opaque colour when race and ethnicity-specific estimates disagreed by more than 20% and were considered unreliable. Only median estimates are shown.

Extended Data Fig. 10 Sensitivity in national-level orphanhood and grandparent caregiver loss estimates.

(a) Estimated national-level orphanhood prevalence rates by standardized race and ethnicity (colours) in the central analysis (solid line) and the sensitivity analysis (dashed) in which we relaxed our assumption on constant fertility rates before 1990 (Methods). We identified minor sensitivities to national orphanhood prevalence estimates up to 2007, with no impact on incidence and prevalence estimates for recent years. (b) Estimated national-level orphanhood prevalence rates in the central analysis (solid line in red) and in sensitivity analyses during which the impact of lower fertility rates preceding a death event were explored (dashed, dotted, dotdash and twodash lines in colour, corresponding to different multiplication factors used to model lower fertility rates prior to death events as shown in the inset (c). Throughout, only median estimates are shown. We found negative correlations in fertility and mortality rates could result in notably lower orphanhood prevalence estimates, whereas positive correlations could result in notably higher orphanhood prevalence estimates (not shown); note the y-axis range is between 2.3 and 3.5. (d) Estimated national-level loss of grandparent caregivers by age of child (colors) in the central analysis (solid line) and the two sensitivity analyses (dashed, dotted line) in which first the age composition of children experiencing grandparent caregiver loss is the same regardless of cause-of-death, and second in which the age composition was based on NCHS data (see Methods). (e) Estimated national-level loss of caregivers by age of child (colours) in the central analysis (solid line) and the same two sensitivity analyses (dashed, dotted line) as in (d). In (d) and (e), labels in colour represent the age of child. We identified minor sensitivities to overall incidence of caregiver loss.

Supplementary information

Supplementary Information (download PDF )

Supplementary Tables 1–19.

Reporting Summary (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Villaveces, A., Chen, Y., Tucker, S. et al. Orphanhood and caregiver death among children in the United States by all-cause mortality, 2000–2021. Nat Med 31, 672–683 (2025). https://doi.org/10.1038/s41591-024-03343-6

Download citation

Received: 02 April 2024
Accepted: 03 October 2024
Published: 10 January 2025
Version of record: 10 January 2025
Issue date: February 2025
DOI: https://doi.org/10.1038/s41591-024-03343-6

This article is cited by

Orphanhood and caregiver death among children in the United States by all-cause mortality, 2000–2021
- Andrés Villaveces
- Yu Chen
- Oliver Ratmann
Nature Medicine (2025)