Abstract
Despite the mounting demand for generative population models, their limited generalizability to underrepresented demographic groups hinders widespread adoption in real-world applications. Here we propose a diversity-aware population modeling framework that can guide targeted strategies in public health and education, by estimating subgroup-level effects and stratifying predictions to capture sociodemographic variability. We leverage Bayesian multilevel regression and post-stratification to systematically quantify inter-individual differences in the relationship between socioeconomic status and cognitive development. Post-stratification enhanced the interpretability of model predictions across underrepresented groups by incorporating US Census data to gain additional insights into smaller subgroups in the Adolescent Brain Cognitive Development Study. This ensured that predictions were not skewed by overly heterogeneous or homogeneous representations. Our analyses underscore the importance of combining Bayesian multilevel modeling with post-stratification to validate reliability and provide a more holistic explanation of sociodemographic disparities in our diversity-aware population modeling framework.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$32.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 digital issues and online access to articles
$119.00 per year
only $9.92 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to the full article PDF.
USD 39.95
Prices may be subject to local taxes which are calculated during checkout






Similar content being viewed by others
Data availability
The data supporting the findings of this study are available from the ABCD and Robert Graham Center’s SDI dataset15. The ABCD dataset is a publicly available resource accessible through the National Institute of Mental Health Data Archive. All relevant instructions to obtain the data can be found online at: https://nda.nih.gov/abcd/request-access. Source Data are provided with this paper.
References
Ciccia, A. H., Meulenbroek, P. & Turkstra, L. S. Adolescent brain and cognitive developments: implications for clinical assessment in traumatic brain injury. Top Lang. Disord. 29, 249–265 (2009).
Hackman, D. A., Farah, M. J. & Meaney, M. J. Socioeconomic status and the brain: mechanistic insights from human and animal research. Nat. Rev. Neurosci. 11, 651–659 (2010).
Falk, E. B. et al. What is a representative brain? Neuroscience meets population science. Proc. Natl Acad. Sci. USA 110, 17615–17622 (2013).
Kopal, J., Uddin, L. Q. & Bzdok, D. The end game: respecting major sources of population diversity. Nat. Methods 20, 1122–1128 (2023).
Evans, G. W. & Cassells, R. C. Childhood poverty, cumulative risk exposure, and mental health in emerging adults. Clin. Psychol. Sci. 2, 287–296 (2014).
Parolin, Z. Temporary assistance for needy families and the Black–White child poverty gap in the United States. Socio-Econ. Rev. 19, 1005–1035 (2021).
Lichter, D., Zhenchao, Q. & Crowley, M. Race and poverty: divergent fortunes of America’s children. Focus 24, 8–16 (2006).
Massey, D. S. & Fischer, M. J. How segregation concentrates poverty. Ethn. Racial Stud. 23, 670–691 (2000).
Hernández, E. Utilizing critical race theory to examine race/ethnicity, racism, and power in student development theory and research. J. College Student Dev. 57, 168–180 (2016).
Lichter, D. T., Taquino, M. C. & Parisi, D. The geography of exclusion: race, segregation, and concentrated poverty. Social Problems 59, 364–388 (2012).
Henry, D. A., Votruba-Drzal, E. & Miller, P. Child development at the intersection of race and SES: an overview. Adv. Child Dev. Behav. 57, 1–25 (2019).
Jernigan, T. L., Brown, S. A. & Dowling, G. J. The Adolescent Brain Cognitive Development Study. J. Res. Adolescence 28, 154–156 (2018).
Wu, S. M. & Stephenson, B. J. K. Bayesian estimation methods for survey data with potential applications to health disparities research. Wiley Interdiscip. Rev. Comput. Stat. 16, e1633 (2024).
Saragosa-Harris, N. M. et al. A practical guide for researchers and reviewers using the ABCD Study and other large longitudinal datasets. Dev. Cogn. Neurosci. 55, 101115 (2022).
Social Deprivation Index (SDI) (Robert Graham Center, 2025); https://www.graham-center.org/maps-data-tools/social-deprivation-index.html
Social Deprivation Index (SDI) (Robert Graham Center, accessed 29 November 2021); https://www.graham-center.org/maps-data-tools/social-deprivation-index.html
Gelman, A. & Pardoe, I. Bayesian measures of explained variance and pooling in multilevel (hierarchical) models. Technometrics 48, 241–251 (2006).
Kwate, N. O. A. & Goodman, M. S. An empirical analysis of White privilege, social position and health. Social Sci. Med. 116, 150–160 (2014).
Quillian, L. Segregation and poverty concentration: the role of three segregations. Am. Sociol. Rev. 77, 354–379 (2012).
Galster, G. & Sharkey, P. Spatial foundations of inequality: a conceptual model and empirical overview. RSF 3, 1–33 (2017).
Gaubert, C., Kline, P., Vergara, D. & Yagan, D. Trends in US spatial inequality: concentrating affluence and a democratization of poverty. In AEA Papers and Proceedings Vol. 111 520–525 (American Economic Association, 2021).
Popescu, I., Duffy, E., Mendelsohn, J. & Escarce, J. J. Racial residential segregation, socioeconomic disparities, and the White–Black survival gap. PLoS One 13, e0193222 (2018).
Kühn, M. Peripheralization: theoretical concepts explaining socio-spatial inequalities. European Plann. Stud. 23, 367–378 (2015).
Donnelly, L. et al. Geography of intergenerational mobility and child development. Proc. Natl Acad. Sci. USA 114, 9320–9325 (2017).
Faul, J. D., Ware, E. B., Kabeto, M. U., Fisher, J. & Langa, K. M. The effect of childhood socioeconomic position and social mobility on cognitive function and change among older adults: a comparison between the United States and England. J. Gerontol. B 76, S51–S63 (2021).
Henry, D. A., Betancur Cortés, L. & Votruba-Drzal, E. Black–White achievement gaps differ by family socioeconomic status from early childhood through early adolescence. J. Educational Psychol. 112, 1471 (2020).
Assari, S., Mardani, A., Maleki, M., Boyce, S, & Bazargan, M. Black–White achievement gap: role of race, school urbanity, and parental education. Pediatric Health Med. Ther. 12, 1–11 (2021).
Assari, S., Najand, B. & Donovan, A. Exposure to adverse life events among children transitioning into adolescence: intersections of socioeconomic position and race. J. Mental Health Clin. Psychol. 8, 5 (2024).
Boyce, S., Bazargan, M., Caldwell, C. H., Zimmerman, M. A. & Assari, S. Parental educational attainment and social environment of urban public schools in the US: Blacks’ diminished returns. Children 7, 44 (2020).
Weissman, D. G., Hatzenbuehler, M. L., Cikara, M., Barch, D. M. & McLaughlin, K. A. State-level macro-economic factors moderate the association of low income with brain structure and mental health in US children. Nat. Commun. 14, 2085 (2023).
Ulnicane, I. & Aden, A. Power and politics in framing bias in artificial intelligence policy. Rev. Policy Res. 40, 665–687 (2023).
Agénor, M. Future directions for incorporating intersectionality into quantitative population health research. Am. J. Public Health 110, 803–806 (2020).
Sethi, T., Kalia, A., Sharma, A. & Nagori, A. Interpretable artificial intelligence: closing the adoption gap in healthcare. In Artificial Intelligence in Precision Health 3–29 (Academic, 2020).
What Census Calls Us: A Historical Timeline (Pew Research Center, 2020); https://www.pewresearch.org/wp-content/uploads/2020/02/PH_15.06.11_MultiRacial-Timeline.pdf
Helms, J. E. Why is there no study of cultural equivalence in standardized cognitive ability testing? Am. Psychol. 47, 1083 (1992).
Karcher, N. R. & Barch, D. M. The ABCD Study: understanding the development of risk for mental and physical health outcomes. Neuropsychopharmacology 46, 131–142 (2021).
Volkow, N. D. et al. The conception of the ABCD Study: from substance use to a broad NIH collaboration. Dev. Cogn. Neurosci. 32, 4–7 (2018).
Auchter, A. M. et al. A description of the ABCD organizational structure and communication framework. Dev. Cogn. Neurosci. 32, 8–15 (2018).
ABCD Data Repository (National Institute of Mental Health, 2025); https://data-archive.nimh.nih.gov/abcd
Garavan, H. et al. Recruiting the ABCD sample: design considerations and procedures. Dev. Cogn. Neurosci. 32, 16–22 (2018).
Compton, W. M., Dowling, G. J. & Garavan, H. Ensuring the best use of data. JAMA Pediatrics 173, 809–810 (2019).
Butler, D. C., Petterson, S., Phillips, R. L. & Bazemore, A. W. Measures of social deprivation that predict health care access and need within a rational area of primary care service delivery. Health Services Res. 48, 539–559 (2013).
Chen, J. et al. Shared and unique brain network features predict cognitive, personality, and mental health scores in the ABCD Study. Nat. Commun. 13, 1–17 (2022).
Yip, S. W., Jordan, A., Kohler, R. J., Holmes, A. & Bzdok, D. Multivariate, transgenerational associations of the COVID-19 pandemic across minoritized and marginalized communities. JAMA Psych. 79, 350–358 (2022).
Sugianela, Y. & Ahmad, T. Pearson correlation attribute evaluation-based feature selection for intrusion detection system. In 2020 International Conference on Smart Technology and Applications (ICoSTA) 1–5 (IEEE, 2020).
Higgins, J. P., White, I. R. & Anzures‐Cabrera, J. Meta‐analysis of skewed data: combining results reported on log‐transformed or raw scales. Stats Med. 27, 6072–6092 (2008).
Gelman, A. Poststratification into many categories using hierarchical logistic regression. Surv. Methodol. 23, 127 (1997).
Park, D. K., Gelman, A. & Bafumi, J. Bayesian multilevel estimation with poststratification: state-level estimates from national polls. Pol. Anal. 12, 375–385 (2004).
pymc-devs. PyMC3. GitHub https://github.com/pymc-devs/pymc3 (2024).
Bonkhoff, A. K. et al. Generative lesion pattern decomposition of cognitive impairment after stroke. Brain Commun. 3, fcab110 (2021).
ArviZ (ArviZ, 2022); https://www.arviz.org/
dblabs-mcgill-mila. BMRP-diversity-aware-pm. GitHub https://github.com/dblabs-mcgill-mila/BMRP-diversity-aware-pm (2024).
Osayande, N. et al. Diversity-aware population models. figshare https://doi.org/10.6084/m9.figshare.28234859 (2025).
Acknowledgements
D.B. was supported by the Brain Canada Foundation, through the Canada Brain Research Fund, with the financial support of Health Canada, National Institutes of Health (NIH R01 AG068563A, NIH R01 DA053301-01A1, NIH R01 MH129858-01A1), the Canadian Institute of Health Research (CIHR 438531, CIHR 470425), the Healthy Brains Healthy Lives initiative (Canada First Research Excellence fund), the IVADO R3AI initiative (Canada First Research Excellence fund), and by the CIFAR Artificial Intelligence Chairs program (Canada Institute for Advanced Research).
Author information
Authors and Affiliations
Contributions
D.B. and N.O. designed the study, analyzed sociodemographic and behavioral data, and drafted the manuscript. J.M., S.A., J.K., A.H. and S.W.Y. provided feedback on the manuscript.
Corresponding authors
Ethics declarations
Competing interests
D.B. is a shareholder and advisory board member at MindState Design Labs, USA. The other authors declare no competing interests.
Peer review
Peer review information
Nature Computational Science thanks Joyonna Gamble-George, Arianna Gard and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Fernando Chirigati, in collaboration with the Nature Computational Science team.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 A comparison of the participant distribution in the ABCD cohort and in the US Census.
A, To examine potential sampling discrepancies between the ABCD cohort and the US Census SDI, participant distributions are compared across race and state strata. This shows that Asian participants in Missouri are represented only in the SDI and not in the ABCD cohort. B, Pearson’s correlation coefficients, |ρ|, are visualized to identify which SDV–neurobehavior relationships to model. Based on our set threshold of |ρ| ≥ 0.2, we identified 14 pairs (out of 2052 candidate pairs) with the strongest positive and negative associations, narrowing our study to 5 SDVs and 3 cognitive phenotypes.
Supplementary information
Source data
Source Data Fig. 2
94% HDI interval for each intercept and slope estimation.
Source Data Fig. 3
Mean predictions before and after post-stratification.
Source Data Fig. 4
Mean predictions before and after post-stratification for each binary SES group.
Source Data Fig. 5
Mean predictions before and after post-stratification for each state.
Source Data Extended Data Fig. 1
Race–state distribution counts and Pearson’s cross-correlation coefficients.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Osayande, N., Marotta, J., Aggarwal, S. et al. Quantifying associations between socio-spatial factors and cognitive development in the ABCD cohort. Nat Comput Sci 5, 221–233 (2025). https://doi.org/10.1038/s43588-025-00774-0
Received:
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1038/s43588-025-00774-0
This article is cited by
-
Abnormal alterations in neurodevelopment in preterm children with very low birth weight during the adolescence
European Child & Adolescent Psychiatry (2025)


