Established in 2018 to push beyond the constraints of individual health and population cohorts, the IHCC is a community of cohorts advancing global science and health. We summarize the collective resources of 69 member cohorts, representing over 34 million people.
Background
The International Health Cohorts Consortium (IHCC) was established in 2018 at the request of the leaders of the Heads of International Research Organizations (HIROs) and through a collaboration between the Global Genomic Medicine Collaborative (G2MC) and the Global Alliance for Genomics and Health (GA4GH). It is a global initiative aimed at closing gaps in genomic databases to enhance representation of different ancestral groups, promoting collaboration across academic and industry partners, and supporting cutting-edge research in areas that impact global health1. The mission of the IHCC is to forge cohort connections that revolutionize population health science by providing sustainable data infrastructure, cultivating a collaborative research environment, and promoting policies and best practices that foster connectivity, interoperability, and reciprocity.
IHCC membership criteria prioritize large population cohorts, capability for longitudinal health follow-up, broad participant selection, biological sample collection, and a commitment to data-sharing. It also recognizes the value of smaller cohorts representing underrepresented or unique groups.
Projects supported by the IHCC are focused on the biological, environmental, and social determinants of health and disease. Its core objectives are to (1) accelerate research, (2) harmonize data, (3) educate researchers, (4) enhance public health, and (5) foster innovation. Collaborators can utilize IHCC resources, including the public IHCC Data Atlas (https://atlas.ihccglobal.org/), to drive distinct areas of research in collaboration with other cohorts, and/or to access samples and data from populations of interest.
In this Comment, we summarize the breadth of IHCC members’ resources, invite prospective partners to join us in addressing global challenges in health research, and propose a federated template for constructive collaboration.
A large number of IHCC resources are available
Participants from member cohorts span a broad range of ages, ethnicities, and geographic locations, with 35% of locations (N = 24) self-identifying as a low- and middle-income country (LMIC), per World Bank criteria2.
A members’ survey was disseminated from November 2021 to March 2022 and again from November 2022 to March 2023. In total, it was completed by 69 of the 89 cohort members. Respondents reflected the diversity of the IHCC as a whole and were from Africa (N = 7), Asia (N = 19), Australia (N = 2), North America (N = 19), South America (N = 3), and Europe (N = 19). Of the 69 member cohorts who responded, 45 (65%) hold genomic data on some/all of their cohort. Forty-one sites had collected genotype data from ~12,834,036 research participants. Collectively, the cohorts represent ~34 million unique research participants with available data/samples (Fig. 1).
Several million biospecimens are available for immediate research use, subject to project-specific informed consent and ethical oversight, and country-specific legislation. Cohorts with ongoing recruitment continuously add to these collections (Fig. 2).
Across the 69 responding cohorts, millions of biospecimens are available for immediate research use, subject to project-specific informed consent and ethical oversight, and country-specific legislation. Cohorts with ongoing recruitment continuously add to these sample collections, ensuring that both new and existing studies can access fresh or archived samples as needed.
Relevant phenotype data are largely broad-based (i.e., not phenotype-specific). Demographic data informative of social determinants of health are widely available. Additionally, detailed clinical outcomes address chronic disease, mental health, and lifestyle. Ten (14%) IHCC cohorts (including two LMICs) reported linkage to participants’ electronic medical records (EMR).
IHCC encourages data sharing and collaboration
The IHCC supports open and timely access to research findings, adopting the Framework for Responsible Sharing developed by the Global Alliance for Genomics and Health (GA4GH)3. This assumes that data donors (or their legal representatives) have provided consent for data use and sharing, and that ethics oversight consistent with local, national, and relevant international laws has been applied, as well as culturally appropriate ethics standards and best practices for governing future data use. The intention and willingness to collaborate with other members and beyond is a prerequisite to membership and fundamental to the IHCC research enterprise.
IHCC facilitates LMIC/LRS representation and support
Since its founding, a core component of the IHCC's mission has been to enhance inclusion of cohorts with diversity from LMICs/low-resource settings (LRS) and to contribute to closing the existing ancestral and minority population gaps in databases. LRSs are characterized by significant constraints in both financial and human resources, encompassing clinical and non-clinical personnel. These contexts are further defined by an underdeveloped organizational and physical infrastructure. In addition to supporting LMIC/LRS collaborations, the IHCC explicitly supports diversity across age, ethnicity, sex, rural/urban, socioeconomic status, both in terms of constituent cohorts and investigator development.
The Global Cohort Atlas allows robust and simple data sharing
The IHCC Global Cohort Atlas serves as a centralized resource for discovering available data across cohorts. It enables data harmonization and sharing across different platforms. Participation in the Atlas is a requirement for any projects funded by the IHCC.
In 2020, the Atlas was launched as the first open global directory of large-scale human cohorts, with cohort information gathered from surveys and data dictionaries. It enables the findability of cohort data by providing users with a single entry point to cross-query cohort data dictionaries to discover cohorts, phenotypes, or variables of interest. Searchable dimensions include participant disease status, data use policy, sample collection parameters, genotypes, and demographic/health variables from cohort data dictionaries. The Atlas encourages data interoperability by providing harmonized descriptions of cohorts. All of the Atlas’s cohort public metadata (cohort descriptors, data dictionaries, and variables) are openly available (https://github.com/IHCC-cohorts).
Cohorts are classified into three levels of detail based on the cohort variables collected and harmonized. High-level cohort fields are collected upon joining IHCC (Level 1), structured cohort descriptors are collected through the IHCC annual cohort surveys (Level 2), and complete semantic harmonization of the cohort data dictionary variables is carried out in collaboration with cohort data managers (Level 3). This multi-level approach has enabled us to lower the bar for inclusion in the Atlas (Level 1), balancing the curation overhead required to reach the most comprehensive Level 3.
The Cohort Atlas has grown to 89 cohorts from 43 countries with 12 harmonized cohorts (Level 3) through collaborations with projects such as CINECA and the Davos Alzheimer’s Collaborative (DAC).
Starting a collaboration
Collaborations typically start with a feasibility assessment through the Atlas and outreach to cohort leaders for project alignment.
The Alzheimer’s pilot aimed to integrate existing cohort data worldwide to form a global Alzheimer’s disease (AD) resource. By uniting diverse cohorts with varying degrees of genomic and phenotypic data, the program targeted early detection of disease onset and progression. The approach to project planning, initiation, and development constitutes a working model for future IHCC programs with a milestone-driven approach to program development (Table 1).
In the planning stages, over 70 experts defined specific recommendations for the ‘Must have” minimum set of components and approaches, as well as suggestions for desirable “Nice to have” components. A full mapping and assessment of existing cohorts of subjects willing and able to contribute, already enrolled participants, was undertaken. Twenty IHCC cohorts comprising several million individuals were identified. Each completed a resource survey aligned with the scientific plan, which allowed the IHCC to assess readiness, gaps, and support needs. In addition, the group cataloged additional cohorts/participants that offer congruent capabilities, as well as specialized sub-cohorts focused on narrow aspects such as brain autopsy or novel biomarker strategies. In the pilot phase, the IHCC pursued cohort leads from cohorts to contribute data for the assessment of AD polygenic risk in diverse populations in comparison with European ancestry cohorts, including the UK Biobank4.
We envision that this model is adaptable to any number of phenotypes/projects, whereby global data on diverse and heterogeneous populations will yield new discoveries.
Existing pilots and publications
Our existing pilots give an example of the diversity of areas in which our research can be used. Several cohorts are engaged in using PRS to assess genetic susceptibility to various diseases, including cardiovascular diseases, diabetes, and dementia. These pilot studies are crucial for understanding PRS application in underrepresented populations5. Similarly, several cohorts are focused on mental health disorders, including studies to identify genetic and environmental contributors to conditions such as depression, schizophrenia, and anxiety, particularly in non-European populations6,7,8,9. The IHCC continues to support studies pioneering the use of metabolomics in population health. This research is crucial for understanding how metabolic pathways contribute to chronic diseases such as diabetes and cardiovascular conditions in diverse populations10,11. Another example of IHCC-supported health initiatives is in studying the impact of opioids—known carcinogens—on cancer burden. Member cohorts have formed a global collaboration to harmonize data on opioid and cancer risk12,13. Similarly, IHCC Cohorts are collaborating on the study of early-life adiposity linked to cancer risk in diverse genetic ancestries14,15,16. Finally, cohorts from Africa have developed a resource for coronavirus host genomics studies. This multi-collaborator strategic partnership was designed to provide harmonized demographic, clinical, and genetic information specific to Black South Africans with COVID-1917.
Future research directions
We aim to continue our support for the pilot program and to expand to new areas in the coming year. Our subgroups remain active and continue to address gaps in global health research. Future efforts will focus on improving the understanding of mental health disorders across LMIC populations, where the burden of mental health conditions is rising but remains under-researched. We also plan to enhance members’ metabolomics capabilities, particularly by developing standards for cross-cohort data integration. New initiatives include unique opportunities to further assess the relationship between climate and health. With over 70% of IHCC cohorts collecting environmental data, we are well-powered to lead in this increasingly important domain. Other areas for IHCC expansion include cancer genomics, infectious disease research, and pharmacogenomics. These fields represent new frontiers where diverse global data will be important for identifying genetic and environmental contributors to health.
Conclusions
The IHCC is a unique resource and transformative global initiative aimed at uniting large-scale, longitudinal population cohort studies to address some of the most pressing challenges in population health. The consortium’s commitment to diversity, inclusion, and representation addresses existing gaps in genomic databases and health disparities. Through initiatives such as the Global Cohort Atlas, standardized data-sharing frameworks, and federated analysis models, the IHCC has enabled collaborations that transcend geographical and technological barriers while respecting ethical and regulatory considerations.
IHCC’s scientific strategy has catalyzed novel research, including the application of PRS, studies on mental health and metabolomics, and the development of global frameworks for Alzheimer’s disease research. As it continues to grow, the IHCC remains focused on advancing its mission to support sustainable, inclusive, and high-impact research.
The IHCC invites researchers, policymakers, and industry partners to join its efforts in building a collaborative ecosystem that leverages population cohort data to drive innovation and equity in health outcomes worldwide. Researchers interested in joining can contact ihccinfo@ihccglobal.org.
Data availability
All of the underlying cohort public metadata, including cohort descriptors, data dictionaries, and variables, are openly available. Collaborators can utilize IHCC resources, including the public IHCC Data Atlas at: https://atlas.ihccglobal.org/. All of the Atlas’s cohort public metadata (cohort descriptors, data dictionaries, and variables) are openly available: https://github.com/IHCC-cohorts.
References
Manolio, T. A., Goodhand, P. & Ginsburg, G. The International Hundred Thousand Plus Cohort Consortium: integrating large-scale cohorts to address global scientific challenges. Lancet Digit. Health 2, e567–e568 (2020).
Organization WH. The Global Health Observatory (World Health Organization, 2024).
Knoppers, B. M. Framework for responsible sharing of genomic and health-related data. Hugo J. 8, 3 (2014).
Sleiman, P. M. et al. Trans-ethnic genomic informed risk assessment for Alzheimer’s disease: an International Hundred K+ Cohorts Consortium study. Alzheimers Dement. 19, 5765–5772 (2023).
Qu, H. Q. et al. Trans-ethnic polygenic risk scores for body mass index: an international hundred K+ cohorts consortium study. Clin. Transl. Med. 13, e1291 (2023).
Brunoni, A. R. et al. Prevalence and risk factors of psychiatric symptoms and diagnoses before and during the COVID-19 pandemic: findings from the ELSA-Brasil COVID-19 mental health cohort. Psychol. Med. 53, 446–457 (2023).
Choi, K. W. et al. Effects of social support on depression risk during the COVID-19 pandemic: what support types and for whom? Preprint at medRxiv https://doi.org/10.1101/2022.05.15.22274976 (2022).
Fatori, D. et al. Trajectories of common mental disorders symptoms before and during the COVID-19 pandemic: findings from the ELSA-Brasil COVID-19 Mental Health Cohort. Soc. Psychiatry Psychiatr. Epidemiol. 57, 2445–2455 (2022).
Lee, Y. H. et al. Association of everyday discrimination with depressive symptoms and suicidal ideation during the COVID-19 pandemic in the All of Us research program. JAMA Psychiatry 79, 898–906 (2022).
Qu, H. Q. et al. Metabolomic profiling for dyslipidemia in pediatric patients with sickle cell disease, on behalf of the IHCC consortium. Metabolomics 18, 101 (2022).
Qu, H. Q. et al. Metabolomic profiling of samples from pediatric patients with asthma unveils deficient nutrients in African Americans. iScience 25, 104650 (2022).
Sheikh, M., Brennan, P., Mariosa, D. & Robbins, H. A. Opioid medications: an emerging cancer risk factor? Br. J. Anaesth. 130, e401–e403 (2023).
Alcala, K. et al. Incident cancers attributable to using opium and smoking cigarettes in the Golestan cohort study. EClinicalMedicine 64, 102229 (2023).
Papadimitriou, N. et al. Separating the effects of early and later life adiposity on colorectal cancer risk: a Mendelian randomization study. BMC Med. 21, 5 (2023).
Papadimitriou, N. et al. Body mass index at birth and early life and colorectal cancer: a two-sample Mendelian randomization analysis in European and East Asian genetic similarity populations. Pediatr. Obes. 20, e13186 (2025).
Mukhtar, M. et al. The Associations of selenoprotein genetic variants with the risks of colorectal adenoma and colorectal cancer: case-control studies in Irish and Czech populations. Nutrients 14, 2718 (2022).
May, A. K. et al. Coronavirus Host Genomics Study: South Africa (COVIGen-SA). Glob. Health Epidemiol. Genom. 2022, 7405349 (2022).
Acknowledgements
This study was supported by the International Health Cohorts Consortium (IHCC), a program of the Global Genomic Medicine Collaborative (GGMC), with funding from the National Institutes of Health (US), the Chan-Zuckerberg Initiative, and the Wellcome Trust.
Author information
Authors and Affiliations
Contributions
John J Connolly: Conception, writing, and data curation. Scott Sundseth: Project administration, writing, review, and editing. Grant M. Wood: Project administration. Chisom Nwaneri: Project administration. Geoffrey Ginsburg: Conception, review, and editing. Philip Awadalla: Review and editing. Michele Ramsay: Review and editing. Thomas Keane: Review and editing. Peter Goodhand: Conception, review, and editing. Adam S. Butterworth: Conception, review, and editing. Hakon Hakonarson: Conception, review, and editing.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Connolly, J.J., Sundseth, S., Wood, G.M. et al. The International Health Cohorts Consortium (IHCC) advances population health research and genomic discovery. Commun Med 5, 366 (2025). https://doi.org/10.1038/s43856-025-01026-y
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s43856-025-01026-y

