Abstract
Secondary findings (SFs) from genome sequencing have significant implications for disease prevention and early intervention, yet their population-specific spectrum remains poorly characterized in non-European cohorts. We performed whole-genome sequencing of 6685 Chinese newborns and evaluated pathogenic variants in 84 genes from the American College of Medical Genetics and Genomics (ACMG) SF v3.3 list according to ACMG/Association for Molecular Pathology (AMP) classification guidelines, and cross-referenced against ClinVar. We identified 306 unique actionable variants, comprising 172 known pathogenic variants (KP) and 134 expected pathogenic variants (EP). When heterozygous carriers of autosomal recessive (AR) variants were included, 9.12% (610/6685) of newborns carried at least one pathogenic variant. Under ACMG SF criteria, clinically actionable variants were identified in 5.06% (338/6685) of newborns, predominantly affecting cardiovascular disease genes (3.49%) and cancer predisposition genes (1.26%), most commonly involving LDLR, TTN, and BRCA2. Importantly, 28 variants across 12 genes showed significant allele frequency divergence between Chinese and European ancestries, highlighting ancestry-specific genetic architecture. Our findings support the inclusion of high-penetrance genes prevalent in East Asian populations in population-tailored genomic screening panels, providing essential reference data for the equitable implementation of precision newborn genomics in underrepresented populations.
Data availability
The data supporting the finding of this study, including the allele frequency for all 399,037 genetic variants across the 84 ACMG-recommended genes, have been deposited in the Genome Variation Map (https://ngdc.cncb.ac.cn/gvm/) at the National Genomics Data Center, China National Center for Bioinformation/Beijing Institute of Genomics, Chinese Academy of Sciences, under the accession number that can be publicly accessible at https://ngdc.cncb.ac.cn/gvm/getProjectDetail?project=GVM001119. The release of the data was approved by the Ministry of Science and Technology of China (Project ID: PRJCA043552). To prevent the disclosure of individuals’ genetic identity, the raw sequencing data and information of the research participants are not publicly available. Further analysis of sequencing data will be made available for collaborating researchers upon request, dependent of the HGRAC’s approval.
References
Saeidian, A. H. et al. Secondary ACMG and non-ACMG genetic findings in a multiethnic cohort of 16,713 pediatric participants. Genet. Med. 26, 101225 (2024).
Green, R. C. et al. ACMG recommendations for reporting of incidental findings in clinical exome and genome sequencing. Genet. Med. 15, 565–574 (2013).
Miller, D. T. et al. Recommendations for reporting of secondary findings in clinical exome and genome sequencing, 2021 update: a policy statement of the American College of Medical Genetics and Genomics (ACMG). Genet. Med. 23, 1391–1398 (2021).
Kalia, S. S. et al. Recommendations for reporting of secondary findings in clinical exome and genome sequencing, 2016 update (ACMG SF v2.0): a policy statement of the American College of Medical Genetics and Genomics. Genet Med. 19, 249–255 (2017).
Miller, D. T. et al. ACMG SF v3.0 list for reporting of secondary findings in clinical exome and genome sequencing: a policy statement of the American College of Medical Genetics and Genomics (ACMG). Genet. Med. 23, 1381–1390 (2021).
Miller, D. T. et al. ACMG SF v3.2 list for reporting of secondary findings in clinical exome and genome sequencing: A policy statement of the American College of Medical Genetics and Genomics (ACMG). Genet. Med. 25, 100866 (2023).
Amendola, L. M. et al. Actionable exomic incidental findings in 6503 participants: challenges of variant classification. Genome Res. 25, 305–315 (2015).
Jang, M. A., Lee, S. H., Kim, N. & Ki, C. S. Frequency and spectrum of actionable pathogenic secondary findings in 196 Korean exomes. Genet. Med. 17, 1007–1011 (2015).
Yamaguchi-Kabata, Y. et al. Evaluation of reported pathogenic variants and their frequencies in a Japanese population based on a whole-genome reference panel of 2049 individuals. J. Hum. Genet. 63, 213–230 (2018).
Chetruengchai, W. & Shotelersuk, V. Actionable secondary findings in the 73 ACMG-recommended genes in 1559 Thai exomes. J. Hum. Genet. 67, 137–142 (2022).
Hsu, J. S. et al. Complete genomic profiles of 1496 Taiwanese reveal curated medical insights. J. Adv. Res. 66, 197–207 (2024).
Tang, C. S. et al. Actionable secondary findings from whole-genome sequencing of 954 East Asians. Hum. Genet. 137, 31–37 (2018).
Lee, K. et al. ACMG SF v3.3 list for reporting of secondary findings in clinical exome and genome sequencing: A policy statement of the American College of Medical Genetics and Genomics (ACMG). Genet. Med. 27, https://doi.org/10.1016/j.gim.2025.101454 (2025).
Ovchinnikova, E. V., Garbuz, M. M., Ovchinnikova, A. A. & Kumeiko, V. V. Epidemiology of Wilson’s disease and pathogenic variants of the ATP7B gene leading to diversified protein disfunctions. Int. J. Mol. Sci. 25, https://doi.org/10.3390/ijms25042402 (2024).
Appadurai, V. et al. Apparent underdiagnosis of Cerebrotendinous Xanthomatosis revealed by analysis of ~60,000 human exomes. Mol. Genet. Metab. 116, 298–304 (2015).
Santos, R. D. Advancing prediction of pathogenicity of familial hypercholesterolemia LDL receptor commonest variants with machine learning models. JACC Basic Transl. Sci. 6, 828–830 (2021).
Jammal Addin, M. B., Young, D., McCarrison, S. & Hunter, L. Dilated cardiomyopathy in a national paediatric population. Eur. J. Pediatr. 178, 1229–1235 (2019).
Roberts, A. M. et al. Integrated allelic, transcriptional, and phenomic dissection of the cardiac effects of titin truncations in health and disease. Sci. Transl. Med. 7, 270ra276 (2015).
Jayson, G. C., Kohn, E. C., Kitchener, H. C. & Ledermann, J. A. Ovarian cancer. Lancet 384, 1376–1388 (2014).
Loibl, S., Poortmans, P., Morrow, M., Denkert, C. & Curigliano, G. Breast cancer. Lancet 397, 1750–1769 (2021).
Pietrangelo, A. Hereditary hemochromatosis: pathogenesis, diagnosis, and treatment. Gastroenterology 139, 393–408.e392 (2010).
Burke, W. et al. Hereditary hemochromatosisgene discovery and its implications for population-based screening. JAMA 280, 172–178 (1998).
Allen, K. J. et al. HFE Cys282Tyr homozygotes with serum ferritin concentrations below 1000 microg/L are at low risk of hemochromatosis. Hepatology 52, 925–933 (2010).
Cong, P. K. et al. Identification of clinically actionable secondary genetic variants from whole-genome sequencing in a large-scale Chinese population. Clin. Transl. Med. 12, https://doi.org/10.1002/ctm2.866 (2022).
Lerner-Ellis, J. P. et al. Identification of the gene responsible for methylmalonic aciduria and homocystinuria, cblC type. Nat. Genet. 38, 93–100 (2006).
Wang, C. et al. Mutation spectrum of MMACHC in Chinese pediatric patients with cobalamin C disease: a case series and literature review. Eur. J. Med. Genet. 62, 103713 (2019).
Huang, Y. et al. Landscape of secondary findings in Chinese population: a practice of ACMG SF v3.0 list. J. Pers. Med. 12, https://doi.org/10.3390/jpm12091503 (2022).
Cassa, C. A., Tong, M. Y. & Jordan, D. M. Large numbers of genetic variants considered to be pathogenic are common in asymptomatic individuals. Hum. Mutat. 34, 1216–1220 (2013).
Aloraini, T. et al. The rate of secondary genomic findings in the Saudi population. Am. J. Med. Genet. A 188, 83–88 (2022).
Kuo, C. W. et al. Frequency and spectrum of actionable pathogenic secondary findings in Taiwanese exomes. Mol. Genet. Genom. Med 8, e1455 (2020).
Xiao, H. et al. Secondary genomic findings in the 2020 China Neonatal Genomes Project participants. World J. Pediatr. 18, 687–694 (2022).
Natarajan, P. et al. Aggregate penetrance of genomic variants for actionable disorders in European and African Americans. Sci. Transl. Med. 8, 364ra151 (2016).
Dewey, F. E. et al. Distribution and clinical impact of functional variants in 50,726 whole-exome sequences from the DiscovEHR study. Science 354, https://doi.org/10.1126/science.aaf6814 (2016).
Khera, A. V. et al. Diagnostic yield and clinical utility of sequencing familial hypercholesterolemia genes in patients with severe hypercholesterolemia. J. Am. Coll. Cardiol. 67, 2578–2589 (2016).
Nordestgaard, B. G. et al. Familial hypercholesterolaemia is underdiagnosed and undertreated in the general population: guidance for clinicians to prevent coronary heart disease: consensus statement of the European Atherosclerosis Society. Eur. Heart J. 34, 3478–3490 (2013).
Myśliwiec, M. et al. 2024 Polish recommendations for the management of familial hypercholesterolemia in children and adolescents. Arch. Med. Sci. 20, 1741–1753 (2024).
Bogsrud, M. P. et al. Treatment goal attainment in children with familial hypercholesterolemia: a cohort study of 302 children in Norway. J. Clin. Lipido. 12, 375–382 (2018).
Luirink, I. K. et al. 20-year follow-up of statins in children with familial hypercholesterolemia. N. Engl. J. Med. 381, 1547–1556 (2019).
Green, R. C. et al. Actionability of unanticipated monogenic disease risks in newborn genomic screening: findings from the BabySeq Project. Am. J. Hum. Genet. 110, 1034–1045 (2023).
Wade, C. H., Wilfond, B. S. & McBride, C. M. Effects of genetic risk information on children’s psychosocial wellbeing: a systematic review of the literature. Genet. Med. 12, 317–326 (2010).
Botkin, J. R. et al. Points to consider: ethical, legal, and psychosocial implications of genetic testing in children and adolescents. Am. J. Hum. Genet. 97, 6–21 (2015).
Huang, C. et al. An integrated Asian human SNV and indel benchmark established using multiple sequencing methods. Sci. Rep. 10, 9821 (2020).
Li, Z. et al. An efficient large-scale whole-genome sequencing analyses practice with an average daily analysis of 100Tbp: ZBOLT. Clin. Transl. Discov. 3, e252 (2023).
Van der Auwera, G. A. et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr. Protoc. Bioinform. 43, 11.10.1–11.10.33 (2013).
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–595 (2010).
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience 10, https://doi.org/10.1093/gigascience/giab008 (2021).
Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987–2993 (2011).
Zhang, F. et al. Ancestry-agnostic estimation of DNA sample contamination from sequence reads. Genome Res. 30, 185–194 (2020).
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Manichaikul, A. et al. Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867–2873 (2010).
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).
Kumar, P., Henikoff, S. & Ng, P. C. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat. Protoc. 4, 1073–1081 (2009).
Adzhubei, I. A. et al. A method and server for predicting damaging missense mutations. Nat. Methods 7, 248–249 (2010).
Adzhubei, I., Jordan, D. M. & Sunyaev, S. R. Predicting functional effect of human missense mutations using PolyPhen-2. Curr. Protoc. Hum. Genet. 76, 7–20 (2013).
Schwarz, J. M., Rodelsperger, C., Schuelke, M. & Seelow, D. MutationTaster evaluates disease-causing potential of sequence alterations. Nat. Methods 7, 575–576 (2010).
Landrum, M. J. et al. ClinVar: public archive of relationships among sequence variation and human phenotype. Nucleic Acids Res. 42, D980–D985 (2014).
Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020).
McLaren, W. et al. The ensembl variant effect predictor. Genome Biol. 17, 122 (2016).
Gudmundsson, S. et al. Addendum: the mutational constraint spectrum quantified from variation in 141,456 humans. Nature 597, E3–E4 (2021).
Acknowledgements
We are especially grateful to the participation of the volunteers and their families. We sincerely thank Yuhua Ye for his valuable assistance in the review and interpretation of genetic variants. This work was supported by the National Natural Science Foundation of China (2022YFC2703102), and the China-Serbia Science and Technology Cooperation Committee Exchange Program, Sixth Session (Project 6-3 to M. F.).
Author information
Authors and Affiliations
Contributions
M.F. and X.J. conceived and designed the study. Y.H. performed data analysis, quality control, variant annotation and data interpretation. Y.G., Z.D., X.J., Y.S., and C.L. were responsible for participants recruitment and the collection of samples and associated information. H.H., J.L., S.P., and X.J. contributed to data generation. M.F. and Y.H. drafted the manuscript. M.F., Y.H., and X.J. revised the manuscript. All authors have read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
Authors Y.H., Y.G., Z.D., H.H., J.L., X.J., and M.F. were employed by Beijing Genomics Institution (BGI) in Shenzhen. The other authors do not have a competing interest.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Huang, Y., Gao, Y., Duan, Z. et al. Population-scale genomic screening reveals high frequency of actionable secondary findings in Chinese newborns. npj Genom. Med. (2026). https://doi.org/10.1038/s41525-026-00565-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41525-026-00565-0