Abstract
Copy number variations (CNVs) are genomic structural variations that result from the deletion or duplication of large genomic segments. The characterization of CNVs is largely underrepresented, particularly those of indigenous populations, such as the Orang Asli in Peninsular Malaysia. In the present study, we first characterized the genome-wide CNVs of four major native populations from Peninsular Malaysia, including the Malays and three Orang Asli populations; namely, Proto-Malay, Senoi, and Negrito (collectively called PM). We subsequently assessed the distribution of CNVs across the four populations. The resulting global CNV map revealed 3102 CNVs, with an average of more than 100 CNVs per individual. We identified genes harboring CNVs that are highly differentiated between PM and global populations, indicating that these genes are predominantly enriched in immune responses and defense functions, including APOBEC3A_B, beta-defensin genes, and CCL3L1, followed by other biological functions, such as drug and toxin metabolism and responses to radiation, suggesting some attributions between CNV variations and adaptations of the PM groups to the local environmental conditions of tropical rainforests.
Similar content being viewed by others
Log in or create a free account to read this content
Gain free access to this article, as well as selected content from this journal and more on nature.com
or
References
Iafrate AJ, Feuk L, Rivera MN, et al. Detection of large-scale variation in the human genome. Nat Genet. 2004;36:949–51.
Sebat J, Lakshmi B, Troge J, et al. Large-scale copy number polymorphism in the human genome. Science. 2004;305:525–8.
Redon R, Ishikawa S, Fitch KR, Feuk L, Redon R, Ishikawa S, et al. Global variation in copy number in the human genome. Nature. 2006;444:444–54.
Lupski JR, Stankiewicz P. Genomic disorders: molecular mechanisms for rearrangements and conveyed phenotypes. PLoS Genet. 2005;1:0627–33.
Wong KK, deLeeuw RJ, Dosanjh NS, et al. A comprehensive analysis of common copy-number variations in the human genome. Am J Hum Genet. 2007;80:91–104.
Perry GH, Dominy NJ, Claw KG, et al. Diet and the evolution of human amylase gene copy number variation. Nat Genet. 2007;39:1256–60.
Lupski JR, Wise CA, Kuwano A, et al. Gene dosage is a mechanism for Charcot-Marie-Tooth disease type 1A. Nat Genet. 1992;1:29–33.
Hollox EJ, Hoh B-P. Human gene copy number variation and infectious disease. Hum Genet. 2014;133:1217–33.
Fanciulli M, Norsworthy PJ, Petretto E, et al. FCGR3B copy number variation is associated with susceptibility to systemic, but not organ-specific, autoimmunity. Nat Genet. 2007;39:721–3.
Mamtani M, Anaya J-M, He W, Ahuja SK. Association of copy number variation in the FCGR3B gene with risk of autoimmune diseases. Genes Immun. 2010;11:155–60.
Molokhia M, Fanciulli M, Petretto E, et al. FCGR3B copy number variation is associated with systemic lupus erythematosus risk in Afro-Caribbeans. Rheumatology. 2011;50:1206–10.
Sebat J, Lakshmi B, Malhotra D, et al. Strong association of de novo copy number mutations with autism. Science. 2007;316:445–9.
Stefansson H, Rujescu D, Cichon S, et al. Large recurrent microdeletions associated with schizophrenia. Nature. 2008;455:232–6.
Xu B, Roos JL, Levy S, van Rensburg EJ, Gogos JA, Karayiorgou M. Strong association of de novo copy number mutations with sporadic schizophrenia. Nat Genet. 2008;40:880–5.
Pollex RL, Hegele RA. Copy number variation in the human genome and its implications for cardiovascular disease. Circulation. 2007;115:3130–8.
Stankiewicz P, Lupski JR. Structural variation in the human genome and its role in disease. Annu Rev Med. 2010;61:437–55.
Girirajan S, Campbell CD, Eichler EE. Human copy number variation and complex genetic disease. Annu Rev Genet. 2011;45:203–26.
Gu W, Zhang F, Lupski JR. Mechanisms for human genomic rearrangements. Pathogenetics. 2008;1:4.
Lam HYK, Mu XJ, Stütz AM, et al. Nucleotide-resolution analysis of structural variants using BreakSeq and a breakpoint library. Nat Biotechnol. 2010;28:47–55.
Mills RE, Walter K, Stewart C, et al. Mapping copy number variation by population-scale genome sequencing. Nature. 2011;470:59–65.
Gene D, Asia E, Hardwick RJ, et al. A worldwide analysis of beta-defensin copy number variation suggests recent selection of a high-expressing. Hum Mutat. 2011;67948. https://doi.org/10.1002/humu.21491.
Song H, Hu H, Seok I, Chung Y. Identifying copy number variants under selection in geographically structured populations based on F-statistics. Genom Inform. 2012;10:81–7.
Sudmant PH, Mallick S, Nelson BJ, et al. Global diversity, population stratification, and selection of human copy-number variation. Science 2015;349:aab3761.
Aghakhanian F, Yunus Y, Naidu R, et al. Unravelling the genetic history of negritos and indigenous populations of Southeast Asia. Genome Biol Evol. 2015;7:1206–15.
Liu X, Yunus Y, Lu D, et al. Differential positive selection of malaria resistance genes in three indigenous populations of Peninsular Malaysia. Hum Genet. 2015;134:375–92.
Deng L, Hoh BP, Lu D, et al. The population genomic landscape of human genetic structure, admixture history and local adaptation in Peninsular Malaysia. Hum Genet. 2014;133:1169–85.
Jinam Ta, Phipps ME, Saitou N. Admixture patterns and genetic differentiation in negrito groups from West Malaysia estimated from genome-wide SNP data. Hum Biol. 2013;85:173–88.
Mokhtar SS, Marshall CR, Phipps ME, et al. Novel population specific autosomal copy number variation and its functional analysis amongst Negritos from Peninsular Malaysia. PLoS ONE. 2014;9:e100371 https://doi.org/10.1371/journal.pone.0100371
Ku C-S, Pawitan Y, Sim X, et al. Genomic copy number variations in three Southeast Asian populations. Hum Mutat. 2010;31:851–7.
MacDonald JR, Ziman R, Yuen RKC, Feuk L, Scherer SW. The Database of Genomic Variants: a curated collection of structural variation in the human genome. Nucleic Acids Res. 2014;42:986–92.
Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006;38:904–9.
Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155:945–59.
Falush D, Stephens M, Pritchard JK. Inference of population structure using multilocus genotype data: Linked loci and correlated allele frequencies. Genetics. 2003;164:1567–87.
Falush D, Stephens M, Pritchard JK. Inference of population structure using multilocus genotype data: Dominant markers and null alleles. Mol Ecol Notes. 2007;7:574–8.
Hubisz MJ, Falush D, Stephens M, Pritchard JK. Inferring weak population structure with the assistance of sample group information. Mol Ecol Resour. 2009;9:1322–32.
Weir BS, Cockerham CC. Estimating F-statistics for the analysis of population structure. Evolution. 1984;38:1358.
Lou H, Li S, Yang Y, et al. A map of copy number variations in chinese populations. PLoS ONE. 2011;6:e27341 https://doi.org/10.1371/journal.pone.0027341
Felsenstein J. PHYLIP (Phylogeny Inference Package)Version 3.6. Distributed by the author. Department of Genome Sciences, University of Washington, Seattle. Cladistics. 2004;5:164–6.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28:2731–9.
Jha P, Sinha S, Kanchan K, et al. Deletion of the APOBEC3B gene strongly impacts susceptibility to falciparum malaria. Infect Genet Evol. 2012;12:142–8.
Hatin WI, Nur-Shafawati AR, Zahri MK, et al. Population genetic structure of peninsular Malaysia Malay sub-ethnic groups. PLoS ONE. 2011;6:2–6.
Korn JM, Kuruvilla FG, McCarroll SA, et al. Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs. Nat Genet. 2008;40:1253–60.
Huson DH, Scornavacca C. Dendroscope 3: an interactive tool for rooted phylogenetic trees and networks. Syst Biol. 2012;61:1061–7.
Purcell S, Neale B, Todd-Brown K, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–75.
Acknowledgements
We thank the Department of Orang Asli Development (JAKOA) and especially all subjects who voluntarily participated in this study. SX acknowledges financial support from the Strategic Priority Research Program (XDB13040100) and Key Research Program of Frontier Sciences (QYZDJ-SSW-SYS009) of the Chinese Academy of Sciences (CAS), the National Natural Science Foundation of China (NSFC) grant (91331204, 91731303, 31771388, and 31711530221), the National Science Fund for Distinguished Young Scholars (31525014), the National Key Research and Development Program (2016YFC0906403), and the Program of Shanghai Academic Research Leader (16XD1404700). B-PH acknowledges the Chinese Academy of Sciences President’s International Fellowship Initiatives (2017VBA0008) awarded to him. This study is also supported by Ministry of Science, Technology and Innovation (MOSTI) grant erBiotek Grant #100-RM/BIOTEK 16/6/2 B (1/2011) and [100- RMI/GOV 16/6/2 (19/2011)] awarded to B-PH and MEP. SX is Max-Planck Independent Research Group Leader and member of CAS Youth Innovation Promotion Association. SX also gratefully acknowledges the support of the National Program for Top-notch Young Innovative Talents of The “Wanren Jihua” Project. We thank LetPub (www.letpub.com) for providing linguistic assistance during the preparation of this manuscript. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
These authors contributed equally: Ruiqing Fu, Boon-Peng Hoh.
Electronic supplementary material
Rights and permissions
About this article
Cite this article
Fu, R., Mokhtar, S., Phipps, M.E. et al. A genome-wide characterization of copy number variations in native populations of Peninsular Malaysia. Eur J Hum Genet 26, 886–897 (2018). https://doi.org/10.1038/s41431-018-0120-8
Received:
Revised:
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1038/s41431-018-0120-8
This article is cited by
-
Ethnic and functional differentiation of copy number polymorphisms in Tunisian and HapMap population unveils insights on genome organizational plasticity
Scientific Reports (2024)
-
Genome-wide copy number variations in a large cohort of bantu African children
BMC Medical Genomics (2021)
-
A map of copy number variations in the Tunisian population: a valuable tool for medical genomics in North Africa
npj Genomic Medicine (2021)
-
Copy number variation of CCL3L1 among three major ethnic groups in Malaysia
BMC Genetics (2020)
-
Analysis of five deep-sequenced trio-genomes of the Peninsular Malaysia Orang Asli and North Borneo populations
BMC Genomics (2019)


