Abstract
The evolutionary impact of epigenetic variation depends on its transgenerational stability and source - whether genetically determined, environmentally induced, or due to spontaneous, genotype-independent mutations. Here, we evaluate current approaches for investigating an independent role of epigenetics in evolution, pinpointing methodological challenges. We further identify opportunities arising from integrating epigenetic data with population genetic analyses in natural populations. Efforts to advance data quality, study design, and statistical treatment are encouraged to consolidate our understanding of the source of heritable epigenetic variation, quantify its autonomous potential for evolution, and enrich population genetic analyses with an additional layer of information.
Similar content being viewed by others
Evolution, epigenesis, and epigenetic inheritance
Evolutionary biology is concerned with transgenerational change. Prevailing evolutionary theory conceptualizes the underlying process as an interplay between five central parameters acting on genetic variation: mutations of the DNA backbone generate genetic variation, which is depleted by random genetic drift, reorganized by recombination and redistributed by migration (for definitions of italicized terms, see the glossary in Box 1). A subset of genetic variation translates into heritable phenotypic variation1,2 where it is filtered by selection. Over the last decades, progress has been made in understanding the underlying epigenetic mechanisms that orchestrate the construction of phenotypes from information encoded in the genotype3. Examples include non-coding RNAs (lncRNA, piRNA, siRNA, miRNA), structural protein templating (prions), and chromatin modifications (e.g. DNA methylation, histone modification)4. These self-perpetuating somatic inheritance systems can establish somatic molecular memory by controlling “the interaction of genes and their products […] which bring the phenotype into being”5 p. 242.
In evolutionary biology, the focus is often not on epigenetics in the classical Waddingtonian sense (epigenesis) concerned with predictable ontogenetic change from single-celled zygotes to fully fledged, multi-cellular composites (‘genotype-to-phenotype’)5,6,7,8. Instead, we are mainly interested in the trans-generational effects of epigenetic mechanisms, which may themselves serve as a heritable substrate in the germline beyond the information encoded in the DNA blueprint9,10. Epigenetic inheritance has received considerable attention, challenging whether the traditional evolutionary paradigm relying on genetic variation as the source of heritable variation appropriately captures all aspects of evolution (extended evolutionary theory11,12,13). Importantly, only the fraction of epigenetic variation arising independently of the genotype (source), showing genealogical stability (inheritance) and causing phenotypic variation to be filtered by selection (consequences), can make an autonomous contribution to (adaptive) evolution beyond variation encoded in the genotype (orange path, Box 2).
Evidence for the evolutionary relevance of epigenetic variation, particularly 5mC DNA methylation, has been reviewed elsewhere14,15,16. In brief, it seems rather widespread in plants and fungi14,17,18. Research, mostly conducted on genetic models in the lab, provides clear evidence for spontaneous, random epimutations in DNA methylation19,20,21,22,23, and to a lesser degree for environmentally-induced epimutations24. Most importantly, transgenerational inheritance has been reported for both25,26, but seems far from frequent27,28. While there are documented phenotypic consequences of autonomous epigenetic variation29,30, evidence is still limited21. In animals, with a stronger, albeit leaky soma-germline separation31,32,33 and reprogramming34, transgenerational inheritance of autonomous epigenetic variation is expected to be more restricted and harder to prove35,36, although evidence exists37,38,39,40,41,42.
In this Perspective, we explore the potential of integrating population genetic approaches to the study of epigenetic variation. We make a case for natural populations as a valuable source of information, provide a critical reappraisal of current practice, highlight shortcomings, and identify fruitful areas of future research. The target audiences are evolutionary geneticists interested in expanding their research to epigenetic variation, as well as researchers focusing on epigenetic mechanisms in the lab who are looking to tap into natural variation.
Epigenetic research in natural populations
Despite valuable baseline information from lab studies of model species, natural populations are the way forward to understand the role of epigenetic variation in the relevant ecological and evolutionary context43. To what extent do environmental stressors impact epigenetic variation? What proportion of environmentally or spontaneous epialleles are stably inherited in the wild? How does epigenetic variation mediate phenotypic plasticity or parental effects, and can this be maintained over generations when past stressors are no longer present? Do trans-generationally stable spontaneous epigenetic mutations evolve neutrally or can they spread in a population through selection? Is epigenetic variation with evolutionary impact associated with certain genes or found randomly along the genome? What is the role of epigenetic variation in speciation? Most of these questions can only be answered when the source of epigenetic variation is known (Box 2). While allowing for less experimental control, natural systems contain information to get at this question, as they allow us to partition variation across scales: between (related) individuals, populations, species and higher-order taxa, often across a multitude of environmental contrasts. This comparative approach is powerful. It has been successfully exploited to dissect the relationship between genetic and phenotypic variation44,45,46 and can be adapted for the context of trans-generational inheritance47,48,49,50.
Current practice
We compiled information from 52 opportunistically sampled studies to identify current trends and challenges in the field addressing the above questions (Supplementary Data 1). We focused on studies investigating epigenetic variation in the form of 5mC methylation which still constitutes the most universally accessible and cost-effective chromatin modification51,52. We used several criteria to evaluate current standard practice regarding data quality measures, data analysis and statistical treatment, and integration into the theoretical framework. Overall, we feel that the field may have been running ahead too quickly, and should adjust aspects of data quality, analysis and experimental design differing greatly between plants and animals. Studies in plants generally have higher sample sizes, more standardized conditions and homogeneous sample types than animal studies; however, they lack taxonomic diversity in natural populations.
Data quality
In regards to methodological decisions for DNA treatment, sequencing coverage, batch effects, and technical replicates, there appear to be no clear standards in the field, despite the direct impact on data quality (but see53,54 for advice). For instance, the importance of choosing homogeneous tissue unaffected by cell type composition is of central importance and has been largely overlooked. Only 44 % of the studies explicitly considered cell type homogeneity and the vast majority of animal studies used blood, known to respond to immune stress or other external factors55. Blood needs to be evaluated not only for the cell type composition, but also if it is reflective of methylation in other tissues56,57. Technical replication, which is vital to assess measurement error, evaluate performance of lab methods (enzymatic conversion vs. bisulfite treatment) and inform the upper bound of heritability, is rarely used (12% of studies), possibly due to the trade-off with adding more samples to increase statistical power. Finally, for all methods converting cytosine methylation signals to thymine excluding segregating genetic variation at CpG sites (C-T, A-G SNPs) is a simple, yet underused means to improve data quality (13% of applicable studies).
Data analysis
We gathered information on the metrics used to quantify epigenetic variation. While studies use a variety of methods (MS-AFLP, MeDIP), the current standard is 5mC count data at single CpG sites (WGBS, RRBS, 61% of studies). Yet, there appears to be no clear consensus on the choice of data type nor the unit of choice, which will significantly impact downstream analysis (Box 3). A large number of studies analyze the proportion of methylated reads (40% of the studies) rather than raw read counts (12%), which contain more information58. Analyses based on single methylation loci make up 70% of the studies, while the rest integrated CpG-level methylation across regional levels to different extents. This choice is relevant from a statistical view point, since CpG locus count data can be approximated by a (beta)binomial distribution58, whereas integration across larger genomic areas might be better represented by other statistical distributions. We advocate that studies use both single loci and assess the intercorrelation of CpG loci within genomic regions (promoters, introns, exons, etc.) reducing the number of loci for downstream analysis59. Data-informed mechanistic models investigating the intercorrelation between CpG sites in a given genomic context will be important for guiding a biologically meaningful definition of epiphenotype (https://cran.r-project.org/web/packages/MethEvolSIM/)60.
A majority of examined studies (67%) attempted to find differentially methylated positions or regions (DMP or DMR, respectively) between two or more groups (i.e. treatments, populations, species, etc.). A subset of these were then related to gene expression (28% of studies) or were used for an a posteriori gene ontology narrative. However, caution is warranted in interpreting DMP/DMRs in natural populations. Unless explicitly addressed (Box 4), or at least in part controlling for genetic variation (23% of studies), we can only speculate on the source of observed differences. Moreover, samples sizes tend to be prohibitively low. Average sample size per treatment group was 18 (range: 2-336) which is substantially underpowered for detecting true positives, especially in differential methylation analysis (Box 5: simulations following ref. 58). This has important consequences for interpreting most standard analyses ranging from differential methylation to meQTLs which often invoke more complex models requiring even larger sample sizes to avoid overfitting. In the best case, conclusions are biased towards large-effect loci but may also refer to false positives.
In light of these limitations, we strongly recommend researchers conduct a pilot experiment to quantify the empirical distribution of effect sizes that can differ substantially between questions and systems. Power simulations can then provide insight into whether realistic samples sizes can be obtained to address the question at hand. While in most cases sample sizes in the hundreds or thousands will be required to detect small differences between variables of interest, a lack of power does not necessarily invalidate every study with moderate samples sizes. For example, clustering of large-effect loci in regions of elevated genetic divergence can still be informative on the underlying processes shaping epigenetic variation57,61. Moreover, where site-by-site analyses are strongly underpowered for small effect differences, other methods incorporating multivariate statistics58, machine learning62, or averaging across many genes in the genome63 can still extract useful information on the factors shaping epigenetic variation.
Design
To assess the role of epigenetics in evolution, designs assaying transgenerational stability (at least generation F3) of epigenetic variation are crucial but remain extremely limited (only 11% of studies). This is understandable, since evaluating stability involves crossing designs which are not accessible for all study organisms. Leveraging divergent ancestry components (population stratification, hybrid zones) may constitute a viable workaround (see below). In any case, a study design should include reliable quantitative measurements from ecological, genetic and epigenetic data (Box 3). While examples embracing all data types exist64,65,66, only 43% of studies included independent genetic data, even less included individual-level WGS data (11%), which should no longer be an obstacle as individual-level genotypes can be called for single-nucleotide polymorphisms, and increasingly also for more demanding structural variants at rapidly decreasing costs67,68. The omission of genetic data compromises the ability to make inferences on the source and phenotypic impact of epigenetic mechanisms. Clearly, it is most beneficial to include DNA sequencing data from the very same individual and tissue to accommodate somatic mutations where possible.
Decomposing the source of epigenetic variation
Population genetics is concerned with the processes shaping genetic variation across genealogical scales ranging from related individuals to populations and species. The quest for the source of epigenetic variation can benefit from drawing on established methodologies like quantitative genetics, variance decomposition in structured populations or genome scans. We can apply the same principles to (heritable) epigenetic variation and leverage population structure to investigate the processes shaping epigenetic variation. Nevertheless, challenges persist in refining experimental designs and downstream analyses. In the following, we summarize current methodologies and highlight opportunities for theoretical and empirical research to enhance the utility of epigenetic datasets.
Quantitative genetic approaches
Regression-based approaches in quantitative genetics can play a role in decomposing the source of epigenetic variation (Box 4). One major challenge will be to find an appropriate definition for the epiphenotype that contains relevant biological information (Box 3) and reduces the number of pseudo-replication among all possible phenotypes (e.g. all CpG sites) in a genome.
In principle, the heritable component of epigenetic variation that is due to genetic variation can be estimated from epigenetic covariance of related individuals alone without the need for additional genetic data (Eq. 1). Independent genetic data, at base pair resolution, is recommended though, as models benefit from genomic estimates of realized relatedness, rather than pedigree-based estimates. Genetic data can also be used to estimate the additive effects of the genome directly (Eq. 2). This approach clearly faces the challenge of high dimensional data (more markers than individuals), compromising appropriate estimation of effect sizes. A number of primarily Bayesian approaches that assume prior architecture of the trait are in use69, and motivate exploration in the context of epiphenotypes. More commonly, genomic approaches are used to isolate genetic regions/loci that exert a strong effect on the state of the epiphenotype often by making use of crossing schemes (QTL analyses) or by exploiting naturally segregating variation in genome-wide association studies (GWAS). Such approaches are referred to by several names in the context of epigenetic variation (meQTL, MethQTL, epiQTL), but follow the same standard practice. In plants, this approach has been successfully applied to uncover the link between genetic and epigenetic variation at base pair resolution, mainly in A. thaliana70,71,72. They are also applied in human research73,74,75 and are gaining traction in natural populations of other species36,76,77,78. Methylation appears to be linked to genetic variation at varying rates depending on the study organism47,61,64,79,80. Despite successful application of quantitative genetic approaches, small sample sizes result in studies that are underpowered and susceptible to false positive inference due to erroneous SNPs and 5mC methylation calls (Box 5). Where small-scale studies fall short of rational use of limited resources, community attempts may be considered to generate a few large empirical datasets including both epigenetic and genetic data for the same individuals. Alternatively, integrative study designs and methods extracting relevant information from fewer individuals need to be developed alongside existing multivariate approaches (see below, cf. Fig. 158).
Genealogical divergence (e.g. between individuals, families, populations, species) is typically reflected by the amount of difference in (heritable) genetic variation (Δ genetic). Contrasting heterogeneity in genetic and environmental divergence helps in assessing the degree of autonomous epigenetic variation (Box 2). a Three population samples with increasing genealogical distance are depicted (A, B), C). B1 (blue) and B2 (yellow) have the same genetic background, but are subject to different environmental conditions. b Associations between genome-wide genetic and epigenetic variation can be assessed using the correlation of distance metrics for genetic (Δ genetic) and epigenetic (Δ epigenetic) variation among genetic clusters, or through multivariate statistical approaches such as principle component analyses (PCA, not shown). For genetically controlled epigenetic variation, we expect to see a strong correlation between genome-wide metrics of genetic and epigenetic distance regardless of the environment. High correlation of genetic and epigenetic distance matrices between pairs of populations, and not the environment, may be a first indication of genetic control. c) While single-value comparisons per population provide a mean estimate across populations, window-based measures allow capturing variation along the genome. Capitalizing on the heterogeneity of genetic and epigenetic variation along the genome further allows to pinpoint regions of the genome under genetic and/or environmental control. The upper panel shows variation in genetic and epigenetic distance for the population comparison A | B2 at a regional scale along the genome. A point can refer to any genomic window in units of base pairs (or better units of centimorgans) here referred to as a locus. The lower panel shows an example of genetically controlled (left) and environmentally associated (right) epigenetic locus. Δ genetic is shown in blue, Δ epigenetic in red. Box 4 for corresponding statistical approaches.
Instead of using epigenetic variation as a focal phenotype, it can also be used as an explanatory variable to unravel variation in some other measurable trait. Formally, this would mean replacing P.EPI by any phenotype P.X and adding epigenetic variation as an additional explanatory component to Eqs. 1 and 2. Yet, it is challenging to isolate the independent contribution of epigenetic variation in such a situation, and few studies exist81. Importantly, the contribution hinges on the source of epigenetic variation and its stability. Transitory epigenetic variation induced by the environment could simply be subsumed in V(E). If epigenetic variation is fully heritable its effect should be captured by the genetic component V(G). Epigenetic variation will only make an independent, phenotypic contribution if it is environmentally induced or spontaneous and trans-generationally stable. This is an interesting topic and deserves further theoretical and empirical attention.
Multivariate statistics
If we treat epigenetic variation as a multi-dimensional phenotype, then approaches like principal component analysis can integrate signals across the genome and extract useful information, which was carried out in 37% of studies we examined. Such multivariate methods provide a first look at patterns separating populations along the major axes of variation, like drift82 and/or selection dynamics83. This provides a useful starting point for global comparison with population stratification inferred from genetic data. It may even allow for locus-specific comparisons of epigenetic and genetic variation by identifying the loci of both data types with the highest loadings on each PC axis. Given that technical processing can have a large impact on epigenetic data (batch effects)61, we advocate the inclusion of technical replicates to identify and remove loci statistically related to non-target covariates or utilizing intra-experimental controls (e.g. including samples of variable developmental stages to isolate and eliminate age effects). (Partial) redundancy analyses (RDA) provide an additional option to quantify the contribution of genetic and environmental co-variates84, however, choice of co-variates can greatly influence the outcome85. Further investigation and best practice recommendations borrowing from other fields where high dimensional data is common are needed, particularly for the distributional properties of CpG data as well as the treatment of technically induced variation.
Hierarchical population structure
Hierarchical multi-population comparisons constitute another way to decompose genetic, epigenetic, and environmental variance (Fig. 1a). Tests comparing sets of pairwise population matrices provide a means to investigate co-variance between two variables while controlling for a third. For instance, in evolutionary genetics research, the relationship of genetic and ecological distance is often of interest while controlling for the effect of geographic distance86,87. On the analytical side, partial Mantel tests have been a common choice, but multiple matrix regression methods promise more flexibility and appear to be better suited for this purpose88. In principle, such genome-wide queries can be adapted to genome-wide epigenetic distance between populations (Fig. 1b). Following the principle of genome-scans89,90 quantifying epigenetic variation along the genome alongside genetic data provides insight into the degree of local genomic covariation (Fig. 1c). Does high epigenetic differentiation coincide with regions of high genetic differentiation, or is it independent of background genetic signals? Such approaches could also be used to identify regions under environmental or cis-genetic influence. Yet, we are lacking appropriate distance metrics.
Distance metrics
For genetic data, standard measures, such as FST, dXY, dA, and multivariate statistics are readily available and are firmly rooted in population genetic theory with clear interpretation82,90,91,92,93. Yet, owing to a lack of suitable mutation and substitution models for chromatin modifications, we currently have no equivalent metrics for epigenetic data. FST-related distance metrics such as QST seem promising. QST quantifies the additive genetic component of phenotypic variation and, in conjunction with FST, allows for inquiry into the selective or neutral forces underlying phenotypic divergence94. Lacking information on narrow sense trait heritability, PST is often used as an approximation with known limitations and pitfalls95,96. It seems appealing to apply the FST-QST/PST framework to epiphenotypes as it places the variation we see in epigenetic data on the same scale as genetic data and allows to investigate whether epigenetic differentiation proceeds at the same rate as genetic variation (Fig. 1b). Elevated QST/PST suggests accelerated epigenetic divergence relative to genetic drift, which may hint at an environmental influence or selection acting on epigenetic variation. Several examples in natural populations already exist65,79,80,97, mainly using the proportion of methylation at a given loci as the input to calculate PST.
Despite the promise, we need both empirical and theoretical research to accommodate specific features of epigenetic data. Importantly, this includes the definition of genomic scale for which the epiphenotype is considered (Box 3). Can we treat genome-wide epigenetic variation as a single phenotype to be compared to neutral, genome-wide differentiation reflecting genetic drift? Which genomic scale of cis-acting genetic variation is appropriate for comparisons with targeted epigenetic loci (e.g. specific CpG islands, promoter regions, etc.)? Can we develop models, analogous to genetic variation, to identify QST outliers incorporating the effect of demographic change, rather than just picking loci from extreme percentiles? Specifically, which statistical distribution is best suited to estimate the variance components for calculating QST/PST? How do we best incorporate covariance structure across epiphenotypes along the genome98, which will substantially change under cis- or trans-genetic control of methylation? Models and simulations predicting epigenetic change upon population differentiation should be particularly useful to get an intuition about the range of possible QST/PST under different conditions for the source of epigenetic variation (spontaneous, genetic control, environmental or GxE, Box 2). While teasing apart environmentally induced epigenetic variation may be possible, separating spontaneous epimutations from those under genetic control will prove more challenging, in particular under conditions favoring linkage disequilibrium (see below). We see a clear opportunity here to utilize theoretical and simulation approaches to examine the scenarios where this may – or may not – be possible.
Integrating epigenetic variation in population genetic analyses
Characterizing the source of epigenetic variation is crucial to evaluate the autonomous potential of epigenetic variation to evolution. Yet, we also see an opportunity to broadly incorporate epigenetic datasets into population genetic approaches tracing evolutionary processes. Below, we explore potentially promising research directions.
Linkage disequilibrium
It has long been proposed that phenotypic plasticity interacts with genetic evolution99,100. Adaptive plasticity enables individuals to enter novel environments with hitherto unexperienced selection pressures shaping (cryptic) genetic variation101,102. Depending on the conditions, populations in both original and new habitats may coexist, differentiate, or speciate103,104. Environmentally induced epigenetic variation could be one key mechanism mediating the interaction of adaptive plasticity and genetic variation42,105 – with consequences for how we interpret covariation between genetic and epigenetic variation.
Here, we illustrate the possible consequences of this interaction in the context of ecological speciation, where natural selection causes reproductive isolation between environmentally differentiated populations connected by migration. In brief, divergent selection reduces effective migration of alleles from barrier loci under divergent selection between populations. Linkage extends these genomic effects beyond the barrier locus, with the extent depending on selection strength and recombination rate106. As a consequence of this process, we observe statistical associations (linkage disequilibrium) between selected loci and neutral genetic variation both within and between populations, to various extents90,107,108. In principle, this process also applies to epigenetic variation. Assuming antagonistic pleiotropy109, where the alternate epiallelic state is respectively favored in different habitats, adaptive epimutations induced in the parental habitat reduce the fitness of migrants with an immediate effect on gene flow. If the epimutation is transitory, then the selective effects disappear in the next generation, resulting in no linkage with genetic variation. However, if epimutations exhibit some degree of transgenerational inheritance, they can come into linkage with surrounding (neutral) genetic variation or become coupled with other genetic or epigenetic variation subject to divergent selection108.
This influences our interpretation of the association between genetic and epigenetic variation. Thus far, we have treated epigenetic variation as a potentially heritable phenotype encoded by genetic variation. Associations between epigenetic state and genotype then suggest causal cis- or trans-genetic effects (Box 4). However, as illustrated in the previous paragraph, co-variation with genetic loci can also arise through selection and other mechanisms in the form of linkage disequilibrium110. Both theoretical and empirical research are needed to differentiate the processes inducing linkage disequilibrium between genetic and epigenetic variation and causal, additive genetic variation underlying epigenetic variation. Promising theoretical approaches exploring this interaction in an explicit population genetic framework are emerging, and are vital to inform empirical experimental design111,112. A first ad hoc approach to understand the source of linkage disequilibrium could consist of quantifying genetic-epigenetic associations within and among populations along a gene flow gradient under varying selection regimes in both natural and experimental settings.
Hybrid zones
Hybrid zones constitute a promising way of leveraging population divergence not only to decompose environmental and genetic components of epigenetic variation, but also to address the role of autonomous epigenetic variation in speciation. The use of hybrid zones has a longstanding tradition in evolutionary genetic studies113 and has increased our understanding of the processes governing population divergence114. By maximizing phenotypic and genetic variation and generating novel combinations thereof, they are suited to map the genetic basis of phenotypic variation while partially controlling for environmental variation in a geographically confined center of the zone115. For studies of epigenetic variation, hybrid zones confer similar advantages (Fig. 2). In the central part of the hybrid zone, environmental variation is often reduced relative to the parental core habitat, while genetic variation is maximized in the admixed genomes of hybrids, which can be represented as mosaics of ancestry blocks116,117. The genetic variance between ancestry blocks exceeds variation found within populations of a single species. As populations diverge, the number of fixed differences mixing in the hybrids increases our ability to pinpoint the effects of local, cis-genetic variation or genome-wide ancestry (trans-genetic variation). Increased allelic variance in hybrids can be particularly well exploited when considering allele-specific methylation, as this allows quantification of dominance deviation under cis-control, as well as identification of transgressive methylation levels indicative of trans-effects (genome-wide hybrid ancestry). Capitalizing on this idea, cis- and trans-genetic control of epigenetic variation has been quantified across the genome of flycatcher hybrids63.
Hybrid zones allow to concomitantly study epigenetic variation and genetic ancestry and may help pinpointing the source of epigenetic variation (upper two panels). Cline analysis is a powerful tool to infer selection on both genetic and epigenetic variation and compare the two (lower panel). Geographic cline analysis characterizes the change of variation in genetic and epigenetic loci (here DNA methylation) on a spatial axis, whereas genomic cline analyses depict epigenetic divergence as a function of genome-wide ancestry.
Another promising avenue for exploiting information from hybrids may be to adapt geographic and genomic cline theory to epigenetic datasets and explore introgression dynamics of epigenetic and genetic loci116. Let us consider a geographic cline resulting from secondary contact of two populations that have previously diverged in isolation. In the center of the hybrid zone, we assume that the environment is rather homogeneous while genomes are admixed. Genetic variants sharing ancestry within a non-recombined segment of the genome will be characterized by elevated linkage disequilibrium. Epigenetic variants that are subject to cis-control should as a consequence also be in LD with one another and with the surrounding genetic variation. The same holds true for the fraction of spontaneous epigenetic variation that has accumulated differences during population isolation comparable to the surrounding genetic variation. Epigenetic variation under environmental or trans-genetic control should, however, not show this association. The former should show little to no variation, while the latter should reflect genome-wide ancestry (or ancestry at distant genetic loci). Expanding the rationale from the center of the zone to include a full transect of individuals from pure parental populations provides additional information. Genetic variants under divergent selection resist gene flow and are expected to show a steep cline118. Steep clines of single epigenetic loci may similarly reflect selection for trans-generationally stable epigenetic variation or simply reflect an underlying steep environmental gradient. The latter is less likely, while the former must be first separated from epigenetic variation maintained solely through genetic variants. The question of whether selection acted on the epigenetic locus or a genetic locus in LD is, however, difficult to answer, but may benefit from tissue-specific investigation.
This thought experiment illustrates two points: first, hybrid zones may hold key insight into the source of epigenetic variation and selection. Second, things are complicated, as processes can interact in intricate ways. Here, we can outline only the basic idea; theoretical research is needed to fill in detailed predictions. Evolutionary null models retracing the process of population divergence followed by secondary contact varying recombination, and genetic and epigenetic mutation rates could shed first light on the conditions necessary to distinguish the sources of epigenetic variation. Adding selection and coupling across regions of the genome may then help to understand the role of epigenetic variation in speciation. For simplicity, we here considered a geographic cline, but the rationale similarly extends to genomic clines, which hold promise for inferring the selection of epigenetic variation while controlling for genome-wide ancestry.
There are also a number of challenges on the empirical side. It remains to be explored how to best represent the change in epigenetic variation. Treatment as an epigenotype (binary or trinary) allows the calculation of epigenetic ancestry likelihood, analogous to genetic ancestry derived from genotype frequencies along the cline. An estimate of linkage disequilibrium would similarly require treatment of the epigenetic variation as a discrete genotype. Although cline analyses and measures of linkage disequilibrium can be conducted on a single epigenetic locus, analyzes will benefit from phased haplotype data embedding epigenetic information directly into (ancestry-informative) genetic background. With the advent of long-read sequencing data including information on chromatin modification such data can now be obtained. Despite substantial challenges, we see unexplored potential in ‘hybrid zone epigenetics’ and encourage both theoretical and empirical studies to adapt existing statistical approaches116.
Demographic inference
Incorporating epigenetic variation into coalescent methods may be valuable for tracing recent demographic changes (i.e. bottlenecks, range expansion or contraction, or speciation) induced by changing climate or human-inflicted perturbations. CpG sites with clock-like behavior that are due to spontaneous epimutations may carry additional information on genealogies and may help estimate the timing of lineage divergence or reconstruct population demographic histories. Note, that such a ‘genealogical’ clock’ involves trans-generationally stable epimutations and is not to be confounded with an ‘ontogenetic’ clock tracing individual age119. A theoretical framework to predict the frequency distribution (site frequency spectrum) of highly mutable epimutations (mSFS) was developed by Charlesworth and Jain120 paving the way for statistical inference of neutral and selective factors influencing the methylome of A. thaliana121,122,123 and maize124. Going one step further, Sellinger et al.125 incorporated DNA methylation data into full-genome demographic inference. The conceptual novelty here is to infer the coalescent genealogy of a sample by integrating several markers in a genomic region: SNPs, but also additional markers with a higher mutation rate (e.g. 5mC methylation). This method extends the pairwise Sequentially Markovian Coalescent (SMC) models and allows us to infer: (1) the mutation rates of methylation/demethylation if the SNP mutation rate is known, (2) the relative importance of site- and region-level epimutation processes, and (3) the past demographic history of the population. The accuracy and resolution to reconstruct bottlenecks that occurred in the recent past is improved over estimates from genetic data (SNPs) alone125. Approaches leveraging spontaneous epimutations could be applied to resolve important evolutionary events as they can be reliably tracked even within one organism such as long-living trees or clonal organisms126. This relies on identifying neutral methylation sites unaffected by regional methylation and environmental changes. These examples are based on plant research where epigenetic inheritance is well established and are intended to motivate research in other taxa.
Conclusions
The degree to which epigenetic variation makes an independent contribution to evolution is not yet clear. We advocate for research in natural populations complementing controlled experimental research in the lab. However, in order for epigenetic studies in the wild to make a lasting contribution to this question, they need to become more comprehensive by incorporating genetic, environmental, and phenotypic data in a rigorous statistical framework. Otherwise, the influence of the genetic sequence is likely to be underestimated, paving the way for epigenetic storytelling. The practical challenges are numerous and range from technical considerations and experimental decisions to finding meaningful metrics for epigenetic variation and appropriate statistical frameworks for data integration. Despite these challenges, we see the dual potential of this approach. The inclusion of epigenetic variation in population genetic methodology not only helps to assess to which degree epigenetic variation may or may not constitute an autonomous, non-genetic substrate for transgenerational evolutionary change. It also provides an additional axis of information for other questions in population genetics. The strongest approaches are those where we will be able to integrate genetic, epigenetic and environmental datasets with phenotypic measurements to understand the independent and joint impact of the DNA sequence and epigenetic modifications. In order to do this, we need to foster exchange between bench biologists and the empirical and theoretical population geneticists studying the evolutionary processes underlying (epi)genetic variation in natural populations.
References
Charlesworth, B. & Charlesworth, D. Elements of Evolutionary Genetics. (Roberts and Company, 2010).
Charlesworth, B. & Charlesworth, D. Population genetics from 1966 to 2016. Heredity 118, 2–9 (2017).
Allis, C. D. & Jenuwein, T. The molecular hallmarks of epigenetic control. Nat. Rev. Genet. 17, 487–500 (2016).
Li, E. Chromatin modification and epigenetic reprogramming in mammalian development. Nat. Rev. Genet. 3, 662–673 (2002).
Waddington, C. H. The basic ideas of biology. Biol. Theory 3, 238–253 (2008).
Jablonka, E. & Lamb, M. J. The changing concept of epigenetics. Ann. N. Y. Acad. Sci. 981, 82–96 (2002).
Deichmann, U. Epigenetics: The origins and evolution of a fashionable topic. Dev. Biol. 416, 249–254 (2016).
Felsenfeld, G. A brief history of epigenetics. Cold Spring Harb. Perspect. Biol. 6, https://doi.org/10.1016/j.imlet.2022.08.001 (2014).
Heard, E. & Martienssen, R. A. Transgenerational epigenetic inheritance: myths and mechanisms. Cell 157, 95–109 (2014).
Miska, E. A. & Ferguson-Smith, A. C. Transgenerational inheritance: Models and mechanisms of non–DNA sequence–based inheritance. Science 354, 59–63 (2016).
Jablonka, E. & Lamb, M. J. Evolution in Four Dimensions: Genetic, Epigenetic, Behavioral, and Symbolic Variation in the History of Life. x, 462 (MIT Press, Cambridge, MA, US, 2005).
Pigliucci, M. Do we need an extended evolutionary synthesis? Evolution 61, 2743–2749 (2007).
Laland, K. N. et al. The extended evolutionary synthesis: its structure, assumptions and predictions. Proc. R. Soc. B 282, 20151019 (2015).
Jablonka, E. & Raz, G. Transgenerational epigenetic inheritance: prevalence, mechanisms, and implications for the study of heredity and evolution. Q. Rev. Biol. 84, 131–176 (2009).
Richards, C. L. et al. Ecological plant epigenetics: Evidence from model and non-model species, and the way forward. Ecol. Lett. 20, 1576–1590 (2017).
C. L. Richards & M. Pigliucci. Epigenetic Inheritance. A Decade into the Extended Evolutionary Synthesis. Paradigmi 463–494 https://doi.org/10.30460/99624 (2020).
Boquete, M. T., Muyle, A. & Alonso, C. Plant epigenetics: phenotypic and functional diversity beyond the DNA sequence. Am. J. Bot. 108, 553–558 (2021).
Sobral, M. & Sampedro, L. Phenotypic, epigenetic, and fitness diversity within plant genotypes. Trends Plant Sci. 27, 843–846 (2022).
Hazarika, R. R. et al. Molecular properties of epimutation hotspots. Nat. Plants 8, 146–156 (2022).
Hofmeister, B. T. et al. A genome assembly and the somatic genetic and epigenetic mutation rate in a wild long-lived perennial Populus trichocarpa. Genome Biol. 21, 259 (2020).
Johannes, F. & Schmitz, R. J. Spontaneous epimutations in plants. N. Phytol. 221, 1253–1259 (2019).
Schmitz, R. J. et al. Transgenerational epigenetic instability is a source of novel methylation variants. Science 334, 369–373 (2011).
van der Graaf, A. et al. Rate, spectrum, and evolutionary dynamics of spontaneous epimutations. Proc. Natl Acad. Sci. 112, 6676–6681 (2015).
Kou, H. P. et al. Heritable alteration in DNA methylation induced by nitrogen-deficiency stress accompanies enhanced tolerance by progenies to the stress in rice (Oryza sativa L.). J. Plant Physiol. 168, 1685–1693 (2011).
Han, Z. et al. Heritable Epigenomic Changes to the Maize Methylome Resulting from Tissue Culture. Genetics 209, 983–995 (2018).
Whittle, C., Otto, S., Johnston, M. & Krochko, J. Adaptive epigenetic memory of ancestral temperature regime in Arabidopsis thaliana This paper is one of a selection of papers published in a Special Issue from the National Research Council of Canada – Plant Biotechnology Institute. Botany 87, 650–657 (2009).
Boyko, A. et al. Transgenerational adaptation of arabidopsis to stress requires dna methylation and the function of dicer-like proteins. PLOS ONE 5, e9514 (2010).
Johannes, F. et al. Assessing the impact of transgenerational epigenetic variation on complex traits. PLOS Genet 5, e1000530 (2009).
Cubas, P., Vincent, C. & Coen, E. An epigenetic mutation responsible for natural variation in floral symmetry. Nature 401, 157–161 (1999).
Manning, K. et al. A naturally occurring epigenetic mutation in a gene encoding an SBP-box transcription factor inhibits tomato fruit ripening. Nat. Genet. 38, 948–952 (2006).
Daxinger, L. & Whitelaw, E. Understanding transgenerational epigenetic inheritance via the gametes in mammals. Nat. Rev. Genet. 13, 153–162 (2012).
Lane, M., Robker, R. L. & Robertson, S. A. Parenting from before conception. Science 345, 756–760 (2014).
Monaghan, P. & Metcalfe, N. B. The deteriorating soma and the indispensable germline: gamete senescence and offspring fitness. Proc. R. Soc. B Biol. Sci. 286 (2019).
Reik, W., Dean, W. & Walter, J. Epigenetic Reprogramming in Mammalian Development. Science https://doi.org/10.1126/science.1063443 (2001).
Horsthemke, B. A critical view on transgenerational epigenetic inheritance in humans. Nat. Commun. 9, 2973 (2018).
Sepers, B., Chen, R. S., Memelink, M., Verhoeven, K. J. F. & van Oers, K. Variation in DNA methylation in avian nestlings is largely determined by genetic effects. Mol. Biol. Evol. 40, msad086 (2023).
Morgan, H. D., Sutherland, H. G. E., Martin, D. I. K. & Whitelaw, E. Epigenetic inheritance at the agouti locus in the mouse. Nat. Genet. 23, 314–318 (1999).
Greer, E. L. et al. A histone methylation network regulates transgenerational epigenetic memory in C. elegans. Cell Rep. 7, 113–126 (2014).
Kelly, W. G. Transgenerational epigenetics in the germline cycle of Caenorhabditis elegans. Epigenetics Chromatin 7, 6 (2014).
Ciabrelli, F. et al. Stable Polycomb-dependent transgenerational inheritance of chromatin states in Drosophila. Nat. Genet. 49, 876–886 (2017).
Anastasiadi, D., Venney, C. J., Bernatchez, L. & Wellenreuther, M. Epigenetic inheritance and reproductive mode in plants and animals. Trends Ecol. Evol. 36, 1124–1140 (2021).
Feiner, N. et al. Environmentally induced DNA methylation is inherited across generations in an aquatic keystone species. iScience 25, 104303 (2022).
Bossdorf, O., Richards, C. L. & Pigliucci, M. Epigenetics for ecologists. Ecol. Lett. 11, 106–115 (2008).
Vijay, N. et al. Genomewide patterns of variation in genetic diversity are shared among populations, species and higher-order taxa. Mol. Ecol. 26, 4284–4295 (2017).
Vijay, N. et al. Evolution of heterogeneous genome differentiation across multiple contact zones in a crow species complex. Nat. Commun. 7, 13195 (2016).
Meier, J. I. et al. Haplotype tagging reveals parallel formation of hybrid races in two butterfly species. Proc. Natl Acad. Sci. 118, e2015005118 (2021).
McRae, A. F. et al. Contribution of genetic variation to transgenerational inheritance of DNA methylation. Genome Biol. 15, R73 (2014).
Skinner, M. K. Environmental stress and epigenetic transgenerational inheritance. BMC Med 12, 153 (2014).
Weyrich, A. et al. Diet changes alter paternally inherited epigenetic pattern in male Wild guinea pigs. Environ. Epigenetics 4, dvy011 (2016).
Yi, S. V. Insights into epigenome evolution from animal and plant methylomes. Genome Biol. Evol. 9, 3189–3201 (2017).
Hu, J. & Barrett, R. D. H. Epigenetics in natural animal populations. J. Evol. Biol. 30, 1612–1632 (2017).
Sepers, B. et al. Avian ecological epigenetics: pitfalls and promises. J. Ornithol. 160, 1183–1203 (2019).
Laine, V. N. et al. An ecologist’s guide for studying DNA methylation variation in wild vertebrates. Mol. Ecol. Resour. 23, 1488–1508 (2022).
Bock, C. Analysing and interpreting DNA methylation data. Nat. Rev. Genet. 13, 705–719 (2012).
Pickering, A. D. Changes in blood cell composition of the brown trout, Salmo trutta L., during the spawning season. J. Fish. Biol. 29, 335–347 (1986).
Lindner, M. et al. Temporal changes in DNA methylation and RNA expression in a small song bird: within- and between-tissue comparisons. BMC Genomics 22, 36 (2021).
Merondun, J. & Wolf, J. B. W. The contribution of epigenetic variation to evolution in crows. 2024.05.22.595340 Preprint at https://doi.org/10.1101/2024.05.22.595340 (2024).
Lea, A. J., Vilgalys, T. P., Durst, P. A. P. & Tung, J. Maximizing ecological and evolutionary insight in bisulfite sequencing data sets. Nat. Ecol. Evol. 1, 1074–1083 (2017).
McClay, J. L. et al. High density methylation QTL analysis in human blood via next-generation sequencing of the methylated genomic DNA fraction. Genome Biol. 16, 291 (2015).
Grosser, K. & Metzler, D. Modeling methylation dynamics with simultaneous changes in CpG islands. BMC Bioinforma. 21, 115 (2020).
Hu, J. et al. Heritability of DNA methylation in threespine stickleback (Gasterosteus aculeatus). Genetics 217, 1–15 (2021).
Zhang, Y., Wendte, J. M., Ji, L. & Schmitz, R. J. Natural variation in DNA methylation homeostasis and the emergence of epialleles. Proc. Natl Acad. Sci. 117, 4874–4884 (2020).
Boman, J., Qvarnström, A. & Mugal, C. F. Regulatory and evolutionary impact of DNA methylation in two songbird species and their naturally occurring F1 hybrids. BMC Biol. 22, 124 (2024).
Hagmann, J. et al. Century-scale Methylome Stability in a Recently Diverged Arabidopsis thaliana Lineage. PLOS Genet 11, e1004920 (2015).
Liu, S. et al. Genome-wide methylation in the panmictic European eel (Anguilla anguilla). Mol. Ecol. 31, 4286–4306 (2022).
de Carvalho, C. F. et al. DNA methylation differences between stick insect ecotypes. Mol. Ecol. 32, 6809–6823 (2023).
Liao, W.-W. et al. A draft human pangenome reference. Nature 617, 312–324 (2023).
Weissensteiner, M. H. et al. Discovery and population genomics of structural variation in a songbird genus. Nat. Commun. 11, 3403 (2020).
Gianola, D. Priors in whole-genome regression: the bayesian alphabet returns. Genetics 194, 573–596 (2013).
Dubin, M. J. et al. DNA methylation in Arabidopsis has a genetic basis and shows evidence of local adaptation. eLife 4, e05255 (2015).
Kawakatsu, T. et al. Epigenomic diversity in a global collection of arabidopsis thaliana accessions. Cell 166, 492–505 (2016).
Sasaki, E., Kawakatsu, T., Ecker, J. R. & Nordborg, M. Common alleles of CMT2 and NRPE1 are major determinants of CHH methylation variation in Arabidopsis thaliana. PLOS Genet 15, e1008492 (2019).
Shi, J. et al. Characterizing the genetic basis of methylome diversity in histologically normal human lung tissue. Nat. Commun. 5, 3365 (2014).
Villicaña, S. & Bell, J. T. Genetic impacts on DNA methylation: research findings and future perspectives. Genome Biol. 22, 127 (2021).
Hawe, J. S. et al. Genetic variation influencing DNA methylation provides insights into molecular mechanisms regulating genomic function. Nat. Genet. 54, 18–29 (2022).
Gawra, J. et al. Epigenetic variations are more substantial than genetic variations in rapid adaptation of oyster to Pacific oyster mortality syndrome. Sci. Adv. 9, eadh8990 (2023).
Gugger, P. F., Fitz-Gibbon, S., PellEgrini, M. & Sork, V. L. Species-wide patterns of DNA methylation variation in Quercus lobata and their association with climate gradients. Mol. Ecol. 25, 1665–1680 (2016).
van Moorsel, S. J. et al. Evidence for rapid evolution in a grassland biodiversity experiment. Mol. Ecol. 28, 4097–4117 (2019).
Carja, O. et al. Worldwide patterns of human epigenetic variation. Nat. Ecol. Evol. 1, 1577–1583 (2017).
Silliman, K., Spencer, L. H., White, S. J. & Roberts, S. B. Epigenetic and genetic population structure is coupled in a marine invertebrate. Genome Biol. Evol. 15, evad013 (2023).
Tal, O., Kisdi, E. & Jablonka, E. Epigenetic Contribution to Covariance Between Relatives. Genetics 184, 1037–1050 (2010).
Patterson, N., Price, A. L. & Reich, D. Population Structure and Eigenanalysis. PLoS Genet 2, e190 (2006).
Tusso, S., Nieuwenhuis, B. P. S., Weissensteiner, B., Immler, S. & Wolf, J. B. W. Experimental evolution of adaptive divergence under varying degrees of gene flow. Nat. Ecol. Evol. 5, 338–349 (2021).
Legendre, P. & Legendre, L. Numerical Ecology. (Elsevier, 2012).
Capblancq, T. & Forester, B. R. Redundancy analysis: A Swiss Army Knife for landscape genomics. Methods Ecol. Evol. 12, 2298–2309 (2021).
Meirmans, P. G. The trouble with isolation by distance. Mol. Ecol. 21, 2839–2846 (2012).
Shafer, A. B. A. & Wolf, J. B. W. Widespread evidence for incipient ecological speciation: a meta-analysis of isolation-by-ecology. Ecol. Lett. 16, 940–950 (2013).
Wang, I. J. & Bradburd, G. S. Isolation by environment. Mol. Ecol. 23, 5649–5662 (2014).
Lewontin, R. C. & Krakauer, J. Distribution of gene frequency as a test of the theory of the selective neutrality of polymorphisms. Genetics 74, 175–195 (1973).
Wolf, J. B. W. & Ellegren, H. Making sense of genomic islands of differentiation in light of speciation. Nat. Rev. Genet. 18, 87–100 (2017).
Meirmans, P. G. & Hedrick, P. W. Assessing population structure: FST and related measures. Mol. Ecol. Resour. 11, 5–18 (2011).
Cruickshank, T. E. & Hahn, M. W. Reanalysis suggests that genomic islands of speciation are due to reduced diversity, not reduced gene flow. Mol. Ecol. 23, 3133–3157 (2014).
Duforet-Frebourg, N., Luu, K., Laval, G., Bazin, E. & Blum, M. G. B. Detecting genomic signatures of natural selection with principal component analysis: application to the 1000 genomes data. Mol. Biol. Evol. 33, 1082–1093 (2016).
Merilä, J. & Crnokrak, P. Comparison of genetic differentiation at marker loci and quantitative traits. J. Evol. Biol. 14, 892–903 (2001).
Brommer, J. E. Whither Pst? The approximation of Qst by Pst in evolutionary and conservation biology. J. Evol. Biol. 24, 1160–1168 (2011).
Pujol, B., Wilson, A. J., Ross, R. I. C. & Pannell, J. R. Are QST–FST comparisons for natural populations meaningful? Mol. Ecol. 17, 4782–4785 (2008).
Johnson, K. M. & Kelly, M. W. Population epigenetic divergence exceeds genetic divergence in the Eastern oyster Crassostrea virginica in the Northern Gulf of Mexico. Evol. Appl. 13, 945–959 (2020).
Chenoweth, S. F. & Blows, M. W. Qst meets the G matrix: the dimensionality of adaptive divergence in multiple correlated quantitative tratis. Evolution 62, 1437–1449 (2008).
Baldwin, J. M. A new factor in evolution. Am. Nat. 30, 441–451 (1896).
Waddington, C. H. Genetic assimilation of an acquired character. Evolution 7, 118–126 (1953).
Price, T. D., Quarnström, A. & Irwin, D. E. The role of phenotypic plasticity in driving genetic evolution. Proc. R. Soc. B-Biol. Sci. 270, 1433–1440 (2003).
Paaby, A. B. & Rockman, M. V. Cryptic genetic variation: evolution’s hidden substrate. Nat. Rev. Genet. 15, 247–258 (2014).
Pfennig, D. W. et al. Phenotypic plasticity’s impacts on diversification and speciation. Trends Ecol. Evol. 25, 459–467 (2010).
Thibert-Plante, X. & Hendry, A. P. The consequences of phenotypic plasticity for ecological speciation. J. Evol. Biol. 24, 326–342 (2011).
Herman, J. J. & Sultan, S. E. DNA methylation mediates genetic variation for adaptive transgenerational plasticity. Proc. R. Soc. B Biol. Sci. 283, 20160988 (2016).
Barton, N. & Bengtsson, B. O. The barrier to genetic exchange between hybridising populations. Heredity 57, 357–376 (1986).
Ravinet, M. et al. Interpreting the genomic landscape of speciation: a road map for finding barriers to gene flow. J. Evol. Biol. 30, 1450–1477 (2017).
Dopman, E. B., Shaw, K. L., Servedio, M. R., Butlin, R. K. & Smadja, C. M. Coupling of Barriers to Gene Exchange: Causes and Consequences. Cold Spring Harb. Perspect. Biol. a041432 https://doi.org/10.1101/cshperspect.a041432 (2024).
Weng, X. et al. A Pleiotropic Flowering Time QTL Exhibits Gene-by-Environment Interaction for Fitness in a Perennial Grass. Mol. Biol. Evol. 39, msac203 (2022).
Baird, S. J. E. Exploring linkage disequilibrium. Mol. Ecol. Resour. 15, 1017–1019 (2015).
Greenspoon, P. B., Spencer, H. G. & M’Gonigle, L. K. Epigenetic induction may speed up or slow down speciation with gene flow. Evolution 76, 1170–1182 (2022).
Planidin, N. P., de Carvalho, C. F., Feder, J. L., Gompert, Z. & Nosil, P. Epigenetics and reproductive isolation: a commentary on Westram et al., 2022. J. Evol. Biol. 35, 1188–1194 (2022).
Bigelow, R. S. Hybrid zones and reproductive isolation. Evolution 19, 449–458 (1965).
Harrison, R. G. Hybrid Zones and the Evolutionary Process. (Oxford University Press, 1993).
Knief, U. et al. Epistatic mutations under divergent selection govern phenotypic variation in the crow hybrid zone. Nat. Ecol. Evol. 3, 570–576 (2019).
Gompert, Z., Mandeville, E. G. & Buerkle, C. A. Analysis of population genomic data from hybrid zones. Annu. Rev. Ecol. Evol. Syst. 48, 207–229 (2017).
Lawson, D. J., Hellenthal, G., Myers, S. & Falush, D. Inference of population structure using dense haplotype data. PLoS Genet 8, e1002453 (2012).
Haldane, J. B. S. The theory of a cline. J. Genet. 48, 277–284 (1948).
Horvath, S. DNA methylation age of human tissues and cell types. Genome Biol. 14, 1–20 (2013).
Charlesworth, B. & Jain, K. Purifying selection, drift, and reversible mutation with arbitrarily high mutation rates. Genetics 198, 1587–1602 (2014).
Vidalis, A. et al. Methylome evolution in plants. Genome Biol. 17, 1–14 (2016).
Muyle, A., Ross-Ibarra, J., Seymour, D. K. & Gaut, B. S. Gene body methylation is under selection in Arabidopsis thaliana. Genetics 218, iyab061 (2021).
Yao, N., Schmitz, R. J. & Johannes, F. Epimutations define a fast-ticking molecular clock in plants. Trends Genet 37, 699–710 (2021).
Xu, G. et al. Evolutionary and functional genomics of DNA methylation in maize domestication and improvement. Nat. Commun. 11, 5539 (2020).
Sellinger, T., Johannes, F. & Tellier, A. Improved inference of population histories by integrating genomic and epigenomic data. eLife 12 (2023).
Yao, N. et al. An evolutionary epigenetic clock in plants. Science 381, 1440–1445 (2023).
Deichmann, U. Chromatin: Its history, current research, and the seminal researchers and their philosophy. Perspect. Biol. Med. 58, 143–164 (2015).
Greenberg, M. V. C. & Bourc’his, D. The diverse roles of DNA methylation in mammalian development and disease. Nat. Rev. Mol. Cell Biol. 20, 590–607 (2019).
Doerfler, W. DNA methylation and gene activity. Annu. Rev. Biochem. 52, 93–124 (1983).
Ashe, A., Colot, V. & Oldroyd, B. P. How does epigenetics influence the course of evolution? Philos Trans R Soc Lond B Biol Sci. 376, 20200111 (2021).
Angers, B., Perez, M., Menicucci, T. & Leung, C. Sources of epigenetic variation and their applications in natural populations. Evol. Appl. 13, 1262–1278 (2020).
Greally, J. M. A user’s guide to the ambiguous word ‘epigenetics. Nat. Rev. Mol. Cell Biol. 19, 207–208 (2018).
Taudt, A., Colomé-Tatché, M. & Johannes, F. Genetic sources of population epigenomic variation. Nat. Rev. Genet. 17, 319–332 (2016).
Shea, N., Pen, I. & Uller, T. Three epigenetic information channels and their different roles in evolution. J. Evol. Biol. 24, 1178–1187 (2011).
Dobzhansky, T. Genetics and the Origin of Species. (Columbia University Press, New York, 1951).
Felsenstein, J. Theoretical Evolutionary Genetics. (Online, 2019).
Falconer, D. S. Introduction to Quantitative Genetics. (Longman, Harlow, 1995).
Lynch, M. & Walsh, B. Genetics and Analysis of Quantitative Traits. (Sinauer, Sunderland, Mass, 1998).
Slatkin, M. Epigenetic inheritance and the missing heritability problem. Genetics 182, 845–850 (2009).
Acknowledgements
We would to thank the members of the Wolf lab for comments on an earlier draft, and Zachariah Gompert for insightful comments on the hybrid zone section. Funding was provided by LMU Munich and the Max-Planck-Society (to JW), the German Research Foundation (DfG WO 1426/2-1 to JW and MU 5038/2-1 to SM).
Author information
Authors and Affiliations
Contributions
S.M., J.M., and J.W. conceived of the main ideas with valuable input from S.L. S.M. spearheaded the literature search. S.M. and J.W. wrote the manuscript with help from J.M. and S.L.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Mueller, S.A., Merondun, J., Lečić, S. et al. Epigenetic variation in light of population genetic practice. Nat Commun 16, 1028 (2025). https://doi.org/10.1038/s41467-025-55989-6
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41467-025-55989-6
This article is cited by
-
Gene regulation by non-Coding RNAs in infertility: a mechanistic review
Journal of Ovarian Research (2025)
-
Biogeographic Ancestry and Socioeconomic Status: A Meta-Analysis of Epidemiological Findings
Evolutionary Psychological Science (2025)










