Evolution, epigenesis, and epigenetic inheritance

Evolutionary biology is concerned with transgenerational change. Prevailing evolutionary theory conceptualizes the underlying process as an interplay between five central parameters acting on genetic variation: mutations of the DNA backbone generate genetic variation, which is depleted by random genetic drift, reorganized by recombination and redistributed by migration (for definitions of italicized terms, see the glossary in Box 1). A subset of genetic variation translates into heritable phenotypic variation1,2 where it is filtered by selection. Over the last decades, progress has been made in understanding the underlying epigenetic mechanisms that orchestrate the construction of phenotypes from information encoded in the genotype3. Examples include non-coding RNAs (lncRNA, piRNA, siRNA, miRNA), structural protein templating (prions), and chromatin modifications (e.g. DNA methylation, histone modification)4. These self-perpetuating somatic inheritance systems can establish somatic molecular memory by controlling “the interaction of genes and their products […] which bring the phenotype into being”5 p. 242.

In evolutionary biology, the focus is often not on epigenetics in the classical Waddingtonian sense (epigenesis) concerned with predictable ontogenetic change from single-celled zygotes to fully fledged, multi-cellular composites (‘genotype-to-phenotype’)5,6,7,8. Instead, we are mainly interested in the trans-generational effects of epigenetic mechanisms, which may themselves serve as a heritable substrate in the germline beyond the information encoded in the DNA blueprint9,10. Epigenetic inheritance has received considerable attention, challenging whether the traditional evolutionary paradigm relying on genetic variation as the source of heritable variation appropriately captures all aspects of evolution (extended evolutionary theory11,12,13). Importantly, only the fraction of epigenetic variation arising independently of the genotype (source), showing genealogical stability (inheritance) and causing phenotypic variation to be filtered by selection (consequences), can make an autonomous contribution to (adaptive) evolution beyond variation encoded in the genotype (orange path, Box 2).

Evidence for the evolutionary relevance of epigenetic variation, particularly 5mC DNA methylation, has been reviewed elsewhere14,15,16. In brief, it seems rather widespread in plants and fungi14,17,18. Research, mostly conducted on genetic models in the lab, provides clear evidence for spontaneous, random epimutations in DNA methylation19,20,21,22,23, and to a lesser degree for environmentally-induced epimutations24. Most importantly, transgenerational inheritance has been reported for both25,26, but seems far from frequent27,28. While there are documented phenotypic consequences of autonomous epigenetic variation29,30, evidence is still limited21. In animals, with a stronger, albeit leaky soma-germline separation31,32,33 and reprogramming34, transgenerational inheritance of autonomous epigenetic variation is expected to be more restricted and harder to prove35,36, although evidence exists37,38,39,40,41,42.

In this Perspective, we explore the potential of integrating population genetic approaches to the study of epigenetic variation. We make a case for natural populations as a valuable source of information, provide a critical reappraisal of current practice, highlight shortcomings, and identify fruitful areas of future research. The target audiences are evolutionary geneticists interested in expanding their research to epigenetic variation, as well as researchers focusing on epigenetic mechanisms in the lab who are looking to tap into natural variation.

Epigenetic research in natural populations

Despite valuable baseline information from lab studies of model species, natural populations are the way forward to understand the role of epigenetic variation in the relevant ecological and evolutionary context43. To what extent do environmental stressors impact epigenetic variation? What proportion of environmentally or spontaneous epialleles are stably inherited in the wild? How does epigenetic variation mediate phenotypic plasticity or parental effects, and can this be maintained over generations when past stressors are no longer present? Do trans-generationally stable spontaneous epigenetic mutations evolve neutrally or can they spread in a population through selection? Is epigenetic variation with evolutionary impact associated with certain genes or found randomly along the genome? What is the role of epigenetic variation in speciation? Most of these questions can only be answered when the source of epigenetic variation is known (Box 2). While allowing for less experimental control, natural systems contain information to get at this question, as they allow us to partition variation across scales: between (related) individuals, populations, species and higher-order taxa, often across a multitude of environmental contrasts. This comparative approach is powerful. It has been successfully exploited to dissect the relationship between genetic and phenotypic variation44,45,46 and can be adapted for the context of trans-generational inheritance47,48,49,50.

Current practice

We compiled information from 52 opportunistically sampled studies to identify current trends and challenges in the field addressing the above questions (Supplementary Data 1). We focused on studies investigating epigenetic variation in the form of 5mC methylation which still constitutes the most universally accessible and cost-effective chromatin modification51,52. We used several criteria to evaluate current standard practice regarding data quality measures, data analysis and statistical treatment, and integration into the theoretical framework. Overall, we feel that the field may have been running ahead too quickly, and should adjust aspects of data quality, analysis and experimental design differing greatly between plants and animals. Studies in plants generally have higher sample sizes, more standardized conditions and homogeneous sample types than animal studies; however, they lack taxonomic diversity in natural populations.

Data quality

In regards to methodological decisions for DNA treatment, sequencing coverage, batch effects, and technical replicates, there appear to be no clear standards in the field, despite the direct impact on data quality (but see53,54 for advice). For instance, the importance of choosing homogeneous tissue unaffected by cell type composition is of central importance and has been largely overlooked. Only 44 % of the studies explicitly considered cell type homogeneity and the vast majority of animal studies used blood, known to respond to immune stress or other external factors55. Blood needs to be evaluated not only for the cell type composition, but also if it is reflective of methylation in other tissues56,57. Technical replication, which is vital to assess measurement error, evaluate performance of lab methods (enzymatic conversion vs. bisulfite treatment) and inform the upper bound of heritability, is rarely used (12% of studies), possibly due to the trade-off with adding more samples to increase statistical power. Finally, for all methods converting cytosine methylation signals to thymine excluding segregating genetic variation at CpG sites (C-T, A-G SNPs) is a simple, yet underused means to improve data quality (13% of applicable studies).

Data analysis

We gathered information on the metrics used to quantify epigenetic variation. While studies use a variety of methods (MS-AFLP, MeDIP), the current standard is 5mC count data at single CpG sites (WGBS, RRBS, 61% of studies). Yet, there appears to be no clear consensus on the choice of data type nor the unit of choice, which will significantly impact downstream analysis (Box 3). A large number of studies analyze the proportion of methylated reads (40% of the studies) rather than raw read counts (12%), which contain more information58. Analyses based on single methylation loci make up 70% of the studies, while the rest integrated CpG-level methylation across regional levels to different extents. This choice is relevant from a statistical view point, since CpG locus count data can be approximated by a (beta)binomial distribution58, whereas integration across larger genomic areas might be better represented by other statistical distributions. We advocate that studies use both single loci and assess the intercorrelation of CpG loci within genomic regions (promoters, introns, exons, etc.) reducing the number of loci for downstream analysis59. Data-informed mechanistic models investigating the intercorrelation between CpG sites in a given genomic context will be important for guiding a biologically meaningful definition of epiphenotype (https://cran.r-project.org/web/packages/MethEvolSIM/)60.

A majority of examined studies (67%) attempted to find differentially methylated positions or regions (DMP or DMR, respectively) between two or more groups (i.e. treatments, populations, species, etc.). A subset of these were then related to gene expression (28% of studies) or were used for an a posteriori gene ontology narrative. However, caution is warranted in interpreting DMP/DMRs in natural populations. Unless explicitly addressed (Box 4), or at least in part controlling for genetic variation (23% of studies), we can only speculate on the source of observed differences. Moreover, samples sizes tend to be prohibitively low. Average sample size per treatment group was 18 (range: 2-336) which is substantially underpowered for detecting true positives, especially in differential methylation analysis (Box 5: simulations following ref. 58). This has important consequences for interpreting most standard analyses ranging from differential methylation to meQTLs which often invoke more complex models requiring even larger sample sizes to avoid overfitting. In the best case, conclusions are biased towards large-effect loci but may also refer to false positives.

In light of these limitations, we strongly recommend researchers conduct a pilot experiment to quantify the empirical distribution of effect sizes that can differ substantially between questions and systems. Power simulations can then provide insight into whether realistic samples sizes can be obtained to address the question at hand. While in most cases sample sizes in the hundreds or thousands will be required to detect small differences between variables of interest, a lack of power does not necessarily invalidate every study with moderate samples sizes. For example, clustering of large-effect loci in regions of elevated genetic divergence can still be informative on the underlying processes shaping epigenetic variation57,61. Moreover, where site-by-site analyses are strongly underpowered for small effect differences, other methods incorporating multivariate statistics58, machine learning62, or averaging across many genes in the genome63 can still extract useful information on the factors shaping epigenetic variation.

Design

To assess the role of epigenetics in evolution, designs assaying transgenerational stability (at least generation F3) of epigenetic variation are crucial but remain extremely limited (only 11% of studies). This is understandable, since evaluating stability involves crossing designs which are not accessible for all study organisms. Leveraging divergent ancestry components (population stratification, hybrid zones) may constitute a viable workaround (see below). In any case, a study design should include reliable quantitative measurements from ecological, genetic and epigenetic data (Box 3). While examples embracing all data types exist64,65,66, only 43% of studies included independent genetic data, even less included individual-level WGS data (11%), which should no longer be an obstacle as individual-level genotypes can be called for single-nucleotide polymorphisms, and increasingly also for more demanding structural variants at rapidly decreasing costs67,68. The omission of genetic data compromises the ability to make inferences on the source and phenotypic impact of epigenetic mechanisms. Clearly, it is most beneficial to include DNA sequencing data from the very same individual and tissue to accommodate somatic mutations where possible.

Decomposing the source of epigenetic variation

Population genetics is concerned with the processes shaping genetic variation across genealogical scales ranging from related individuals to populations and species. The quest for the source of epigenetic variation can benefit from drawing on established methodologies like quantitative genetics, variance decomposition in structured populations or genome scans. We can apply the same principles to (heritable) epigenetic variation and leverage population structure to investigate the processes shaping epigenetic variation. Nevertheless, challenges persist in refining experimental designs and downstream analyses. In the following, we summarize current methodologies and highlight opportunities for theoretical and empirical research to enhance the utility of epigenetic datasets.

Quantitative genetic approaches

Regression-based approaches in quantitative genetics can play a role in decomposing the source of epigenetic variation (Box 4). One major challenge will be to find an appropriate definition for the epiphenotype that contains relevant biological information (Box 3) and reduces the number of pseudo-replication among all possible phenotypes (e.g. all CpG sites) in a genome.

In principle, the heritable component of epigenetic variation that is due to genetic variation can be estimated from epigenetic covariance of related individuals alone without the need for additional genetic data (Eq. 1). Independent genetic data, at base pair resolution, is recommended though, as models benefit from genomic estimates of realized relatedness, rather than pedigree-based estimates. Genetic data can also be used to estimate the additive effects of the genome directly (Eq. 2). This approach clearly faces the challenge of high dimensional data (more markers than individuals), compromising appropriate estimation of effect sizes. A number of primarily Bayesian approaches that assume prior architecture of the trait are in use69, and motivate exploration in the context of epiphenotypes. More commonly, genomic approaches are used to isolate genetic regions/loci that exert a strong effect on the state of the epiphenotype often by making use of crossing schemes (QTL analyses) or by exploiting naturally segregating variation in genome-wide association studies (GWAS). Such approaches are referred to by several names in the context of epigenetic variation (meQTL, MethQTL, epiQTL), but follow the same standard practice. In plants, this approach has been successfully applied to uncover the link between genetic and epigenetic variation at base pair resolution, mainly in A. thaliana70,71,72. They are also applied in human research73,74,75 and are gaining traction in natural populations of other species36,76,77,78. Methylation appears to be linked to genetic variation at varying rates depending on the study organism47,61,64,79,80. Despite successful application of quantitative genetic approaches, small sample sizes result in studies that are underpowered and susceptible to false positive inference due to erroneous SNPs and 5mC methylation calls (Box 5). Where small-scale studies fall short of rational use of limited resources, community attempts may be considered to generate a few large empirical datasets including both epigenetic and genetic data for the same individuals. Alternatively, integrative study designs and methods extracting relevant information from fewer individuals need to be developed alongside existing multivariate approaches (see below, cf. Fig. 158).

Fig. 1: Making use of population divergence to analyze the source of epigenetic variation.
figure 1

Genealogical divergence (e.g. between individuals, families, populations, species) is typically reflected by the amount of difference in (heritable) genetic variation (Δ genetic). Contrasting heterogeneity in genetic and environmental divergence helps in assessing the degree of autonomous epigenetic variation (Box 2). a Three population samples with increasing genealogical distance are depicted (A, B), C). B1 (blue) and B2 (yellow) have the same genetic background, but are subject to different environmental conditions. b Associations between genome-wide genetic and epigenetic variation can be assessed using the correlation of distance metrics for genetic (Δ genetic) and epigenetic (Δ epigenetic) variation among genetic clusters, or through multivariate statistical approaches such as principle component analyses (PCA, not shown). For genetically controlled epigenetic variation, we expect to see a strong correlation between genome-wide metrics of genetic and epigenetic distance regardless of the environment. High correlation of genetic and epigenetic distance matrices between pairs of populations, and not the environment, may be a first indication of genetic control. c) While single-value comparisons per population provide a mean estimate across populations, window-based measures allow capturing variation along the genome. Capitalizing on the heterogeneity of genetic and epigenetic variation along the genome further allows to pinpoint regions of the genome under genetic and/or environmental control. The upper panel shows variation in genetic and epigenetic distance for the population comparison A | B2 at a regional scale along the genome. A point can refer to any genomic window in units of base pairs (or better units of centimorgans) here referred to as a locus. The lower panel shows an example of genetically controlled (left) and environmentally associated (right) epigenetic locus. Δ genetic is shown in blue, Δ epigenetic in red. Box 4 for corresponding statistical approaches.

Instead of using epigenetic variation as a focal phenotype, it can also be used as an explanatory variable to unravel variation in some other measurable trait. Formally, this would mean replacing P.EPI by any phenotype P.X and adding epigenetic variation as an additional explanatory component to Eqs. 1 and 2. Yet, it is challenging to isolate the independent contribution of epigenetic variation in such a situation, and few studies exist81. Importantly, the contribution hinges on the source of epigenetic variation and its stability. Transitory epigenetic variation induced by the environment could simply be subsumed in V(E). If epigenetic variation is fully heritable its effect should be captured by the genetic component V(G). Epigenetic variation will only make an independent, phenotypic contribution if it is environmentally induced or spontaneous and trans-generationally stable. This is an interesting topic and deserves further theoretical and empirical attention.

Multivariate statistics

If we treat epigenetic variation as a multi-dimensional phenotype, then approaches like principal component analysis can integrate signals across the genome and extract useful information, which was carried out in 37% of studies we examined. Such multivariate methods provide a first look at patterns separating populations along the major axes of variation, like drift82 and/or selection dynamics83. This provides a useful starting point for global comparison with population stratification inferred from genetic data. It may even allow for locus-specific comparisons of epigenetic and genetic variation by identifying the loci of both data types with the highest loadings on each PC axis. Given that technical processing can have a large impact on epigenetic data (batch effects)61, we advocate the inclusion of technical replicates to identify and remove loci statistically related to non-target covariates or utilizing intra-experimental controls (e.g. including samples of variable developmental stages to isolate and eliminate age effects). (Partial) redundancy analyses (RDA) provide an additional option to quantify the contribution of genetic and environmental co-variates84, however, choice of co-variates can greatly influence the outcome85. Further investigation and best practice recommendations borrowing from other fields where high dimensional data is common are needed, particularly for the distributional properties of CpG data as well as the treatment of technically induced variation.

Hierarchical population structure

Hierarchical multi-population comparisons constitute another way to decompose genetic, epigenetic, and environmental variance (Fig. 1a). Tests comparing sets of pairwise population matrices provide a means to investigate co-variance between two variables while controlling for a third. For instance, in evolutionary genetics research, the relationship of genetic and ecological distance is often of interest while controlling for the effect of geographic distance86,87. On the analytical side, partial Mantel tests have been a common choice, but multiple matrix regression methods promise more flexibility and appear to be better suited for this purpose88. In principle, such genome-wide queries can be adapted to genome-wide epigenetic distance between populations (Fig. 1b). Following the principle of genome-scans89,90 quantifying epigenetic variation along the genome alongside genetic data provides insight into the degree of local genomic covariation (Fig. 1c). Does high epigenetic differentiation coincide with regions of high genetic differentiation, or is it independent of background genetic signals? Such approaches could also be used to identify regions under environmental or cis-genetic influence. Yet, we are lacking appropriate distance metrics.

Distance metrics

For genetic data, standard measures, such as FST, dXY, dA, and multivariate statistics are readily available and are firmly rooted in population genetic theory with clear interpretation82,90,91,92,93. Yet, owing to a lack of suitable mutation and substitution models for chromatin modifications, we currently have no equivalent metrics for epigenetic data. FST-related distance metrics such as QST seem promising. QST quantifies the additive genetic component of phenotypic variation and, in conjunction with FST, allows for inquiry into the selective or neutral forces underlying phenotypic divergence94. Lacking information on narrow sense trait heritability, PST is often used as an approximation with known limitations and pitfalls95,96. It seems appealing to apply the FST-QST/PST framework to epiphenotypes as it places the variation we see in epigenetic data on the same scale as genetic data and allows to investigate whether epigenetic differentiation proceeds at the same rate as genetic variation (Fig. 1b). Elevated QST/PST suggests accelerated epigenetic divergence relative to genetic drift, which may hint at an environmental influence or selection acting on epigenetic variation. Several examples in natural populations already exist65,79,80,97, mainly using the proportion of methylation at a given loci as the input to calculate PST.

Despite the promise, we need both empirical and theoretical research to accommodate specific features of epigenetic data. Importantly, this includes the definition of genomic scale for which the epiphenotype is considered (Box 3). Can we treat genome-wide epigenetic variation as a single phenotype to be compared to neutral, genome-wide differentiation reflecting genetic drift? Which genomic scale of cis-acting genetic variation is appropriate for comparisons with targeted epigenetic loci (e.g. specific CpG islands, promoter regions, etc.)? Can we develop models, analogous to genetic variation, to identify QST outliers incorporating the effect of demographic change, rather than just picking loci from extreme percentiles? Specifically, which statistical distribution is best suited to estimate the variance components for calculating QST/PST? How do we best incorporate covariance structure across epiphenotypes along the genome98, which will substantially change under cis- or trans-genetic control of methylation? Models and simulations predicting epigenetic change upon population differentiation should be particularly useful to get an intuition about the range of possible QST/PST under different conditions for the source of epigenetic variation (spontaneous, genetic control, environmental or GxE, Box 2). While teasing apart environmentally induced epigenetic variation may be possible, separating spontaneous epimutations from those under genetic control will prove more challenging, in particular under conditions favoring linkage disequilibrium (see below). We see a clear opportunity here to utilize theoretical and simulation approaches to examine the scenarios where this may – or may not – be possible.

Integrating epigenetic variation in population genetic analyses

Characterizing the source of epigenetic variation is crucial to evaluate the autonomous potential of epigenetic variation to evolution. Yet, we also see an opportunity to broadly incorporate epigenetic datasets into population genetic approaches tracing evolutionary processes. Below, we explore potentially promising research directions.

Linkage disequilibrium

It has long been proposed that phenotypic plasticity interacts with genetic evolution99,100. Adaptive plasticity enables individuals to enter novel environments with hitherto unexperienced selection pressures shaping (cryptic) genetic variation101,102. Depending on the conditions, populations in both original and new habitats may coexist, differentiate, or speciate103,104. Environmentally induced epigenetic variation could be one key mechanism mediating the interaction of adaptive plasticity and genetic variation42,105 – with consequences for how we interpret covariation between genetic and epigenetic variation.

Here, we illustrate the possible consequences of this interaction in the context of ecological speciation, where natural selection causes reproductive isolation between environmentally differentiated populations connected by migration. In brief, divergent selection reduces effective migration of alleles from barrier loci under divergent selection between populations. Linkage extends these genomic effects beyond the barrier locus, with the extent depending on selection strength and recombination rate106. As a consequence of this process, we observe statistical associations (linkage disequilibrium) between selected loci and neutral genetic variation both within and between populations, to various extents90,107,108. In principle, this process also applies to epigenetic variation. Assuming antagonistic pleiotropy109, where the alternate epiallelic state is respectively favored in different habitats, adaptive epimutations induced in the parental habitat reduce the fitness of migrants with an immediate effect on gene flow. If the epimutation is transitory, then the selective effects disappear in the next generation, resulting in no linkage with genetic variation. However, if epimutations exhibit some degree of transgenerational inheritance, they can come into linkage with surrounding (neutral) genetic variation or become coupled with other genetic or epigenetic variation subject to divergent selection108.

This influences our interpretation of the association between genetic and epigenetic variation. Thus far, we have treated epigenetic variation as a potentially heritable phenotype encoded by genetic variation. Associations between epigenetic state and genotype then suggest causal cis- or trans-genetic effects (Box 4). However, as illustrated in the previous paragraph, co-variation with genetic loci can also arise through selection and other mechanisms in the form of linkage disequilibrium110. Both theoretical and empirical research are needed to differentiate the processes inducing linkage disequilibrium between genetic and epigenetic variation and causal, additive genetic variation underlying epigenetic variation. Promising theoretical approaches exploring this interaction in an explicit population genetic framework are emerging, and are vital to inform empirical experimental design111,112. A first ad hoc approach to understand the source of linkage disequilibrium could consist of quantifying genetic-epigenetic associations within and among populations along a gene flow gradient under varying selection regimes in both natural and experimental settings.

Hybrid zones

Hybrid zones constitute a promising way of leveraging population divergence not only to decompose environmental and genetic components of epigenetic variation, but also to address the role of autonomous epigenetic variation in speciation. The use of hybrid zones has a longstanding tradition in evolutionary genetic studies113 and has increased our understanding of the processes governing population divergence114. By maximizing phenotypic and genetic variation and generating novel combinations thereof, they are suited to map the genetic basis of phenotypic variation while partially controlling for environmental variation in a geographically confined center of the zone115. For studies of epigenetic variation, hybrid zones confer similar advantages (Fig. 2). In the central part of the hybrid zone, environmental variation is often reduced relative to the parental core habitat, while genetic variation is maximized in the admixed genomes of hybrids, which can be represented as mosaics of ancestry blocks116,117. The genetic variance between ancestry blocks exceeds variation found within populations of a single species. As populations diverge, the number of fixed differences mixing in the hybrids increases our ability to pinpoint the effects of local, cis-genetic variation or genome-wide ancestry (trans-genetic variation). Increased allelic variance in hybrids can be particularly well exploited when considering allele-specific methylation, as this allows quantification of dominance deviation under cis-control, as well as identification of transgressive methylation levels indicative of trans-effects (genome-wide hybrid ancestry). Capitalizing on this idea, cis- and trans-genetic control of epigenetic variation has been quantified across the genome of flycatcher hybrids63.

Fig. 2: Epigenetic variation in hybrid zones.
figure 2

Hybrid zones allow to concomitantly study epigenetic variation and genetic ancestry and may help pinpointing the source of epigenetic variation (upper two panels). Cline analysis is a powerful tool to infer selection on both genetic and epigenetic variation and compare the two (lower panel). Geographic cline analysis characterizes the change of variation in genetic and epigenetic loci (here DNA methylation) on a spatial axis, whereas genomic cline analyses depict epigenetic divergence as a function of genome-wide ancestry.

Another promising avenue for exploiting information from hybrids may be to adapt geographic and genomic cline theory to epigenetic datasets and explore introgression dynamics of epigenetic and genetic loci116. Let us consider a geographic cline resulting from secondary contact of two populations that have previously diverged in isolation. In the center of the hybrid zone, we assume that the environment is rather homogeneous while genomes are admixed. Genetic variants sharing ancestry within a non-recombined segment of the genome will be characterized by elevated linkage disequilibrium. Epigenetic variants that are subject to cis-control should as a consequence also be in LD with one another and with the surrounding genetic variation. The same holds true for the fraction of spontaneous epigenetic variation that has accumulated differences during population isolation comparable to the surrounding genetic variation. Epigenetic variation under environmental or trans-genetic control should, however, not show this association. The former should show little to no variation, while the latter should reflect genome-wide ancestry (or ancestry at distant genetic loci). Expanding the rationale from the center of the zone to include a full transect of individuals from pure parental populations provides additional information. Genetic variants under divergent selection resist gene flow and are expected to show a steep cline118. Steep clines of single epigenetic loci may similarly reflect selection for trans-generationally stable epigenetic variation or simply reflect an underlying steep environmental gradient. The latter is less likely, while the former must be first separated from epigenetic variation maintained solely through genetic variants. The question of whether selection acted on the epigenetic locus or a genetic locus in LD is, however, difficult to answer, but may benefit from tissue-specific investigation.

This thought experiment illustrates two points: first, hybrid zones may hold key insight into the source of epigenetic variation and selection. Second, things are complicated, as processes can interact in intricate ways. Here, we can outline only the basic idea; theoretical research is needed to fill in detailed predictions. Evolutionary null models retracing the process of population divergence followed by secondary contact varying recombination, and genetic and epigenetic mutation rates could shed first light on the conditions necessary to distinguish the sources of epigenetic variation. Adding selection and coupling across regions of the genome may then help to understand the role of epigenetic variation in speciation. For simplicity, we here considered a geographic cline, but the rationale similarly extends to genomic clines, which hold promise for inferring the selection of epigenetic variation while controlling for genome-wide ancestry.

There are also a number of challenges on the empirical side. It remains to be explored how to best represent the change in epigenetic variation. Treatment as an epigenotype (binary or trinary) allows the calculation of epigenetic ancestry likelihood, analogous to genetic ancestry derived from genotype frequencies along the cline. An estimate of linkage disequilibrium would similarly require treatment of the epigenetic variation as a discrete genotype. Although cline analyses and measures of linkage disequilibrium can be conducted on a single epigenetic locus, analyzes will benefit from phased haplotype data embedding epigenetic information directly into (ancestry-informative) genetic background. With the advent of long-read sequencing data including information on chromatin modification such data can now be obtained. Despite substantial challenges, we see unexplored potential in ‘hybrid zone epigenetics’ and encourage both theoretical and empirical studies to adapt existing statistical approaches116.

Demographic inference

Incorporating epigenetic variation into coalescent methods may be valuable for tracing recent demographic changes (i.e. bottlenecks, range expansion or contraction, or speciation) induced by changing climate or human-inflicted perturbations. CpG sites with clock-like behavior that are due to spontaneous epimutations may carry additional information on genealogies and may help estimate the timing of lineage divergence or reconstruct population demographic histories. Note, that such a ‘genealogical’ clock’ involves trans-generationally stable epimutations and is not to be confounded with an ‘ontogenetic’ clock tracing individual age119. A theoretical framework to predict the frequency distribution (site frequency spectrum) of highly mutable epimutations (mSFS) was developed by Charlesworth and Jain120 paving the way for statistical inference of neutral and selective factors influencing the methylome of A. thaliana121,122,123 and maize124. Going one step further, Sellinger et al.125 incorporated DNA methylation data into full-genome demographic inference. The conceptual novelty here is to infer the coalescent genealogy of a sample by integrating several markers in a genomic region: SNPs, but also additional markers with a higher mutation rate (e.g. 5mC methylation). This method extends the pairwise Sequentially Markovian Coalescent (SMC) models and allows us to infer: (1) the mutation rates of methylation/demethylation if the SNP mutation rate is known, (2) the relative importance of site- and region-level epimutation processes, and (3) the past demographic history of the population. The accuracy and resolution to reconstruct bottlenecks that occurred in the recent past is improved over estimates from genetic data (SNPs) alone125. Approaches leveraging spontaneous epimutations could be applied to resolve important evolutionary events as they can be reliably tracked even within one organism such as long-living trees or clonal organisms126. This relies on identifying neutral methylation sites unaffected by regional methylation and environmental changes. These examples are based on plant research where epigenetic inheritance is well established and are intended to motivate research in other taxa.

Conclusions

The degree to which epigenetic variation makes an independent contribution to evolution is not yet clear. We advocate for research in natural populations complementing controlled experimental research in the lab. However, in order for epigenetic studies in the wild to make a lasting contribution to this question, they need to become more comprehensive by incorporating genetic, environmental, and phenotypic data in a rigorous statistical framework. Otherwise, the influence of the genetic sequence is likely to be underestimated, paving the way for epigenetic storytelling. The practical challenges are numerous and range from technical considerations and experimental decisions to finding meaningful metrics for epigenetic variation and appropriate statistical frameworks for data integration. Despite these challenges, we see the dual potential of this approach. The inclusion of epigenetic variation in population genetic methodology not only helps to assess to which degree epigenetic variation may or may not constitute an autonomous, non-genetic substrate for transgenerational evolutionary change. It also provides an additional axis of information for other questions in population genetics. The strongest approaches are those where we will be able to integrate genetic, epigenetic and environmental datasets with phenotypic measurements to understand the independent and joint impact of the DNA sequence and epigenetic modifications. In order to do this, we need to foster exchange between bench biologists and the empirical and theoretical population geneticists studying the evolutionary processes underlying (epi)genetic variation in natural populations.