Efficiency of genomic prediction across two Eucalyptus nitens seed orchards with different selection histories

Suontama, Mari; Klápště, Jaroslav; Telfer, Emily; Graham, Natalie; Stovold, Toby; Low, Charlie; McKinley, Russell; Dungey, Heidi

doi:10.1038/s41437-018-0119-5

Article
Open access
Published: 06 July 2018

Mari Suontama¹, Jaroslav Klápště¹ (ORCID: orcid.org/0000-0001-5504-3735), Emily Telfer¹, Natalie Graham¹, Toby Stovold¹, Charlie Low¹, Russell McKinley¹ & Heidi Dungey¹

Heredity volume 122, pages 370–379 (2019)

A Correction to this article was published on 22 October 2018

This article has been updated

Abstract

Genomic selection is expected to enhance the genetic improvement of forest tree species by providing more accurate estimates of breeding values through marker-based relationship matrices compared with pedigree-based methodologies. When adequately robust genomic prediction models are available, an additional increase in genetic gains can be made possible with the shortening of the breeding cycle through elimination of the progeny testing phase and early selection of parental candidates. The potential of genomic selection was investigated in an advanced Eucalyptus nitens breeding population focused on improvement for solid wood production. A high-density SNP chip (EUChip60K) was used to genotype 691 individuals in the breeding population, which represented two seed orchards with different selection histories. Phenotypic records for growth and form traits at age six, and for wood quality traits at age seven, were available to build genomic prediction models using GBLUP, which were compared to the traditional pedigree-based alternative using BLUP. GBLUP demonstrated that breeding value accuracy would be improved and substantial increases in genetic gains towards solid wood production would be achieved. Cross-validation within and across two different seed orchards indicated that genomic predictions would likely benefit in terms of higher predictive accuracy from increasing the size of the training data sets through higher relatedness and better utilization of LD.

Introduction

Shining gum, Eucalyptus nitens (Deane & Maiden), is the most important commercial eucalypt species in New Zealand, with an advanced breeding programme moving towards its fourth generation (Klápště et al. 2017). The New Zealand E. nitens breeding programme is based on an open-pollinated (OP) breeding population; therefore, some inaccuracy in breeding values is expected due to unknown paternal contribution and plausible pedigree errors. Precisely recorded pedigrees and known ancestors of the breeding population individuals are prerequisites for accurate genetic evaluation and, consequently, for efficient breeding programme management and boosted genetic gain in forest tree species.

Genomic selection was proposed as a tool to predict individual breeding values on the basis of information from high-density genetic marker panels through multiple regression models (Meuwissen et al. 2001). Development of next generation sequencing technologies such as genotyping by sequencing (Elshire et al. 2011) or exome capture (Neves et al. 2013) has allowed implementation of genomic technologies in species with missing reference genomes such as forest trees (Resende et al. 2012a; Resende et al. 2012b; Ratcliffe et al. 2015; El-Dien et al. 2015; Beaulieu et al. 2014a).

High-density genotyping of a sufficiently large sub-sample of the target breeding population offers a tool for more precise selection of breeding candidates (Resende et al. 2012b) by fitting marker-based relationship matrices instead of documented pedigrees (Nejati-Javaremi et al. 1997; Van Raden 2008). The advantage of marker-based relationship matrices is that gaps are filled in pairwise relatedness in forest tree pedigrees, which leads to an increase in the accuracy of genetic parameters and more precise selections of breeding candidates (Zapata-Valenzuela et al. 2013; Isik et al. 2016; Müller et al. 2017; Tan et al. 2017). The genomic prediction model generally captures: (1) shared genealogy, (2) co-segregation and (3) linkage disequilibrium (LD) between markers and quantitative trait loci (QTL). The contribution of each of these is affected by the genetic architecture of a trait, genomic marker density and marker distribution and effective population size (Habier et al. 2013). The accuracy of genomic prediction also depends on a trait’s heritability, training population size and the effective number of chromosomal segments, defined as the function of a trait’s genetic architecture (distribution of QTLs) and decay of LD along the chromosome (Hayes et al. 2009). The level of relatedness between training and validation population is also an important factor affecting the accuracy of genomic breeding values (Habier et al. 2010; Scutari et al. 2016).
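As a rough guide to how these factors interact, a commonly used deterministic approximation (usually attributed to Daetwyler and colleagues; it is not taken from this study) expresses the expected accuracy $r$ of genomic breeding values in terms of the training population size $N$, the trait heritability $h^2$ and the effective number of independent chromosomal segments $M_e$:

$$r \approx \sqrt{\frac{N h^2}{N h^2 + M_e}}$$

With purely illustrative values $N = 691$, $h^2 = 0.3$ and an assumed $M_e = 1000$, this gives $r \approx \sqrt{207.3/1207.3} \approx 0.41$. Under this approximation, accuracy rises with a larger training population or a higher heritability and falls as $M_e$ grows, which happens with a larger effective population size and faster LD decay.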

Eucalyptus nitens has been grown predominantly for pulp wood production with short rotations in Southland, the southernmost region of New Zealand and the major E. nitens plantation area. Breeding objectives have been for improved growth and form (Wilcox 1980; King and Wilcox 1988), whilst wood quality traits have not been the focus of breeding until recently, with an increasing interest in the use of E. nitens for higher value solid wood products. Similar trends in breeding of eucalypt species for wood quality have been reported in previous studies (Raymond 2002; Kube 2005; Kube and Raymond 2005; Grattapaglia and Kirst 2008; Hamilton et al. 2009). Earlier research efforts on implementing molecular genetics in breeding of E. nitens have included quantitative trait locus (QTL) and candidate gene approaches, which identified a number of candidate genes for wood quality and QTLs for growth and vegetative propagation traits (Thumma et al. 2010a, 2010b). The use of genomic estimated breeding values was proposed in the latest breeding plan for E. nitens in New Zealand, with a focus on solid wood production. Implementation of genomic estimated breeding values is expected to accelerate the rate of genetic progress for the breeding objective traits because of greater accuracy of genetic evaluation and a shorter generation interval than in traditional breeding (Resende et al. 2012; Resende et al. 2012b).

The objective of this study was to investigate the improvement in accuracy of genetic evaluation for growth, form and wood quality traits when using marker-based breeding values with GBLUP methodology compared to pedigree-based BLUP in an advanced E. nitens population. Firstly, the aim was to estimate heritability and accuracy of pedigree and marker-based breeding values within and across the progeny of two seed orchards. Secondly, the aim was to compare predictive accuracy of BLUP and GBLUP estimated breeding values within and across the seed orchards. Finally, implications of the results for the breeding programme and the next steps in the research will be discussed.

Materials and methods

Materials

The E. nitens population used in this study was established as an OP progeny test of families from two independent seed orchards, Waiouru and Tinkers. The number of families in the progeny test originating from the Waiouru seed orchard was 90, and the number of families in the progeny test from the Tinkers seed orchard was 25. In the current study, 51 and 100% of the families were represented from the Waiouru and Tinkers seed orchards, respectively. Outline of the breeding programme history is described by Klápště et al. (2017).

The third generation progeny trial used in this study was located in the South Island of New Zealand. The total number of individuals in the trial was 3600, of which a subsample of 691 individuals with both genomic information and phenotypic trait records, representing 72 families with 1–24 (average of 9.6) individuals per family, was used for genomic prediction analysis. The Waiouru seed orchard was represented by 431 genotyped individuals with an effective population size of 133.2, while the Tinkers seed orchard was represented by 236 genotyped individuals with an effective population size of 64.7. The effective population size was estimated in terms of status number as $N_S = \frac{1}{2\theta}$, where $\theta$ is the group co-ancestry of individuals (Lindgren et al. 1996). The remaining genotyped individuals represented the Australian Tree Seed Centre (ATSC) orchard and served as control trees in the progeny test. Trees in this progeny test were assessed at age six for tree height, diameter at breast height (DBH) and stem straightness. The same trees were assessed at age seven for the following wood quality traits: wood density, wood stiffness, wood shrinkage and growth strain. The methodology used to measure wood quality traits is described in detail by Klápště et al. (2017).
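The status number calculation can be illustrated with a minimal sketch. It assumes that group co-ancestry is the mean of all pairwise co-ancestries, self-co-ancestry included, which is the usual reading of Lindgren et al. (1996). The function name and the toy matrices are hypothetical and are not taken from the study.

```python
def status_number(coancestry):
    """Status number N_S = 1 / (2 * theta).

    coancestry: square list of lists of pairwise co-ancestry coefficients,
    with self-co-ancestry (0.5 for a non-inbred tree) on the diagonal.
    theta is taken as the mean of all entries (an assumption of this sketch).
    """
    n = len(coancestry)
    theta = sum(sum(row) for row in coancestry) / (n * n)
    return 1.0 / (2.0 * theta)


# Hypothetical example 1: four unrelated, non-inbred trees.
unrelated = [[0.5 if i == j else 0.0 for j in range(4)] for i in range(4)]
n_s_unrelated = status_number(unrelated)

# Hypothetical example 2: two half-sib pairs (co-ancestry 0.125 within a pair).
half_sibs = [row[:] for row in unrelated]
half_sibs[0][1] = half_sibs[1][0] = 0.125
half_sibs[2][3] = half_sibs[3][2] = 0.125
n_s_half_sibs = status_number(half_sibs)
```

For unrelated, non-inbred trees the status number equals the census number (four here). Relatedness within the group lowers it, which is consistent with the reported values of 133.2 and 64.7 being well below the 431 and 236 genotyped individuals.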

Genomic data were generated by extracting DNA from the leaf tissue of 691 individuals from the progeny trial using the commercially available NucleoSpin® 96 Plant II kit (Macherey-Nagel, Düren, Germany) (Telfer et al. 2013); the samples were sent to GeneSeek, Inc. (a Neogen company, Lincoln, NE, USA) for genotyping. Genotyping was undertaken using the Illumina Infinium EUChip60K SNP chip (Silva‐Junior et al. 2015) with SNP calling performed on the basis of multi-taxa and Maidenaria section reference. Both call algorithms produced a similar number of SNPs (58,307 vs. 58,323). The marker data were filtered for GenTrain score > 0.5, GenCall > 0.15, minor allele frequency > 0.01, SNP call rate > 0.6 and pairwise LD in terms of a composite estimate (r² < 0.9), with 12,236 SNPs selected to train genomic prediction models. The missing data were imputed through the expectation-maximisation algorithm implemented in the “rrBLUP” package (Endelman 2011). The spectral decomposition of the realized relationship matrix showed a clear segregation of each seed orchard population with Australian Tree Seed Centre (ATSC) families in between. This reflects the differences in the genetic background of these populations due to different selection strategies (Fig. 1). LD decreased to 0.2 within 3 kb in Waiouru and within 5 kb in Tinkers (Fig. 2).
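Two of the filtering steps above, minor allele frequency and SNP call rate, can be sketched as follows. This is an illustration only: the GenTrain and GenCall scores come from the genotyping platform and the LD pruning step requires pairwise r² estimates, so all three are omitted. The function name, the `None` coding for missing calls and the toy genotypes are assumptions of the sketch.

```python
def filter_snps(genotypes, min_maf=0.01, min_call_rate=0.6):
    """Return indices of SNP columns passing the MAF and call-rate filters.

    genotypes: list of rows (one per tree); entries are allele counts
    0, 1 or 2, or None for a missing call (hypothetical coding).
    Thresholds are strict (>) to match the text.
    """
    n_trees = len(genotypes)
    keep = []
    for j in range(len(genotypes[0])):
        calls = [row[j] for row in genotypes if row[j] is not None]
        call_rate = len(calls) / n_trees
        if call_rate <= min_call_rate:
            continue  # too many missing calls for this SNP
        freq = sum(calls) / (2 * len(calls))  # frequency of the counted allele
        maf = min(freq, 1.0 - freq)
        if maf > min_maf:
            keep.append(j)
    return keep


# Toy data: 5 trees x 4 SNPs.
toy = [
    [0, 2, None, 1],
    [1, 2, None, 0],
    [2, 2, 1, None],
    [1, 2, None, 2],
    [0, 2, 0, 1],
]
kept = filter_snps(toy)
```

In the toy data the second SNP is monomorphic and the third has a call rate of 0.4, so only the first and last columns are retained.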

Methods

The genetic parameters were estimated using linear mixed models implemented in the ASReml-R package (Butler et al. 2009). Two models using either a pedigree or marker-based relationship matrix were investigated and compared. A pedigree-based model (BLUP) was used as follows:

$$\boldsymbol{y} = \mathbf{X}\boldsymbol{\beta} + \mathbf{Z}_1\boldsymbol{u} + \mathbf{Z}_2\boldsymbol{r} + \mathbf{Z}_3\boldsymbol{r}(\boldsymbol{s}) + \boldsymbol{e} \tag{1}$$

where y is a vector of measurements, β is a vector of fixed terms such as intercept and seed source, u is a vector of additive genetic effects (breeding values) following $\boldsymbol{u} \sim N(0, \sigma_a^2\mathbf{A})$, where $\sigma_a^2$ is the additive genetic variance and A is the average numerator relationship matrix (Wright 1922), r is a vector of random replication effects following $\boldsymbol{r} \sim N(0, \sigma_r^2\mathbf{I})$, where $\sigma_r^2$ is the replication variance and I is the identity matrix, and r(s) is a vector of effects of set nested within replication following $\boldsymbol{r}(\boldsymbol{s}) \sim N(0, \sigma_{r(s)}^2\mathbf{I})$, where $\sigma_{r(s)}^2$ is the variance of set nested within replication. Set is an incomplete block effect within a replicate, where an equal number of families was allocated to each set within a replicate. The vector of random residuals is e, following $\boldsymbol{e} \sim N(0, \sigma_e^2\mathbf{I})$, where $\sigma_e^2$ is the residual variance, and X, Z₁, Z₂ and Z₃ are incidence matrices assigning fixed and random effects to measurements in vector y (Klápště et al. 2017). The model accommodating a marker-based relationship matrix (GBLUP) used the same Eq. (1), but the average numerator relationship matrix A was substituted by a marker-based relationship matrix G, which was estimated as follows:

$$\mathbf{G} = \frac{\mathbf{ZZ}'}{\mathrm{tr}\left[\mathbf{ZZ}'\right]/n} \tag{2}$$

where Z is M – P, M is the marker matrix with genotypes coded 0, 1 and 2 for alternative allele homozygote, heterozygote and reference allele homozygote, respectively, P is a vector of twice the reference allele frequency, tr[ZZ′] is the trace of the matrix in the numerator and n is the number of markers (Forni et al. 2011). Heritability represents the proportion of a trait’s variance explained by genetic factors and can provide inference about the potential efficiency of any genetic improvement (Falconer and Mackay 1996). Narrow-sense heritability was estimated as:

$$\widehat{h}^2 = \frac{\widehat{\sigma}_a^2}{\widehat{\sigma}_a^2 + \widehat{\sigma}_e^2} \tag{3}$$

where $\sigma_a^2$ is the additive genetic variance and $\sigma_e^2$ is the residual variance. The accuracy of breeding values is the correlation between the estimates obtained from the model in Eq. (1) and the true breeding values, which are generally unknown. The theoretical accuracy of breeding values was estimated using the following formula:

$$r = \sqrt{1 - \frac{\mathrm{PEV}}{G_{ii}\sigma_a^2}} \tag{4}$$

where PEV is the prediction error variance (Mrode 2014) and $G_{ii}$ is the diagonal element of the marker-based relationship matrix for the ith individual, which is replaced by $A_{ii}$ in the pedigree-based scenario.
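Equations (2) to (4) can be sketched in a few lines of plain Python. This is an illustration under stated assumptions, not the ASReml-R workflow used in the study: the function names, the toy marker matrix and the variance components are hypothetical. The scaling constant n is left as an argument; passing the number of individuals makes the mean diagonal of G equal to one, whereas the text above describes n as the number of markers.

```python
def g_matrix(M, n):
    """Marker-based relationship matrix G = ZZ' / (tr[ZZ'] / n), with Z = M - P.

    M: list of rows (one per individual) of allele counts 0/1/2.
    n: scaling constant from Eq. (2), supplied by the caller.
    """
    n_ind, n_snp = len(M), len(M[0])
    # P holds twice the frequency of the counted allele, i.e. the column mean.
    P = [sum(row[j] for row in M) / n_ind for j in range(n_snp)]
    Z = [[row[j] - P[j] for j in range(n_snp)] for row in M]
    ZZt = [[sum(a * b for a, b in zip(zi, zk)) for zk in Z] for zi in Z]
    trace = sum(ZZt[i][i] for i in range(n_ind))
    return [[v / (trace / n) for v in row] for row in ZZt]


def heritability(sigma_a2, sigma_e2):
    """Narrow-sense heritability, Eq. (3)."""
    return sigma_a2 / (sigma_a2 + sigma_e2)


def accuracy(pev, g_ii, sigma_a2):
    """Theoretical breeding value accuracy, Eq. (4)."""
    return (1.0 - pev / (g_ii * sigma_a2)) ** 0.5


# Toy marker matrix: 4 individuals x 3 SNPs (hypothetical values).
M = [[0, 1, 2], [1, 1, 0], [2, 0, 1], [1, 2, 1]]
G = g_matrix(M, n=len(M))
mean_diag = sum(G[i][i] for i in range(len(M))) / len(M)

# Hypothetical variance components and prediction error variance.
h2 = heritability(sigma_a2=0.3, sigma_e2=0.7)
acc = accuracy(pev=0.15, g_ii=1.0, sigma_a2=0.3)
```

With these made-up inputs the heritability is 0.3 and the accuracy is √0.5 ≈ 0.71. In practice the variance components and PEV would come from the fitted mixed model rather than being supplied by hand.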

A 10-fold cross-validation with 30 replications was used as an independent evaluation. It was performed at an individual level, and within, between, and across seed orchards. The resulting predictive accuracies indicate the efficiency of the marker-based model as a prediction tool for breeding values based solely on marker information. Such a scenario reflects the main advantages of genomic selection in breeding programmes: the elimination of the testing phase (establishment of progeny trials) from breeding cycles and selections based only on genetic markers. Predictive accuracy was estimated as follows:

$$r_p = \mathrm{cor}\left(\mathrm{EBV}, \mathrm{GEBV}\right) \tag{5}$$

where EBV is the vector of breeding values estimated by the pedigree-based model and GEBV is the vector of breeding values predicted in cross-validation using the marker-based model.
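The replicated 10-fold procedure and Eq. (5) can be sketched as below. Several simplifications are assumed and none of them is taken from the study: the variance ratio `lam` (residual over additive variance) is treated as known, the only fixed effect is a training-set mean, the replication and set effects of Eq. (1) are ignored, and the correlation is computed once per replicate over all held-out predictions. All names and the simulated toy data are hypothetical.

```python
import random


def pearson(x, y):
    """Pearson correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def gblup_predict(G, y, train, val, lam):
    """u_val = G[val, train] (G[train, train] + lam * I)^-1 (y_train - mean)."""
    mu = sum(y[i] for i in train) / len(train)
    lhs = [[G[i][j] + (lam if i == j else 0.0) for j in train] for i in train]
    alpha = solve(lhs, [y[i] - mu for i in train])
    return [sum(G[v][t] * a for t, a in zip(train, alpha)) for v in val]


def cross_validate(G, y, ebv, lam, k=10, reps=30, seed=1):
    """Mean over replicates of cor(EBV, GEBV), with GEBV predicted out of fold."""
    rng = random.Random(seed)
    ids = list(range(len(y)))
    accs = []
    for _ in range(reps):
        rng.shuffle(ids)
        gebv = [0.0] * len(y)
        for f in range(k):
            val = ids[f::k]  # every k-th shuffled individual forms one fold
            held_out = set(val)
            train = [i for i in ids if i not in held_out]
            for v, p in zip(val, gblup_predict(G, y, train, val, lam)):
                gebv[v] = p
        accs.append(pearson(ebv, gebv))
    return sum(accs) / len(accs)


# Simulated toy data: 30 individuals, 60 SNPs (not the study data).
rng = random.Random(7)
n_ind, n_snp = 30, 60
M = [[rng.choice((0, 1, 2)) for _ in range(n_snp)] for _ in range(n_ind)]
P = [sum(row[j] for row in M) / n_ind for j in range(n_snp)]
Z = [[row[j] - P[j] for j in range(n_snp)] for row in M]
ZZt = [[sum(a * b for a, b in zip(zi, zk)) for zk in Z] for zi in Z]
scale = sum(ZZt[i][i] for i in range(n_ind)) / n_ind  # mean diagonal of G is 1
G = [[v / scale for v in row] for row in ZZt]
effects = [rng.gauss(0, 1) for _ in range(n_snp)]
ebv = [sum(z * e for z, e in zip(zi, effects)) for zi in Z]  # stand-in for pedigree EBVs
y = [g + rng.gauss(0, 3) for g in ebv]
r_p = cross_validate(G, y, ebv, lam=1.0, k=10, reps=3)
```

For the between-orchard scenario, the same `gblup_predict` call would be made once with the indices of one orchard as `train` and the other as `val`, instead of random folds.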

The full efficiency of genomic selection was investigated by comparing genetic gains per generation using pedigree-based estimated breeding values (BLUP) with genetic gains using genomic marker-based estimated breeding values (GBLUP). Both of these individual based estimated genetic gains included the availability of phenotypic records for all traits. The mean value of BLUP and GBLUP breeding values of the 20% of selected individuals was the measurement for the estimated genetic gain.
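The gain measure described above, the mean breeding value of the selected 20%, can be sketched as follows. The function name and the toy values are hypothetical, and the sketch assumes that larger breeding values are preferred; for traits where smaller values are desirable the ranking would be reversed.

```python
def genetic_gain(breeding_values, proportion=0.2):
    """Mean breeding value of the top `proportion` of individuals."""
    n_sel = max(1, round(len(breeding_values) * proportion))
    top = sorted(breeding_values, reverse=True)[:n_sel]
    return sum(top) / n_sel


# Hypothetical breeding values for ten trees; the best two are selected.
bvs = [0.5, -0.2, 1.1, 0.0, -0.7, 0.3, 0.9, -1.0, 0.1, -0.4]
gain = genetic_gain(bvs)
```

Applying the same function to BLUP and GBLUP breeding values for a trait gives the two quantities compared in the study (ΔG_BLUP and ΔG_GBLUP).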

Results

Estimates of heritability and accuracy of breeding values

Pedigree-based analysis showed low to moderate within seed orchard heritabilities in both seed orchards (Table 1). Pedigree-based heritability estimates were higher at Tinkers than at Waiouru for radial wood shrinkage, tangential air-dry wood shrinkage and wood stiffness at 1.4–3 m, but lower at Tinkers for all other traits. Marker-based within seed orchard heritabilities were also low to moderate, and generally higher at Waiouru than at Tinkers, except for tangential air-dry wood shrinkage and growth strain (Table 1).

Table 1 Estimates of narrow sense heritability (h²) with their standard errors (se) and breeding value accuracies (r) within seed orchards

Across seed orchard heritability estimates were generally higher using marker-based breeding values than pedigree-based with only a few exceptions (Table 2). Reconditioned radial wood shrinkage, DBH, height and stem straightness indicated lower heritability estimates using marker-based models than when pedigree-based models were used. The highest heritabilities were estimated for wood shrinkage and wood density traits. The lowest heritability estimates were obtained for DBH and height and were relatively similar with both pedigree and marker-based methods.

Table 2 Estimates of narrow sense heritability (h²) with their standard errors (se) and breeding value accuracies (r) and estimated genetic gains per generation (ΔG_BLUP, ΔG_GBLUP) across seed orchards

Accuracies of breeding values were generally higher in both seed orchards when using marker-based rather than pedigree-based estimated breeding values. However, there were some exceptions: the Tinkers seed orchard showed lower genomic breeding value accuracies for radial wood shrinkage, DBH and height than the pedigree-based models gave for the same traits, reflecting the pattern in heritability.

Cross-validation

The cross-validation analysis showed no, or very low, predictive accuracy for breeding values for one seed orchard when based on a model trained in an alternative seed orchard, using pedigree-based or marker-based models (Table 3). Within seed orchard predictive accuracy, however, improved considerably for both seed orchards and for the majority of traits when using marker-based models. Pedigree-based models resulted in lower predictive accuracy for most of the traits within Waiouru seed orchard compared with Tinkers seed orchard, but this trend was not so obvious with marker-based models. Across seed orchards, predictive accuracies were higher for marker-based analysis than for pedigree-based predictions for the majority of traits.

Table 3 Predictive accuracy from cross-validation scenarios on individual BLUP and GBLUP breeding values within the Waiouru (W) and Tinkers (T) seed orchards and across both seed orchards (WT)

Estimates of genetic gains

In general, estimated absolute genetic gains per generation were, as expected, higher when using GBLUP breeding values than when using BLUP breeding values (Table 2). The estimated genetic gains reflected the magnitude of heritability for the traits. The largest difference in genetic gains between BLUP and GBLUP was for growth strain in the upper part of the log, which is one of the most important traits in selection for solid wood production of this species. Stem straightness was the only trait that had a significantly lower genetic gain when using GBLUP than BLUP.

Discussion

The benefits of using information from genomic markers in the genetic evaluation were apparent in the current study, where both within seed orchard and across seed orchard estimations supported the use of GBLUP over BLUP predictions. The reduced number of SNPs after filtering still provided sufficient genomic information to perform efficient marker-based predictions, which improved breeding value accuracies when compared with pedigree-based evaluations. A gain in accuracy of breeding values would likely be even more substantial when increasing the size of the training population by additional genotyping. Grattapaglia and Resende (2011) estimated that the impact of training population size on the accuracy of genomic breeding values would increase up to a sample size of 2000. However, the effect of the training population size on accuracy of breeding values also depends on genomic marker density and the number of QTL controlling the trait (Grattapaglia and Resende 2011). The size and composition of the training sets and the number of SNPs were found to be more important factors in genomic prediction than statistical methodology or the genomic location of markers (i.e. genic vs. intergenic) in eucalypts for growth and wood traits (Tan et al. 2017). Similarly, there were no noticeable differences between statistical methodologies used in maritime pine when comparing GBLUP and Bayesian methods (Isik et al. 2016). Depending on the importance of LD in contributing to accuracy of genomic predicted breeding values, Bayesian models may perform better than GBLUP, for which the decay of the prediction accuracy tends to be larger, especially when the training data set is relatively small (Habier et al. 2010).

Forest tree breeding programmes are generally in the very early stages compared to animal and crop breeding programmes with faster generation turnover (Grattapaglia and Kirst 2008; Isik 2014). This causes genetic parameter estimates to be less accurate, as it is not always possible to take into account complete pedigrees, identify specific genetic groups or consider data since the selection began. Our analysis found that marker-based predictions improved the accuracy of genetic parameter estimates, and also resulted in higher predictive accuracies in cross-validation evaluations than pedigree-based breeding values. The likely source of this improvement is the utilization of all the available information in the populations through a complete pairwise marker-based relationship matrix accounting for realised genetic relationships between individuals (Zapata-Valenzuela et al. 2013). This improvement and the faster progress in genetic improvement and delivery are the major benefits of implementing genomics in forest tree breeding (Grattapaglia and Resende 2011).

In our study, the genomic information was available only for those individuals in the progeny test but not for their parents. Therefore, further improvement of genetic parameters through pedigree reconstruction cannot be achieved. Pedigree reconstruction is able to recover unknown relationships and correct inconsistencies in documented pedigrees (Doerksen and Herbinger 2010; El-Kassaby et al. 2011; Telfer et al. 2015; Vidal et al. 2015; Tan et al. 2018). More accurate genetic relationship information would increase the precision of genetic parameter estimations, and can be further explored through dense marker arrays that capture Mendelian sampling terms through the construction of a marker-based relationship matrix (Habier et al. 2007; Hayes et al. 2009; VanRaden 2008). In Eucalyptus hybrids, as a result of pedigree inconsistencies, genomic predictions outperformed pedigree-based predictions, which were largely underestimated (Tan et al. 2017). Sib-ship reconstruction was previously applied to the population used in this study by Klápště et al. (2017), who estimated the proportion of selfs being 4% in the population, with DBH and growth strain being the traits affected by inbreeding depression. Implementing sib-ship reconstruction increased genetic parameters and breeding value accuracies for the traits under inbreeding depression, and was consequently regarded as a useful tool to cull inbred individuals or selfs from the breeding population (Klápště et al. 2017). However, sib-ship reconstruction is not able to recover all classes of relatedness present in advanced breeding populations, and rather parentage reconstruction should be used when possible (Klápště et al. 2017).

The exceptionally low heritability for DBH and tree height in the current study probably reflects the composition of the training data set, which is a selected subsample of the population. This resulted in a reduced genetic variance, and the GBLUP implementation did not achieve any improvement in heritability estimates for those traits either. Between seed orchards, the marker-based predictions showed generally lower heritability estimates in Tinkers compared with Waiouru. This is interpreted as being a consequence of a higher selection intensity applied in the Tinkers population compared with Waiouru, which resulted in partial fixation of the genetic variance. The opposite overall trend for heritability estimates between the seed orchards was seen in the pedigree-based predictions. This difference can be explained by pedigree-based analysis overestimating additive genetic variance when a reference population is small. The across seed orchard estimates for heritability and breeding value accuracy converged to intermediate values between the within seed orchard estimates. Surprisingly, a larger sample size did not result in a higher accuracy of genetic parameters, which is attributed to merging two populations with different selection histories and lower relatedness (Habier et al. 2013). The comparison of LD decay showed a strong difference between populations, which probably resulted in a lack of transferability of marker effects between the merged populations.

Cross-validation was performed at an individual level to dissect the effects of genetic relationships, co-segregation of alleles, and LD between markers and QTL, the three factors that genomic prediction is based on (Habier et al. 2013). The cross-validation captured all of the effects and showed a higher predictive accuracy in the Tinkers seed orchard compared to the Waiouru. This result was somewhat contrary to the estimated heritability and the accuracy estimates. The higher predictive accuracy in the Tinkers population can be explained by lower effective population size and larger haploblocks, which are built in populations created under higher selection intensity (supported by slower LD decay), and thus the whole genetic complex can be efficiently captured even by a sparse marker array (Ødegård and Meuwissen 2014). Transferability of this kind of prediction model is highly reduced, as can be seen in the cross-validation between seed orchards. As a result, a slightly higher predictive accuracy was found when the model was trained on the Waiouru seed orchard. However, the effect is limited due to lack of connectivity when using only one seed orchard as a training population to predict the other. The across population cross-validation again produced intermediate predictive accuracies between the seed orchards, but did not improve the estimates compared to the Tinkers population, in spite of an increase in training population size. Cross-validation performed in the same generation may not be ideal to estimate predictive accuracy for forward selection in future breeding, whereas cross-validation over generations would likely result in comparatively better predictions (Isik et al. 2016). The accuracy of GEBVs varies depending on the training population size as well as the degree of genetic relationship between the training and validation population (e.g. Habier et al. 2010; Isik et al. 2016; Durán et al. 2017), which is recommended to be as high as possible.
The relatedness is also a driver for building stronger LD through larger haploblocks and longer independent chromosomal segments, since the effective number of chromosomal segments is the outcome of the effective population size and length of the genome (Hayes et al. 2009). LD is the main contributor to the accuracy of GEBVs that is persistent over generations, therefore in case of the existence of considerable LD, a requirement for updating the phenotypes across generations is not so high (Habier et al. 2010). In spite of this, relying only on the accuracy originating from LD would result in smaller genetic gains than when the accuracy is based on both LD and relatedness between training and selection populations (Habier et al. 2007). Generally, it is highly recommended to capture a large proportion of the genetic variability in training populations in order to build robust genomic prediction models, making it important to keep a broad range of genetic material in training populations. Increasing the training population size does not only improve accuracy through higher relatedness but also through the increasing LD along the larger training population size (Habier et al. 2010). In genomics-based breeding programmes, the breeding archive should be established independently of the production seed orchards due to different requirements on genetic diversity vs. genetic gain trade-offs to utilize genomics at maximum efficiency (Grattapaglia and Resende 2011).

Genotype by environment interaction (GxE) plays a significant role in forestry tree breeding (Li et al. 2017), and is an important factor affecting transferability of prediction models. GxE would decrease the accuracy of genomic predicted breeding values as is the case also for pedigree-based estimations when genotypes are not stable across environments (Zapata-Valenzuela et al. 2013). GxE is a population and trait specific source of variation in forest tree breeding where both unstable genomic predictions (Resende et al. 2012b; Resende et al. 2012; Beaulieu et al. 2014a) as well as stable genomic predictions across environments have been reported (Lenz et al. 2017).

Estimated absolute genetic gains based on GBLUP breeding values per generation indicate that genomic selection would significantly improve the efficiency of selection for solid wood properties. The major benefit of genomic selection in accelerating the rate of genetic improvement would be derived from the ability to shorten the generation interval through the very early selection at the seedling stage (Resende et al. 2012, Beaulieu et al. 2014b). Improvement in selection efficiency was estimated at 50% in eucalypts if the breeding cycle was halved (Resende et al. 2012), and at 53–112% in loblolly pine (Resende et al. 2012b). Between 65 and 110% genetic gains were predicted for wood and growth traits in white spruce when relatedness between training and prediction data set was high, but lower as the relatedness decreased (Beaulieu et al. 2014b).

The potential to make faster selections by skipping progeny testing should be pursued in this E. nitens breeding population. A question remains as to how well prediction models perform after several breeding cycles (Resende et al. 2012), and how often the prediction models must be updated with new infusions of phenotypic data. Another aspect to consider when using genomic prediction models is age-age correlation, since transferability of genomic models may be possible only when the selections are made at the same ages (Resende et al. 2012b). Additional genotyping from other progeny testing sites in the current population is recommended to ensure genomic prediction models are stable across sites. Further research on the implications of genomic selection in this E. nitens breeding population is required to find the best possible methodology, including additional data infusions from a wider breeding population that will reflect the future selection population better than the current study does. We expect further data infusions to result in considerably higher predictive accuracy of genomic breeding values compared to pedigree-based methods. Increasing the training population size as well as applying different statistical methods that can account efficiently for the accuracy due to LD may give further confidence in implementing GEBVs in the breeding programme (Habier et al. 2010). The benefits of genomic selection per unit of time for tree growers will be considerable, and therefore cost-effective ways to apply genomics in operational breeding should also be the focus of future research.

Conclusions

This study showed that a significant improvement in breeding value accuracy and genetic gains for selection of wood properties in E. nitens was possible by implementing genomic marker-based prediction compared to pedigree-based prediction. The greatest improvement in genetic parameters was obtained for tangential air-dry wood shrinkage and growth strain, which are the key traits in selection for solid wood production in eucalypts. Wood shrinkage traits had moderate heritabilities, which mainly increased further with genomic prediction.

Results from cross-validation analysis implied that further infusions of additional seed-orchard material into the training data would be useful to increase the efficiency of genomics in the selection, in terms of both breeding value accuracy and predictive accuracy. Further analysis, including more progeny trial sites, is recommended to investigate the transferability of these models across generations and environments.
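
The across-orchard validation idea summarised above can be illustrated with a minimal sketch. It is not the analysis used in this study: the genotypes and phenotypes are simulated, the genomic relationship matrix follows the first method of VanRaden (2008), the heritability is treated as known rather than estimated, the training mean stands in for the fixed effects, and predictive accuracy is taken to be the correlation between predicted breeding values and phenotypes in the validation orchard. All names and parameter values (vanraden_g, gblup_predict, n_trees, n_snps, h2) are hypothetical.

```python
import numpy as np


def vanraden_g(M):
    """Genomic relationship matrix from 0/1/2 genotypes (VanRaden 2008, method 1)."""
    p = M.mean(axis=0) / 2.0            # sample allele frequency per SNP
    Z = M - 2.0 * p                     # centre each SNP column
    return Z @ Z.T / (2.0 * np.sum(p * (1.0 - p)))


def gblup_predict(G, y, train, test, h2):
    """Predict breeding values of `test` trees from phenotypes of `train` trees."""
    lam = (1.0 - h2) / h2               # residual-to-additive variance ratio
    mu = y[train].mean()                # simplification: mean as the only fixed effect
    K = G[np.ix_(train, train)] + lam * np.eye(len(train))
    alpha = np.linalg.solve(K, y[train] - mu)
    return G[np.ix_(test, train)] @ alpha


# Simulated data (assumption): 200 unrelated trees, 500 SNPs, two orchards of 100.
rng = np.random.default_rng(1)
n_trees, n_snps, h2 = 200, 500, 0.4
freqs = rng.uniform(0.1, 0.9, size=n_snps)
M = rng.binomial(2, freqs, size=(n_trees, n_snps)).astype(float)

true_bv = M @ rng.normal(size=n_snps)
true_bv = (true_bv - true_bv.mean()) / true_bv.std()
y = true_bv + rng.normal(scale=np.sqrt((1.0 - h2) / h2), size=n_trees)

orchard = np.repeat([0, 1], n_trees // 2)
G = vanraden_g(M)

# Train on orchard 0, validate on orchard 1.
train = np.flatnonzero(orchard == 0)
test = np.flatnonzero(orchard == 1)
gebv = gblup_predict(G, y, train, test, h2)
predictive_accuracy = np.corrcoef(gebv, y[test])[0, 1]
print(f"across-orchard predictive accuracy: {predictive_accuracy:.2f}")
```

Because the simulated trees are unrelated, the accuracy printed here is expected to be low. Adding related individuals to the training indices is the kind of data infusion the conclusions refer to.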

Data availability

Data available from Dryad: https://doi.org/10.5061/dryad.pf58510.

Change history

22 October 2018
Published online: 6 July 2018
This article was originally published under standard licence, but has now been made available under a CC BY 4.0 license. The PDF and HTML versions of the paper have been modified accordingly.

References

Beaulieu J, Doerksen TK, MacKay J, Rainville A, Bousquet J (2014a) Genomic selection accuracies within and between environments and small breeding groups in white spruce. BMC Genom 15(1):1048
Beaulieu J, Doerksen T, Clément S, MacKay J, Bousquet J (2014b) Accuracy of genomic selection models in a large population of open-pollinated families in white spruce. Heredity 113:343–352
Butler DG, Cullis BR, Gilmour AR, Gogel BJ (2009) ASReml-R reference manual. Queensland Department of Primary Industries, Queensland, Australia
Doerksen TK, Herbinger CM (2010) Impact of reconstructed pedigrees on progeny-test breeding values in red spruce. Tree Genet Genomes 6(4):591–600
Durán R, Isik F, Zapata-Valenzuela J, Balocchi C, Valenzuela S (2017) Genomic predictions of breeding values in a cloned Eucalyptus globulus population in Chile. Tree Genet Genomes 13:74. https://doi.org/10.1007/s11295-017-1158-4
El-Dien OG, Ratcliffe B, Klápště J, Chen C, Porth I, El-Kassaby YA (2015) Prediction accuracies for growth and wood attributes of interior spruce in space using genotyping-by-sequencing. BMC Genom 16(1):370
El-Kassaby YA, Cappa EP, Liewlaksaneeyanawin C, Klápště J, Lstibůrek M (2011) Breeding without breeding: is a complete pedigree necessary for efficient breeding? PLoS ONE 6(10):e25737
Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, Mitchell SE (2011) A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 6(5):e19379
Endelman JB (2011) Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome 4(3):250–255
Falconer DS, Mackay TFC (1996) Introduction to quantitative genetics, 4th edition. Longman Group Ltd. Pearson Education Limited, Harlow, Essex, England
Forni S, Aguilar I, Misztal I (2011) Different genomic relationship matrices for single-step analysis using phenotypic, pedigree and genomic information. Genet Sel Evol 43:1
Grattapaglia D, Kirst M (2008) Eucalyptus applied genomics: from gene sequences to breeding tools. New Phytol 179:911–929
Grattapaglia D, Resende MDV (2011) Genomic selection in forest tree breeding. Tree Genet Genomes 7(2):241–255
Habier D, Fernando R, Dekkers J (2007) The impact of genetic relationship information on genome-assisted breeding values. Genetics 177(4):2389–2397
Habier D, Tetens J, Seefried F-R, Lichtner P, Thaller G (2010) The impact of genetic relationship information on genomic breeding values in German Holstein cattle. Genet Sel Evol 42:5
Habier D, Fernando RL, Garrick DJ (2013) Genomic BLUP decoded: a look into the black box of genomic prediction. Genetics 194(3):597–607
Hamilton MG, Raymond CA, Harwood CE, Potts BM (2009) Genetic variation in Eucalyptus nitens pulpwood and wood shrinkage traits. Tree Genet Genomes 5(2):307–316
Hayes BJ, Visscher PM, Goddard ME (2009) Increased accuracy of artificial selection by using the realized relationship matrix. Genet Res 91:47–60
Isik F (2014) Genomic selection in forest tree breeding: the concept and an outlook to the future. New For 45:379. https://doi.org/10.1007/s11056-014-9422-z
Isik F, Bartholomé J, Farjat A, Chancerel E, Raffina A, Sanchez L, Plomion C, Bouffier L (2016) Genomic selection in maritime pine. Plant Sci 242:108–119
King JN, Wilcox MD (1988) Family tests as a basis for the genetic improvement for Eucalyptus nitens in New Zealand. NZ J For Sci 18(3):253–266
Klápště J, Suontama M, Telfer E, Graham N, Low C, Stovold T, McKinley R, Dungey H (2017) Exploration of genetic architecture through sib-ship reconstruction in advanced breeding population of Eucalyptus nitens. PLoS ONE 12(9):e0185137
Kube PD (2005) Genetic improvement of the wood properties of Eucalyptus nitens. Breeding to improve solid wood and pulp properties. PhD Thesis, University of Tasmania
Kube PD, Raymond CA (2005) Breeding to minimise the effects of collapse in Eucalyptus nitens sawn timber. For Genet 12(1):23–34
Lenz PRN, Beaulieu J, Mansfield SD, Clément S, Desponts M, Bousquet J (2017) Factors affecting the accuracy of genomic selection for growth and wood quality traits in an advanced-breeding population of black spruce (Picea mariana). BMC Genom 18:335
Li Y, Suontama M, Burdon RD, Dungey HS (2017) Genotype by environment interaction in the forest tree breeding: review methodology and perspectives on research and application. Tree Genet Genomes 13:60. https://doi.org/10.1007/s11295-017-1144-x
Lindgren D, Gea L, Jefferson P (1996) Loss of genetic diversity monitored by status number. Silvae Genet 45:52–59
Meuwissen THE, Hayes BJ, Goddard ME (2001) Prediction of total genetic values using genome-wide dense marker maps. Genetics 157:1819–1829
Mrode RA (2014) Linear models for the prediction of animal breeding values, 3rd edn. CABI, Wallingford, United Kingdom
Müller BSF, Neves LG, de Almeida Filho JE, Resende Jr MFR, Muñoz PR, dos Santos PET, Paludzyszyn Filho E, Kirst M, Grattapaglia D (2017) Genomic prediction in contrast to a genome-wide association study in explaining heritable variation of complex growth traits in breeding populations of Eucalyptus. BMC Genom 18:524. https://doi.org/10.1186/s12864-017-3920-2
Nejati-Javaremi A, Smith C, Gibson J (1997) Effect of total allelic relationship on accuracy of evaluation and response to selection. J Anim Sci 75(7):1738–1745
Neves LG, Davis JM, Barbazuk WB, Kirst M (2013) Whole‐exome targeted sequencing of the uncharacterized pine genome. Plant J 75(1):146–156
Ødegård J, Meuwissen THE (2014) Identity-by-descent genomic selection using selective and sparse genotyping. Genet Sel Evol 46:3
Ratcliffe B, El-Dien OG, Klápště J, Porth I, Chen C, Jaquish B, El-Kassaby YA (2015) A comparison of genomic selection models across time in interior spruce (Picea engelmannii×glauca) using unordered SNP imputation methods. Heredity 115(6):547–555
Raymond CA (2002) Genetics of Eucalyptus wood properties. Ann For Sci 59:525–531
Resende MFR, Muñoz P, Resende MDV, Garrick DJ, Fernando RL, Davis JM, Jokela EJ, Martin TA, Peter GF, Kirst M (2012a) Accuracy of genomic selection methods in a standard data set of loblolly pine (Pinus taeda L.). Genetics 190(4):1503–1510
Resende MFR, Muñoz P, Acosta JJ, Peter GF, Davis JM, Grattapaglia D, Resende MDV, Kirst M (2012b) Accelerating the domestication of trees using genomic selection: accuracy of prediction models across ages and environments. New Phytol 193(3):617–624
Resende MDV, Resende MFR, Sansaloni CP, Petroli CD, Missiaggia AA, Aguiar AM et al. (2012) Genomic selection for growth and wood quality in Eucalyptus: capturing the missing heritability and accelerating breeding for complex traits in forest trees. New Phytol 194:116–128
Scutari M, Mackay I, Balding D (2016) Using genetic distance to infer accuracy of genomic prediction. PLoS Genet 12(9):e1006288
Silva‐Junior OB, Faria DA, Grattapaglia D (2015) A flexible multi‐species genome‐wide 60K SNP chip developed from pooled resequencing of 240 Eucalyptus tree genomes across 12 species. New Phytol 206(4):1527–1540
Tan B, Grattapaglia D, Martins GS, Ferreira KZ, Sundberg B, Ingvarsson PK (2017) Evaluating the accuracy of genomic prediction of growth and wood traits in two Eucalyptus species and their F1 hybrids. BMC Plant Biol 17:110. https://doi.org/10.1186/s12870-017-1059-6
Tan B, Grattapaglia D, Wu HX, Ingvarsson PK (2018) Genomic relationships reveal significant dominance effects for growth in hybrid Eucalyptus. Plant Sci 267:84–93
Telfer E, Graham N, Stanbra L, Manley T, Wilcox P (2013) Extraction of high purity genomic DNA from pine for use in a high-throughput genotyping platform. NZ J For Sci 43(1):1–8
Telfer EJ, Stovold GT, Li Y, Silva-Junior OB, Grattapaglia DG, Dungey HS (2015) Parentage reconstruction in Eucalyptus nitens using SNPs and microsatellite markers: a comparative analysis of marker data power and robustness. PLoS ONE 10(7):e0130601
Thumma BR, Southerton SG, Bell JC, Owen JV, Henery ML, Moran GF (2010a) Quantitative trait locus (QTL) analysis of wood quality traits in Eucalyptus nitens. Tree Genet Genomes 6:305–317. https://doi.org/10.1007/s11295-009-0250-9
Thumma BR, Baltunis BS, Bell JC, Emebiri LC, Moran GF, Southerton SG (2010b) Quantitative trait locus (QTL) analysis of growth and vegetative propagation traits in Eucalyptus nitens full-sib families. Tree Genet Genomes 6:877–889. https://doi.org/10.1007/s11295-010-0298-6
VanRaden P (2008) Efficient methods to compute genomic predictions. J Dairy Sci 91(11):4414–4423
Vidal M, Plomion C, Harvengt L, Raffin A, Boury C, Bouffier L (2015) Paternity recovery in two maritime pine polycross mating designs and consequences for breeding. Tree Genet Genomes 11(5):1–13
Wilcox MD (1980) Genetic improvement of eucalypts in New Zealand. NZ J For Sci 10(2):343–359
Wright S (1922) Coefficients of inbreeding and relationship. Am Nat 56(645):330–338
Zapata-Valenzuela J, Whetten RW, Neale D, McKeand S, Isik F (2013) Genomic estimated breeding values using genomic relationship matrices in a cloned population of Loblolly pine. G3 Genes Genome Genet 3:909–916. https://doi.org/10.1534/g3.113.005975

Acknowledgements

Specialty Wood Products Research Partnership (contract no. C04X1104) and Scion Core Funding are acknowledged for funding this study.

Author information

Authors and Affiliations

Scion (The New Zealand Forest Research Institute Ltd.), 49 Sala Street, Rotorua, 3046, New Zealand
Mari Suontama, Jaroslav Klápště, Emily Telfer, Natalie Graham, Toby Stovold, Charlie Low, Russell McKinley & Heidi Dungey

Corresponding author

Correspondence to Mari Suontama.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Suontama, M., Klápště, J., Telfer, E. et al. Efficiency of genomic prediction across two Eucalyptus nitens seed orchards with different selection histories. Heredity 122, 370–379 (2019). https://doi.org/10.1038/s41437-018-0119-5

Received: 18 January 2018
Revised: 11 June 2018
Accepted: 12 June 2018
Published: 06 July 2018
Version of record: 06 July 2018
Issue date: March 2019
DOI: https://doi.org/10.1038/s41437-018-0119-5
