Introduction

Sexual reproduction is the general rule in higher plants and animals (Bell, 1982; Barton and Charlesworth, 1998; West et al., 1999). However, some species exhibit an apomictic or parthenogenetic reproduction in which the offspring derives from only one parent, the mother (Foroughi-Wehr and Wenzel, 1993). According to McKone and Halpern (2003), very rare cases of androgenesis have been observed in animals. So far, Cupressus dupreziana A. Camus (Cupressaceae) is the only plant species where ‘paternal apomixis’ has been reported (Pichot et al., 2001).

Cupressus dupreziana (or Tassili cypress) is a highly endangered woody species. It is represented by only one natural population in the Tassili N’Ajjer desert, Algeria, and the number of surviving individuals is limited to 233 (Abdoun and Beddiaf, 2002). This species can easily be distinguished from C. sempervirens L. (common cypress or Mediterranean cypress) on the basis of a multi-locus profile obtained using AFLP (Pichot et al., 2008) and nuclear microsatellites (transferred from Sebastiani et al., 2005). The precarious state of the species results from a combination of extreme climatic conditions, long-term human pressure and extremely scarce regeneration. The species always exhibits a low seed viability rate, which can be related to the abnormalities of its sexual reproductive process. The aberrant meiosis that occurs in both micro and macrospores leads to the development of diploid pollen grains (Pichot and El Maâtaoui, 2000) and diploid-based megagametophytes (Pichot et al., 1998). The embryogenic ability of C. dupreziana pollen was first demonstrated in inter-specific pollination experiments. All progeny produced by controlled pollination of the Mediterranean cypress (Cupressus sempervirens) using C. dupreziana pollen were pure C. dupreziana seedlings (Pichot et al., 2001). Reciprocally, the ability of C. dupreziana to act as a surrogate mother for the development of all-paternal C. sempervirens progeny was more recently demonstrated (Pichot et al., 2008). Indeed two-thirds of the seedlings produced by open pollinated C. dupreziana trees planted in a South Eastern France collection were pure C. sempervirens individuals. Most of these seedlings were haploid, which was consistent with the fact that they would derive from haploid C. sempervirens pollen grains without fecundation. On the other hand, the remaining seedlings were diploid and their ontogenic origin remained unknown (Pichot et al., 2008).

In this paper we infer the genetic origin of the diploid all-paternal C. sempervirens seedlings using codominant microsatellite markers (single sequence repeat (SSR)) for which we verified the nuclear inheritance. In addition to the progeny included in the study by Pichot et al. (2008), new progeny originating from either open or controlled pollinations were analyzed. The observed genetic profiles allowed inference of the ontogenic origin of seedlings.

Materials and methods

Plant material

To determine their genetic origin, 63 diploid all-paternal C. sempervirens-like seedlings derived from C. dupreziana seeds were analyzed (Table 1). Most progeny (57) came from open-pollinated C. dupreziana seed trees planted in five ex situ collections (South Eastern France) as described by Pichot et al. (2008). In three of these five cypress collections (Avignon, Carpentras and Montpellier), C. dupreziana seed trees were very close to numerous C. sempervirens pollen trees (more than 30). In Ruscas collection, the only one C. dupreziana seed tree was surrounded by more than 50 C. sempervirens and 50 Cupressus arizonica pollen trees. The last site (Esterel collection) was a pure C. dupreziana collection at a distance of about 2 km from the closest C. sempervirens and C. arizonica trees.

Table 1 Origin and genetic diversity of the 63 diploid Cupressus sempervirens progeny produced by 18 Cupressus duprezian a seed trees. The within-site number of single sequence repeat (SSR) alleles and expected heterozygosity (He) are computed from the seedling genotypes and represent the C. sempervirens pollen-tree diversity, because of the all-paternal origin of the progeny

Six progeny produced by controlled pollinations of one C. dupreziana seed tree with pollen collected from three C. sempervirens trees were also analyzed. Seedling ploidy level was assessed by flow-cytometry upon following the protocol described by Pichot and El Maâtaoui (2000).

Molecular marker selection and inheritance

Seedlings were genotyped using seven nuclear microsatellites (SSRs) developed for C. sempervirens: Cyp52, Cyp84, Cyp101, Cyp174, Cyp257, Cyp258 and Cyp293 (Sebastiani et al., 2005). These markers were highly polymorphic in C. sempervirens populations (Valgimigli, 2005), providing high resolution for paternity analysis and heterozygote detection, but their putative nuclear inheritance had never been verified. As known marker inheritance is a prerequisite for studying the genetic origin of progeny, Mendelian segregation of the SSR markers was tested by analyzing the genotypes of 77 C. sempervirens seedlings from five full-sib families, including two reciprocal crosses, and the six parents of the crosses (see Supplementary information Table 2). This material was selected from a C. sempervirens 9 × 9 diallele mating design (unpublished data). Leaf samples were collected from 5-year old individual C. sempervirens planted in a progeny trial near Avignon, South Eastern France.

Genotyping protocol

Genotyping was carried out in two different laboratories, at CNR-Firenze (Italy) (first set of samples) and at INRA-Avignon (France) (second set of samples). The first set of samples included all the seedlings used for the marker inheritance analysis and 31 of the 63 diploid all-paternal individual seedlings were derived from C. dupreziana seed trees. The remaining 32 diploid all-paternal individual seedlings were included in the second set of samples. DNA was extracted from leaf samples using the DNeasy plant kit (Qiagen, Venlo, The Netherlands).

For the first set of samples, PCRs were carried out in 25 μl containing 10 ng of DNA, 1 × PCR reaction buffer (Promega, Madison, WI, USA), 200 μM of each dNTP, 1-U Taq polymerase (GoTaq, Promega), 1.5 mM MgCl2, and 0.2 μM of each primer. The forward primers were labeled with FAM, HEX or TAMRA. The PCR thermal profile (except for Cyp258, see Sebastiani et al. (2005)) was as follows: denaturation at 94 °C for 5 min followed by 30 cycles of 94 °C for 30 s, 50 °C for 30 s, 72 °C for 30 s and a final extension of 72 °C for 8 min. PCRs were carried out using a Perkin–Elmer GeneAmp 9700 thermal cycler (Applied Biosystems, Foster City, CA, USA). The PCR products were separated by capillary electrophoresis, with a 400 bp size standard, using MEGABACE 1000 (GE Healthcare, Chalfont, St Giles, UK) automatic sequencer. Alleles were sized using FRAGMENT PROFILER version 1.2 (GE Healthcare).

For the second set of samples, three multiplex PCRs (Qiagen), Cyp52 (700 nm)+Cyp84 (800 nm), Cyp101 (700 nm)+Cyp174 (800 nm), Cyp257 and Cyp258 (700 nm)+Cyp293 (800 nm), were carried out in 10 μl containing 0.2 μg of DNA, 5 μl of PCR Master Mix, 1 μl of Primer Mix (0.2 μM each primer), 1 μl of Q-Solution and 1 μl of RNase-free water. Microsatellite loci were analyzed on a LI-COR (4200 IR2) automatic sequencer. The size of alleles was determined automatically and (or) manually with the SAGA (LI-COR IR2) computer software.

To check for consistency in allele size estimation across the two laboratories, a common set of PCR products from 19 samples was run on both automatic sequencers. These 19 samples, for which a perfect match of the length sizes of the amplified fragments was observed, were used as references in all runs allowing unambiguous comparison of the data obtained in the two laboratories.

Statistical analysis

Inheritance of markers

The presence of one maternal allele and one paternal allele in all of the C. sempervirens seedlings analyzed made it possible to verify the biparental inheritance of the SSR markers. Moreover allele segregations within full-sib families were analyzed and compared with the expected nuclear Mendelian segregations. The probability of each observed segregation (P-value) was computed from the distribution of the segregations produced by a Monte Carlo procedure (1000 replicates) under the Mendelian inheritance hypothesis. All computations were performed using the R statistical software (R Development Core Team, 2006).

Probability of single-band SSR profiles

Parent tree SSR genotypes were available for the six progeny produced by controlled pollination. For the other 57 progeny, pollen tree genotypes were unknown and the probability of SSR profiles can, therefore, only be computed from population allele frequencies. Hypothesizing a random mating of gametes, we computed the probability of the single-band profiles that we frequently observed in the diploid all-paternal progeny (see Results section). Computations were carried out using two sets of SSR allele frequencies estimated (a) from the 63 progeny analyzed in the present study and (b) from some Italian cypress populations (Valgimigli, 2005), as an additional control. Indeed, previous results which were based on isozymes demonstrated that French and Italian populations display similar levels of genetic diversity (Dugas, 1996). The probability P(s1) of a single-band profile at locus l is the probability of two identical alleles (homozygosity) plus the probability of only one null allele:

where Pi is the frequency of the non-null allele i and Pnull the frequency of the null allele.

Hypothesizing no linkage disequilibrium among loci, the probability P(S71) of a single-band profile over the seven loci is:

Therefore, the probability P(s71_n63) of observing n single-band individuals among the 63 studied seedlings is:

where (n63) is the number of combinations of n events from 63.

Results

Marker inheritance

Owing to the lack of polymorphism at one SSR marker (Cyp293), segregations could be analyzed for only six of the seven-microsatellite loci. No significant differences between direct and reciprocal crosses were observed; consequently all the progeny produced by the same parents were pooled to increase the sample size and consequently the statistical power of the segregation tests.

Segregations were tested in the three full-sib families for four loci (Cyp84, Cyp101, Cyp257 andCyp258) and in 2 full-sib families for 2 loci (families Cs129 × Cs32 and Cs48 × Cs62 for Cyp52; families Cs79 × Cs18 and Cs48 × Cs62 for Cyp174) (allele segregation raw data are provided in supplementary information). The observed progeny profiles were always consistent with those of their parents, except in four cases for which the presence of one null allele at one parent genotype was hypothesized. The Mendelian segregation hypothesis was never rejected. The P-values computed from Monte Carlo simulations were highly significant except in two cases (cs79 × cs18 family for Cyp101 and cs48 × cs62 family for Cyp258) where probabilities were 8 and 17%, respectively.

SSR profiles of the C. sempervirens progeny produced by C. dupreziana seed trees

High allelic diversity was observed among the 63 diploid C. sempervirens seedlings originating from C. dupreziana seed trees. The number of alleles per SSR locus varied from 6 to 8 and within site diversity was always high (Table 1). Frequencies of null and non-null alleles were estimated from progeny genotypes (Figure 1). Only three seedlings exhibited double-band profiles at 2, 4 or 5 loci. The 60 other seedlings had single-band (or no band) profiles and most of them were unique (57 out of 60). Three single-band profiles were observed twice (see raw data as supplementary material). The absence of bands at three loci (Cyp101, Cyp257 and Cyp293) for respectively 4, 1 and 1 seedlings was attributed to the presence of two homologous null alleles (amplification of these samples was repeated three times).

Figure 1
figure 1

Cumulated single sequence repeat (SSR) allele frequencies estimated in French Cupressus sempervirens populations from the 63 all-paternal progeny produced by Cupressus dupreziana seed trees. Grey scale boxes (from bottom to top): allele frequencies ordered by increasing DNA fragment size. Top boxes of the Cyp101, Cyp257 and Cyp293 markers correspond to null allele.

Using allele frequencies estimated for French populations (present study) or for Italian populations (Valgimigli, 2005), the probabilities of observing one single-band genotype over the seven SSR loci in a random mating population were 0.0019 and 0.0035, respectively. Consequently, the probability to observe 60 single-band genotypes among 63 samples is close to zero (1e−159 and 2e−143). Although there is no evidence for relaxing the hypothesis of linkage equilibrium among loci, some disequilibrium would not strongly affect this computed probability.

The genotypes of the six progeny produced by controlled pollination confirmed the all-paternal origin of these seedlings (see raw data). Indeed progeny produced by parents that do not share alleles exhibited only pollen-tree alleles: a single allele was transmitted to the progeny.

Discussion

Inheritance of the SSR markers

The analysis of allele segregations among the 77 C. sempervirens progeny produced by controlled crosses of C. sempervirens parents confirmed the putative Mendelian inheritance of the SSR markers developed by Sebastiani et al. (2005).

Mendelian inheritance and high polymorphism make nuclear SSR markers very efficient for the evaluation of heterozygosity (vs homozygosity) in diploids (Chani et al., 2000; Hofer et al., 2002; Fei Yu et al., 2004; Vanwynsberghe et al., 2005). However, null alleles are frequently observed and have to be considered for genetic analysis (Dakin and Avise, 2004; Hoffman and Amos, 2005).

SSR allele diversity

The high allelic polymorphism observed in the two sets of analyzed progeny (from C. sempervirens controlled crosses and from C. dupreziana seed trees) is in agreement with previous results reported for C. sempervirens populations (Valgimigli, 2005). Genetic diversity in French cypress populations is similar to that in Italian populations. The null allele frequencies estimated in this study for French populations are of the same magnitude as those found by Sebastiani et al. (2005) (from 0.06 to 0.19 for loci Cyp52, Cyp139, Cyp258 and Cyp293).

Ontogenic origin of the diploid all-paternal C. sempervirens seedlings

The SSR profiles of the six seedlings produced by controlled pollinations unambiguously confirm the all-paternal origin of the seedlings. Indeed alleles that are present in only one of the two parents, either the C. sempervirens pollen tree(s) or the C. dupreziana seed tree, demonstrate the absence of a genetic contribution from the seed tree.

According to previous observations, C. sempervirens produces haploid and uninucleated pollen grains (Pichot and El Maâtaoui, 2000). Diploid pollen was not observed in this species. Consequently, the diploid all-paternal C. sempervirens seedlings are likely to derive from the diploidization of one haploid male gamete or from the fusion of two haploid male gametes deriving from one or two pollen grain(s). Indeed, in most conifers each haploid microgametophyte produces several male gametes through mitotic divisions (12 in C. arizonica and 20 in Cupressus goveniana according to Gaussen et al., 1982). The lack of double-band SSR profiles in the progeny originating from heterozygous pollen trees (eight occurrences) suggests a single pollen grain origin of the diploid seedlings. Consequently, the all-paternal C. sempervirens homozygous seedlings are likely to derive from a haploid pollen genotype, either by the diploidization of an initially haploid embryo or by the fusion of two gametes, with identical genotypes, produced by the same pollen grain.

A similar conclusion can be drawn from the 57 progeny produced by the open-pollination of C. dupreziana seed trees. A very high allelic polymorphism was observed within each of the five locations, where seeds were collected and all but three genotypes were unique. These results indicate that numerous pollen trees contributed to the pollen clouds. Most of these progeny (54) exhibited single-band SSR profiles, which is not compatible with a bi-parental origin. The high SSR allele polymorphism observed within each stand (Table 1) means that the likelihood of even one individual with a single-band profile is very low. Computations based on the allele frequencies estimated in Italian cypress population lead to the same conclusion. Moreover, the all-paternal origin of the seedlings allows rejection of the hypothesis of self-pollination that would have increased the probability of observing single-band genotypes. Consequently, we conclude that single-band progeny originate from single-pollen grains.

Excluding the possibility that seed handling or labeling errors occurred, the production of three heterozygous C. sempervirens diploids provides evidence for the fusion of two male gametes produced by two different pollen grains. The very low proportion of these heterozygous individuals (3 out of 63) suggests that most of the homozygous C. sempervirens seedlings originate from the diploidization of a haploid gamete. Therefore, both processes of male-gamete fusion and of male-gamete diploidization must contribute to the production of diploid C. sempervirens progeny from C. dupreziana seed trees.

Use and importance of homozygous lines

Pure lines are intensively used for selective breeding purposes. They allow for a very efficient estimation of parent-breeding additive value and trait heritability (Gallais, 1990; Bijma et al., 1997). When fertile, they are easily multiplied and produce gametes with an invariable genotype allowing the production of identical offspring such as F1 hybrids. They are also of great interest for genetic mapping, QTL analysis (Picard et al., 1994) and genome sequencing (Komen and Thorgaard, 2007), and facilitate the study of gene expression because recessive alleles cannot be masked by dominant ones as occurs at heterozygous loci (Fu and Jansen, 2006).

In the wild, diploid organisms generally originate from the sexual reproduction of two genetically distinct parents and are mostly heterozygous. In normal sexual reproduction, full homozygous individuals can only be produced either from homozygous parents by selfing or by cross-pollination within a pure line. For allogamous species, sexual reproduction between genetically related individuals shows deleterious recessive mutations leading to inbreeding depression and natural selection acts to purge this genetic load (Charlesworth and Charlesworth, 1987, 1999; Keller and Waller, 2002). The reduced growth and survival rates observed in the diploid all-paternal C. sempervirens progeny as compared with wild C. sempervirens progeny used as controls was attributed to inbreeding depression (Pichot et al., 2008). Inbreeding depression effect is more rapidly observable in diploidization than in selfing because selfing only halves the number of heterozygous loci per generation, whereas diploidization directly produces full homozygous offspring.

In the present study, the good growth performances of the three heterozygous all-paternal C. sempervirens seedlings compared with those of the homozygous diploids are in agreement with this hypothesis, but need to be confirmed with larger sample sizes.

The artificial production of homozygous lines was historically based on several generations of selfing (that only halves the number of heterozygous loci at each generation) or mating between relatives when selfing is not possible. Today it mainly relies on the diploidization of haploid lines produced by gyno- or androgenesis from haploid tissues (Hosp et al., 2007). According to Forster et al. (2007), 50% of the European barley cultivars are produced by a doubled haploid system. In plants, most haploid lines derive from the in vitro culture of anthers or immature pollen grains. Haplo-diploid individuals are also produced in some animal species such as fishes, using irradiated gametes (Komen and Thorgaard, 2007). This strategy deeply shortens the breeding period especially for species exhibiting long-term cycles. It is frequently used in angiosperms (Forster et al., 2007), including trees (Bueno et al., 2004; Assani et al., 2006; Germana, 2006) but has not succeeded to date in gymnosperms. In this paper, we demonstrate that a new strategy is available for C. sempervirens for the production of haploid or diploid homozygous lines by androgenesis in a C. dupreziana surrogate mother. This strategy should improve the efficiency of future breeding programs for the production of ornamental or windbreak C. sempervirens varieties and could be tested in other related species. Our findings also confirm the intrinsic embryogenic ability of pollen grains as frequently reported in angiosperms.