Introduction

Hawaiian flora and fauna are unique in the number of species that are endemic to the archipelago. Many groups of animals and plants have undergone speciation bursts, providing model systems for evolutionary biologists to study the speciation processes. Studies of Hawaiian biogeography (see, for a review, Funk and Wagner, 1995) suggest that the geological history of the archipelago is one of the main causes of frequent speciation events.

Tectonic plate movement over the volcanic hotspot underlying the Hawaiian archipelago results in a continuous generation of new islands at the south east end of the archipelago. The islands erode and subside with time, forming a giant ‘habitat conveyor belt’, with older eroded low islands on the north west, and young active volcanoes to the south east (Carson and Clague, 1995). Continuous changing of the environment on the islands coupled with colonization of the new islands and fragmented geographic range are, probably, conditions that force organisms to change quickly, yielding new species (Hollocher, 1996).

The speciation process for Hawaiian plants is poorly understood. It is not clear whether the speciation occurs due to adaptation to the continuously changing environment (adaptation model of speciation), due to a founder effect during colonization of the new islands (founder effect model of speciation) as suggested by Carson (1968) and Templeton (1980), or whether Hawaiian speciation follows the classical allopatric model of speciation (Mayr, 1963) with geographic barriers to gene flow between populations and slow divergence being the main causes of speciation. Indeed, colonization of a new island may result simultaneously in a drastic change in environmental conditions, in a founder effect and in the creation of geographic barriers to gene flow between new and old populations.

The speciation process has been studied in different groups of organisms endemic to the Hawaiian archipelago with most efforts being focused on Hawaiian Drosophila, which include several hundred species. In many cases, speciation has been found to be due to ‘island hopping’; speciation with new species formed after colonization events (Hollocher, 1996), in which sexual selection played an important role (Carson, 1997). Other groups of organisms have been studied in much less detail, and work has been mostly devoted to the phylogenetic relationships and the biogeography of certain groups of species (Wagner and Funk, 1995). The few available results provide support for the ‘island hopping’ model for most of the studied organisms (Funk and Wagner, 1995).

The genus Schiedea (Caryophyllaceae) includes over 30 species endemic to Hawaii, and represents striking diversity in morphology, breeding system and habitat (Wagner et al, 1995). This genus is a convenient model to study the speciation process, since it represents one of the largest recent adaptive plant radiations on Hawaii and substantial work has been devoted to characterize its phylogenetics and biogeography (Wagner et al, 1995; Weller et al, 1995,1996; Soltis et al, 1996; Sakai et al, 1997). Here, we report the level and the patterns of nuclear DNA diversity and divergence in the populations of S. globosa from Oahu and Maui islands. S. globosa is subdioecious with separate males and females and rare occurrences of hermaphroditic individuals (Weller et al, 1995). Hence a reduction in genetic diversity due to selfing (eg Liu et al, 1999) is limited in this species, simplifying the analysis and the interpretation of the results. Unlike most other Schiedea species, S. globosa inhabits several islands in the Hawaiian archipelago; besides Maui and Oahu it has been found on Molocai and on the Big Island of Hawaii (Steve Weller, personal communication). S. globosa populations from different islands were suggested to be separate species (Soltis et al, 1996), but are currently being treated as populations of the same species because they cannot be reliably distinguished (Steve Weller, personal communication). The data presented here suggest that Oahu and Maui populations of S. globosa are fairly isolated from each other, and are probably diverging into separate species. We also present the nuclear DNA diversity-based inferences of age and population dynamics for these populations.

Methods

Schiedea globosa plants from two Maui populations and one Oahu population, as well as a frozen leaf sample of one S. adamantis plant were kindly provided by Drs S Weller and A Sakai, University of California Irvine, California, USA. The S. globosa samples from Molocai and the Big Island of Hawaii are currently not available (Steve Weller, personal communication). Total genomic DNA was extracted from leaf material according to the CTAB plant miniprep method as described before (Filatov and Charlesworth, 1999).

No gene sequences had been available from the Schiedea genus. In order to isolate new genes from Schiedea, we used Silene latifolia primers XY1+3 and XY1−10 (Filatov et al, 2001), which amplified a region of 3.5 kb from S. globosa. The PCR product was cloned using TA-cloning kit (Invitrogen) and sequenced by a ‘sequence walk’, starting from both ends with the XY1+3 and XY1−10 primers and ordering new internal sequencing primers using the sequence information from the previous sequence reads. As the original Silene PCR primers, XY1+3 and XY1−10 worked only for some individuals, for the population diversity study, we used a different forward primer, SgXY1+31 (GCAACACATACTGACAGTCC) together with the original XY1−10 reverse primer (TCCAGCAGAGCTTGAACAGTCT), which amplified a region of about 3 kb consistently from all the S. globosa individuals. For the PCR amplification, we used High Fidelity kit (Roche) with the following PCR conditions: one cycle of 95°C, 30 s, 58°C, 30 s, 68°C, 4 min followed by 35 cycles of 95°C, 30 s, 56°C, 30 s, 68°C, 3.5 min. The PCR products were extracted from the agarose gels using the Qiagen gel extraction kit and cloned using TA-cloning kit (Invitrogen). Sequencing was performed using ABI BigDye v2 system on the ABI3700 automated sequencing machine with the following primers: SgXY1+31 (see above), SgXY1+33 (CTATGTGCTCTTCTGAACCGC), SgXY1+29 (TTGGACCTCGTGGCATCTAT), SgXY1−30 (CAATTTCAACTTCCTTGGATGTA), SgXY1+34 (GATGCAAGAGTTGGATTGGCAC), SgXY1−27 (AAGCGAGGCAAACCCAAATG), SgXY1+35 (GTCATGATGCTCCTGTCCTTTG), XY1−10 (see above). In all the primers ‘+’ indicates forward and ‘−’ indicates reverse orientation. ABI sequence chromatograms were checked and corrected, and the contigs were assembled using ProSeq v3 software (Filatov, 2002). PCR amplification is not 100% error free, and sequencing of the cloned PCR products may result in a bias due to PCR errors. Therefore, every polymorphic site in the data set was double checked by additional amplification and sequencing. The same primers were used to isolate and sequence the homologous gene from S. adamantis.

Sequences were aligned using ClustalX v1.64 (Thomson et al, 1997) followed by manual adjustment using ProSeq v3 (Filatov, 2002). Estimates of nucleotide diversity, Tajima's D neutrality test (Tajima, 1989), population subdivision and gene flow statistics, and permutation-based tests of significance (Hudson et al, 1992) were performed using ProSeq v3. Both nucleotide substitutions and insertion/deletion (indels) sites were taken into account in these analyses. The significance of the Tajima's neutrality test was assessed by coalescent simulations without recombination using ProSeq v3. The neighbor-joining tree was created using MEGA (Kumar et al, 1993). To conduct the Fs neutrality test (Fu, 1997) and to estimate the time since the population expansion from the pairwise mismatch distribution, we used Arlequin software (Excoffier, 2000). The Fay and Wu's (2000) neutrality test was conducted by a C-program kindly provided by J Fay. The distributions of Tajima's D for different times since population growth were generated using the coalescent simulations tool in ProSeq v3 (Filatov, 2002).

Results

Isolation of the S. globosa gene

As no Schiedea sequences were available, we isolated a new region of about 3 kb long from S. globosa as described in the Methods section. The newly isolated genomic region was homologous to the Silene latifolia SlX1 and SlY1 genes (Delichère et al, 1999; Filatov et al, 2000), thus the new S. globosa homolog was named SgXY1. The regions corresponding to the exons in Silene genes were conserved in the S. globosa homolog, while the introns between Schiedea and Silene genes were almost unalignable. As the open-reading frame in the SgXY1 was preserved, we concluded that this gene might be functional in S. globosa, or have been functional very recently. According to our Southern blot analysis, SgXY1 is a single-copy gene in S. globosa.

The divergence between Silene and Schiedea homologous sequences was estimated to be 45.7% at synonymous sites and 4.54% at nonsynonymous sites. This divergence is consistent with the view that Schiedea is fairly divergent from the other Caryophyllaceae genera (Soltis et al, 1996). The low ratio of replacement over silent divergence (Ka/Ks=0.099) suggests that SgXY1 is not a pseudogene in Schiedea. We also sequenced the same 3 kb long region of the SgXY1 gene from another Schiedea species, S. adamantis. The intron divergence between this species and S. globosa was 1.1±0.2%. No amino-acid substitutions were observed, also suggesting that the Schiedea SgXY1 gene is under purifying selection.

Sequence diversity in S. globosa

The new gene isolated from S. globosa was used to study the DNA diversity in three S. globosa populations, one from Oahu and two from Maui islands. In total, we analyzed sequences of approximately 3 kb long for 35 alleles isolated from 12 plants from Oahu and 23 plants from the two Maui populations (Table 1). Overall, the DNA diversity in S. globosa (πs=0.3%) was lower than in Drosophila (Moriyama and Powell, 1995) and comparable to that in humans (Aquadro et al, 2001). The two Maui populations demonstrated no significant isolation (Fst=0.027, NS), thus in all the further analyses all the Maui sequences will be pooled together. The isolation between the Maui and the Oahu populations is quite high, Fst=0.57, P<0.0001, suggesting that the gene flow between the islands is very low. The diversity on Maui (πs=0.14%) is substantially lower than in the Oahu population (πs=0.24%, Table 1).

Table 1 DNA diversity in Maui and Oahu populations of S. globosa

All the Maui haplotypes cluster together and form a clade within the more diverse Oahu clade (Figure 1), suggesting that the Oahu population is older and, perhaps, Maui was colonized from Oahu. Consistent with this, the diversity in the Oahu population is significantly higher than in either of the two Maui populations, and in both Maui populations pooled (Table 1). The higher diversity in the Oahu population is mainly due to clustering of the sequences into two clades differing by 12 nucleotide substitutions (Figure 1). Although the smaller Oahu clade (including Sg1, Sg6 and Sg12) clusters with Maui haplotypes, this clade is more diverged from the sequences from the Maui population, compared to the larger Oahu clade (average pairwise divergence per nucleotide, Dxy=0.0051 and 0.0033, respectively).

Figure 1
figure 1

Neighbor-joining tree for the S. globosa nuclear gene SgXY1 from Maui (nodes marked with ‘M1’ or ‘M2’) and Oahu (nodes marked with ‘O’) populations, rooted by a single sequence from S. adamantis. The genetic distance (the number of nucleotide substitutions) is shown below the branches. Bootstrap support (%) is shown above the branches.

If the island populations are formed by colonization, we would expect population growth to follow the colonization of a new island. Population growth results in star-like phylogenies with the biased frequency spectrum of polymorphisms (Hudson, 1990; Braverman et al, 1995), which can be detected by negative values of the Tajima's D statistic (Tajima, 1989). Tajima's D is not significantly negative for the Oahu (D=0.34) or for the Maui (D=−0.94) populations. Another statistic, Fu's Fs was shown to be particularly sensitive for the recent population expansions (Fu, 1997). This statistic is based on the number of haplotypes observed in the sample, which is expected to be inflated by a recent population growth. Fu's Fs is not significant for the Oahu population (Fs=4.17). For the Maui population, however, Fs does detect a significant excess of the number of haplotypes (Fs=−3.72, P=0.03), suggesting a recent population growth, or a selective sweep in or near the SgXY1 gene. Fay and Wu's (2000) H statistic was demonstrated to be sensitive to the footprints of a recent selective sweep, but not a population expansion. Fay and Wu's H statistic was not significant in the Maui or in the Oahu populations.

Age of the populations

As these were not significantly negative values of Tajima's D for either Maui or Oahu populations, the results suggest that these populations are sufficiently old for the frequency spectrum to have recovered after the founder effect and the population growth. Assuming a model of an exponential population growth after a founder effect, we can estimate the minimal age of the population as the minimal time since the population growth, which is compatible with the observed values of the Tajima's D statistic. We ran coalescent simulations with exponential population growth between 0.1 and 4Ne generations ago (Figure 2). The observed values of Tajima's D were incompatible with a population growth less than 0.9 and 0.6Ne generations ago for the Oahu and the Maui populations, respectively.

Figure 2
figure 2

Simulated distributions of Tajima's D statistic (Tajima, 1989) for different times since the population expansion. The solid and the dotted lines are the mean and the 2.5 and 97.5% percentiles of the distribution of Tajima's D. The vertical arrows show the minimal and the expected time since the population expansion. Only minimal time since the population expansion is shown for the Oahu population (plot A), as the growth in that population is apparently too old to be detected by the Tajima's D statistic.

Rogers and Harpending (1992) demonstrated that the shape of the distribution of the number of observed differences between the pairs of the nonrecombining DNA sequences (mismatch distribution) reflects the past population dynamics, in particular, population expansion generates a wave in the mismatch distribution. This wave travels to the right with time since the population expansion and the position of the wave allows the timing of the population expansion to be estimated (Rogers and Harpending, 1992). The pairwise mismatch distribution for the Maui and the Oahu S. globosa populations is shown in Figure 3. Both populations show a peak of mismatch distributions around 6 and 11 mismatches for the Maui and the Oahu populations, respectively. To estimate the time and the scale of the population expansion in these populations, we fitted the stepwise demographic expansion model to the data as described by Schneider and Excoffier (1999). The time of the expansion, measured in mutational units, τ=2ut (99% conf. interval), is τ=6.26 (1.32–19.13) for the Maui, and τ=11.18 (6.42–21.16) for the Oahu populations, respectively.

Figure 3
figure 3

Pairwise mismatch distributions (Rogers and Harpending, 1992) for the Oahu and the Maui populations.

Discussion

Despite the abundance of molecular phylogenetic data for many Hawaiian plants (Pax et al, 1997; Baldwin and Sanderson, 1998; Cross and Motley, 2000; Lindqvist and Albert, 2002), there is surprisingly little information on the intraspecific DNA diversity in Hawaiian, and more generally, island plant populations. This is probably because it is generally assumed that intraspecific diversity in the populations of island endemic plants is very low (Barrett and Husband, 1990), which seems quite logical, given that the lack of resolution in DNA-based phylogenies is a common problem in phylogenetic studies of island plant radiations. Hence, the intraspecific genetic diversity data are mostly based on allozyme diversity (Helenurm and Ganders, 1985; Weller et al, 1996; Marr and Bohm, 1999), which may be quite misleading due to possible non-neutrality of charge changing amino-acid replacements creating allozyme diversity (Hudson et al, 1994; Veuille and King, 1995; Eanes et al, 1996; Filatov and Charlesworth, 1999). Here, we reported one of the first DNA sequence-based estimates of genetic diversity in Hawaiian plants. The only other estimate of Hawaiian plant DNA diversity was recently published for silverswords Argyroxiphium sandwicense and Dubautia ciliolata (Lawton-Rauh et al, 2003). The DNA diversity in the S. globosa and silverswords populations appears to be quite similar, π0.03%. This value is also comparable to the DNA diversity observed in selfing Arabidopsis thaliana (Kuittinen and Aguade, 2000; Aguade, 2001), Hordeum vulgare (Cummings and Clegg, 1998) and Pennisetum glaucum (Gaut and Clegg, 1993), outcrossing A. lyrata (Savolainen et al, 2000) and dioecious Dioscorea tokoro (Terauchi et al, 1997), but lower than in outcrossing Leavenworthia stylosa (Filatov and Charlesworth, 1999) and in dioecious Silene latifolia (Filatov et al, 2000,2001).

Although we only studied the diversity at a single locus, our estimate could reflect the average DNA diversity across the genome because the DNA diversity in the S. globosa SgXY1 gene is in close agreement with previous estimates of allozyme diversity in this species (Weller et al, 1996). Indeed, in the Oahu population nonsilent DNA diversity is π=0.9 per 1000 nucleotides (Table 1), or approximately one replacement in 300 amino acids. Assuming the average protein size of 500–1000 amino acids and taking into account that only a fraction of amino-acid replacements is detectable by electrophoresis, we get a similar diversity figure for the allozyme heterozygosity as reported by Weller et al (1996). As the allozyme diversity in S. globosa was demonstrated to be fairly similar to the other species in the genus (Weller et al, 1996), the level of DNA diversity seen in the S. globosa SgXY1 gene is probably typical for many other Schiedea species.

Consistent with the allozyme data (Weller et al, 1996), the diversity in the Maui population appeared to be about two-fold lower than in the Oahu population (Table 1). The Maui populations are not smaller than the population on Oahu (S Weller, persoanl communication), so the lower genetic diversity on Maui may reflect a more recent origin of the Maui populations. The haplotype phylogeny is consistent with this, as the sequences from the Maui populations cluster together within the larger Oahu cluster (Figure 1). If the diversity in the Maui population is reduced due to a recent population bottleneck, this should result in the excess of rare polymorphic sites, which could be detected by the negative values of Tajima's D (Tajima, 1989) or Fu's Fs (Fu, 1997) statistics. Although Tajima's D is not significantly negative, the more sensitive Fs statistic does show a significantly negative value for the Maui population. This could be due to a recent population growth after a bottleneck, or due to a selective sweep in the Maui population. We cannot detect any signature of a selective sweep by the Fay and Wu's (2000) H statistic, which is known to be very sensitive to the selective sweeps; thus, the recent population growth seems a more probable explanation for the negative Fs value in the Maui population.

The age of the populations was estimated using two methods: (i) coalescent simulations with exponential growth (Hudson, 1990) and (ii) the pairwise mismatch distribution (Rogers and Harpending, 1992). Both methods consistently suggest a younger age for the Maui population. The age estimates, however, are in terms of the number of effective population sizes for the first method and in mutational units for the second one. To translate these estimates into the number of generations, we assume per locus per generation mutation rate to be of the order of 10−5. The effective population sizes can be estimated from E(Ne)=E(θ)/4u, and are equal to 0.6 × 105 and 1.4 × 105 individuals for the Maui and the Oahu populations, respectively. For the Oahu population, the expected time since growth exceeds 4Ne generations, allowing only the minimal time since the expansion in the Oahu population to be estimated (Tmin=126 000 generations). For the Maui population, the growth is apparently more recent, allowing the expected and the minimal times since the population expansion to be estimated (Texp=120 000, Tmin=36 000 generations). Assuming the same mutation rate, the pairwise mismatch distribution suggests that the population growth on Maui occurred 310 (70–960) thousand generations ago (KGA), while in the Oahu population it happened around 560 (320–1060) KGA. The estimates of the time since the population expansions obtained by the two methods are not wildly different from each other, and agree with the lower DNA diversity in the Maui population.

We detected a highly significant isolation between the Maui and the Oahu populations (Fst=0.57, P<0.0001), suggesting that the populations on different islands are completely isolated geographically. It seems unlikely that this isolation is due to genetic or ecotypic incompatibilities of the two populations, which are crossfertile (Weller et al, 2001).

No polymorphic sites were shared and only three sites were fixed between the Maui and Oahu populations, with most polymorphisms occurring exclusively either on Maui or on Oahu (17 and 17 sites, respectively). This suggests that the populations have gone through a bottleneck, causing the loss of ancestral polymorphism (hence no shared polymorphisms), and the independent accumulation of mutations in the two populations. On the other hand, the two populations might have diverged fairly recently, so there was not enough time for the populations to accumulate many fixed differences. This is consistent with the scenario of a formation of the Maui population due to a colonization event, which resulted in a substantial loss of ancestral genetic diversity and the subsequent accumulation of differences in isolation. This might be a typical pattern for the entire genus, because most Schiedea species were probably formed after colonization of the new islands (Wagner et al, 1995).

The colonization and bottleneck events may result in a substantial reduction of genetic diversity. On the other hand, in some cases bottlenecks can increase additive genetic variance and hence the capacity of populations to adapt to new environments (eg Fernandez et al, 1995; Meffert, 2000). It remains to be demonstrated whether this is the factor that promotes speciation on islands, as suggested by Templeton (1980), or the new species are formed due to geographic isolation between the islands according to Mayr's (1963) classic model of allopatric speciation. The demonstration of almost complete geographical isolaton of the two island S. globosa populations and the lack of any evidence of adaptation to different environments makes Mayr's (1963) model of allopatric divergence a more plausible scenario for this species.