Introduction

Oceanic archipelagoes provide an excellent setting to study the evolutionary forces that drive speciation and diversification. A reliable time frame for geological events, superimposed on patterns of molecular and/or morphological variation, allows inferences about allopatric differentiation, the predominant mode by which new species arise (Emerson, 2002).

The geological history of the Azores is relatively well documented and offers a wide range of ages (0.04–8 My) for the different islands (Morton et al, 1998). Nevertheless, to our knowledge, no published studies have focused on the inter- and intraspecific patterns of genetic variation in relation to the genesis of the Azores (but see Medeiros et al, 2000). The Azores archipelago consists of nine volcanic islands near the Mid-Atlantic Ridge, spanning an area of 530 (latitudes 36°55′–39°43′ N) by 320 km (longitudes 25°–31°17′). There are three groups of islands: a western group (Flores and Corvo), a central group (Terceira, Graciosa, Faial, Pico and São Jorge) and an eastern group (Santa Maria and São Miguel). The largest island, São Miguel, was formed by two separate islands that coexisted for about 0.550 My and that merged about 50 000 years ago (Figure 1: Forjaz, 1997).

Figure 1
figure 1

The Azores archipelago (a) and detailed map of the island São Miguel (b) with indication of the sampled populations (see Table 1 for codes) of L. a. azorica, pie charts of allele frequencies for loci that differ most between Lae and Law (EST-1, GPI, LAP and PGM respectively) and the different volcanologic regions (I–VI). Ages: I: 4.2–0.95 My; II: 3.2–0.65 My; III: 0.8 My; IV: 0.55 My; V: 0.29 My; VI: 0.05 My (Forjaz, 1997).

Land snails represent an important portion of the endemic fauna on the Azores (Backhuys, 1975). Leptaxis azorica (Albers, 1852) is a hygromiid land snail, endemic in the Azores. Two distinct varieties were originally described from the two (oldest) islands of the eastern Azorean group. These varieties are now regarded as subspecies (Backhuys, 1975).

L. a. azorica (Albers, 1852) occurs on São Miguel and the western island group, while there are no records of the species from the central island group. On São Miguel, isolated populations are found scattered over the eastern and western part of the island but not in the intervening, geologically much younger, lower and more urbanised part of the island (Plataforma dos Picos; Figure 1b). On Flores, L. a. azorica is only known from a limited number of localities (Backhuys, 1975), while no individuals of a population from Corvo (AM Frias Martins, unpublished record, 1976) could be found during recent expeditions. The second subspecies, L. a. minor (Backhuys, 1975), is endemic for the easternmost island ‘Santa Maria’ where it is restricted to the surroundings of Pico Alto.

Using molecular, morphometric and geological data, we aimed at answering the following questions: (1) Can patterns of population differentiation be explained in relation to geological events, colonisation history and/or human disturbance? (2) What is the rate of change and to what extent is differentiation at neutral (allozyme) loci accompanied by morphological changes indicative of reproductive isolation?

Material and methods

Sampling and sample preparation

Animals were collected by hand and stored at −80°C. A total of 424 specimens were collected from 23 locations across the species range. The 19 populations from São Miguel were assigned to either the eastern or western region (Table 1; Figure 1b). For convenience, the following abbreviations will be used to refer to four ‘operational taxonomic units’ (OTUs): Lae (L. a. azorica ‘east São Miguel’), Law (L. a. azorica ‘west São Miguel’), Laf (L. a. azorica ‘Flores’) and Lam (L. a. minor). The three L. a. azorica OTUs together will be referred to as Laa. All animals were dissected on ice under a stereomicroscope. The digestive gland was removed, weighed and homogenised in a 20% aqueous sucrose solution (5 μl/mg tissue). A piece of tissue from the foot muscle was kept at −80°C for future DNA extraction while the remaining soft body parts were stored in 70% ethanol.

Table 1 Sampling localities, sample codes and sample sizes (N) for each population

Allozyme electrophoresis

Vertical polyacrylamide gel electrophoresis (PAGE) was used to screen variation in the following enzyme systems: nonspecific esterases (EST; EC3.1.1.-), aspartate aminotransferase (AAT; EC2.6.1.1), glycerol-3-phosphate dehydrogenase (GPD; EC1.1.1.8), glucose-6-phosphate isomerase (GPI; EC5.3.1.9), 3-hydroxybutyrate dehydrogenase (HBDH; EC1.1.1.30), isocitrate dehydrogenase (IDH; EC1.1.1.42), leucyl aminopeptidase (LAP; EC3.4.11.1), peptidase (PEP; EC3.4.13.18; substrate leu-ala), phosphoglucomutase (PGM; EC5.4.2.2) and superoxide dismutase (SOD; EC1.15.1.1). Staining recipes were modified after Harris and Hopkinson (1976). The software BIOSYS-1 (Swofford and Selander, 1981) was used to calculate allele frequencies, observed (Ho) and expected (He) heterozygosities, mean number of alleles (A), percentage polymorphic loci (P), Nei's (1978) unbiased genetic distances (DNei) and to perform a hierarchical analysis of population differentiation by partitioning FST into a within-region (FSR) and a between-region (FRT) component ((1−FST)=(1−FSR)(1−FRT); Wright, 1978). Population differentiation (FST assuming random or nonrandom mating) and pairwise differences (two-sided test) in allelic richness (A, corrected for sample size), Ho, He, FIS and FST between OTUs were tested (null hypothesis of no difference; 1000 permutations) using FSTAT v2.9.3 (Goudet, 1995). The software package GENEPOP v3.1b (Raymond and Rousset, 1995) was used to evaluate (FIS) deviations from Hardy–Weinberg equilibrium (HWE), pairwise differences in allele frequencies (exact tests for null hypothesis of respectively no heterozygote deficiency and no differences in allele frequency; default options) and to estimate gene flow (via FST=1/(1+4Nm) (Wright, 1978) and private alleles (Slatkin, 1985)). Correlations between genetic and morphometric data were tested for significance using the Mantel test (1000 permutations; goodness-of-fit statistic=stress2) implemented by NTSYS v1.8 (Rohlf, 1993). For each tested parameter, the sequential Bonferroni technique was used to reduce the chance of making type I errors (Rice, 1989).

Genetic distances were evaluated by nonmetric multidimensional scaling (NMDS), combined with a minimum spanning tree (MST) as implemented by NTSYS v1.8 (Rohlf, 1993).

Given the small sample sizes of some populations, all analyses were repeated using both populations from Laf and Lam, but only the largest populations of Lae (nos. 3, 4, 7) and Law (nos. 11, 13, 16).

Morphometry of shell and genital tract

A Wild-Heerbrugg micrometer attached to a stereomicroscope (accuracy: 0.01–1 mm, depending on the magnification) was used to take 16 measurements of the genital tract (Figure 2). For each adult individual we measured the following: length and width (at the vas deferens) of the flagellum (LF and WF), length (between the penis retractor and the vas deferens) and width (at the vas deferens) of the epiphallus (LE and WE), length (between atrium and penis retractor) and width (at the enlargement) of the penis (LP and WP), length and width of the dart sac (LDS and WDS), length of the free oviduct (LFO; between the dart sac and the spermoviduct), length (between dart sac and mucus glands) and width of the vagina (LV and WV), length of the largest diverticulum of the mucus gland (MM), length (between oviduct and bursa copulatrix) of the pedunculus (LPD) and length and width (at the epiphallus/flagellum) of the vas deferens (LVD and WVD). In order to approximate normality and linearity of relations, all variables were log 10 transformed. In snails, overall body size is influenced by environmental factors (see eg Cameron and Carter, 1979), and the size of the genital organs may vary with reproductive state and season (Jordaens et al, 2002 and references therein). Therefore, the influence of body size and possible allometric relation were removed by regressing (model II regression) all log 10 transformed measurements against log 10 LDS. LDS was the least elastic length variable and was thus expected to have the smallest measurement error. The size-free residuals were analysed in a principal component analysis (PCA). The discriminating power of the PCs was then evaluated by using the individual scores of the PCs in a discriminant analysis (DA). The total sample size (N) for the analyses was 230 with N=119, 65, 20 and 26 for Lae, Law, Laf and Lam, respectively. All computations were done with the software package STATISTICA v5.1 (StatSoft Inc., 1995).

Figure 2
figure 2

Camera lucida drawing of the genital tract of L. azorica showing the landmarks of the characters measured (see also text). Abbreviations: A: genital atrium; B: bursa copulatrix; DS: dart sac; E: epiphallus; F: flagellum; FO: free oviduct; M: mucus gland; P: penis; PD: pedunculus; PR: penis retractor; SO: spermoviduct; V: vagina; VD: vas deferens. Scale: 1mm.

Results

Allozymes

A total of 12 putative loci could be scored unambiguously. All parameter values are based on all populations with N>5. These data are highly congruent with the results using only the largest populations (see above). Sample sizes (N), allele frequencies, values for Ho, He, P, A, FIS, FST, Nm, the proportion of unique alleles (U) and deviations from HWE (each locus in each locality) are summarised in Tables 2 and 3. A significant difference in He could be detected (after Bonferroni correction) between Lae and Lam (P=0.005). Values for FIS and FST were significantly different from zero (P=0.001) in all OTUs (over all loci). After Bonferroni correction, no consistent significant deviations from HWE could be detected (tests per locus and population) at any particular locus or population (see Table 2). Except in Laf and Lam (small sample size), FST also differed significantly from zero (P=0.001) when tested under the assumption of nonrandom mating. All loci besides SOD, PEP-1 (both nearly monomorphic), PEP-2 and GPD (fixed in Laa and Lam, respectively) showed significant differentiation of allele frequencies between populations, for the most part between populations from different OTUs. In agreement with this, mean values for Nei's (1978) unbiased genetic distances are much smaller within (mean DNei: 0.036–0.047), than between OTUs (DNei: 0.163–0.380). Genetic distances between all populations are represented graphically in Figure 3. On a total of 47 alleles over all loci, 23 alleles (49%) were shared among all OTUs with Lam accounting for the major part (percentages calculated within each OTU) of unique alleles (42%; high frequency), followed by Law (21%; low frequency), Laf (16%; high frequency) and Lae (4%; low frequency). Remarkable is the high frequency of a private allele ‘C’ for locus GPI in population Law1, the easternmost population of Law. The hierarchical F-statistics (Wright, 1978) confirmed the strong differentiation between Lae and Law (FST=0.388) on São Miguel. The between-region variance component was higher (59%; mean FRT=0.228) compared to the within-region component (41%; mean FSR=0.208). Although statistically not significant between any OTUs, FST is markedly higher among populations of Lae (mean FST=0.307) and Laf (mean FST=0.263) than among populations of Law (mean FST=0.152) and Lam (mean FST=0.014). The lower values for FST in Lam and Law result in Nm>1 (17.6 and 1.4, respectively), but this is not confirmed by estimates via private alleles.

Table 2 Sample sizes (N), allele frequencies and inbreeding coefficients (FIS) for each locus and percentage polymorphic loci (P), mean number of alleles (A), observed (Ho) and expected (He) heterozygosities and FIS for each population
Table 3 Mean values for each OTU of percentage of polymorphic loci (P), mean number of alleles (A), percentage unique alleles (U; calculated within each OTU) and their mean frequency (p), observed (Ho) and expected (He) heterozygosities, inbreeding coefficients (FIS), fixation indices (FST) and mean number of migrants per generation (Nm) via FST and private alleles (not present in Lae)
Figure 3
figure 3

Plot of Nei's (1978) unbiased genetic distance among all populations of L. azorica (N>5) via NMDS (three dimensions; stress=0.073) with a superimposed MST. Population codes are given in Table 1.

Morphometry

The PCA of the size-free residuals of the 15 measurements of the genital tract yielded two PCs with eigenvalues >1 and a third PC with an eigenvalue ≈1, together explaining 76% of the observed variation (Table 4). The first PC accounted for 44% of the total variation and represents the shape of the male parts (spermatophore producing organs; LE, LP, LVD) and the pedunculus (spermatophore receiving organ; LPD). PC2 explained 26% of the total variation and was strongly negatively correlated with some measurements, mainly those defining the shape in parts of the genital tract that are not directly involved with the transfer of the spermatophore (MM, LFO and WDS). PC3 explained 6% of the total variation and was strongly positively correlated with LV. Using a DA, 70% of the individuals could be classified correctly into the appropriate OTU (Table 5). However, the majority of individuals of Law were erroneously classified with Lae (43%), whereas many individuals of Laf were classified as Law (30%) or Lam (40%).

Table 4 Eigenvalues, percentage explained variation and factor loadings for the first three principal components of a PCA on 15 measurements (size-free residuals) of the genital tract
Table 5 Classification table of a discriminant analysis based on individual scores for the first three principal components from the PCA

Discussion

Our study represents an example of (vicariant) speciation in close concordance with the geological history of the Azorean islands. Most genetic variation present in L. azorica is contained in four OTUs (Lam, Lae, Law and Laf) that coincide with (former) islands. The genetic distances between the four OTUs are typical for congeneric (allopatric) species in several families of pulmonate land snails (eg Boato, 1988; Chiba, 1993; Johnson et al, 2000). Nevertheless, genetic distances do not provide a yardstick for discriminating between taxonomic levels, and considerable differentiation between conspecific populations is not uncommon in land snails (eg Selander and Kaufman, 1975; Ochman et al, 1987).

Additional evidence for (ongoing) speciation is the reproductive isolation that may be inferred from the morphometric differences in the genital tract. Although even large morphometric differences do not necessarily lead to reproductive isolation (eg Johnson et al, 1993), it has been argued that small length differences between the spermatophore producing and receiving parts of the sexual organs of copulating pairs may cause (partial) reproductive isolation (Madec and Guiller, 1993; Jordaens et al, 2002). Although the former two islands that constitute São Miguel have been connected for 50 000 years, homogenisation or more extensive mixing of the gene pools in the two regions has not occurred (Figure 1b; Table 2). The only indication for gene flow between Lae and Law comes from the observation of a private allele at high frequency in population Law1 (locus GPI, allele C), which could be tentatively interpreted as a ‘hybrizyme’ (Figure 1b; Table 2). An increasing frequency of rare electromorphic variants towards the contact zone between hybridising taxa is a commonly observed phenomenon (see eg Woodruff, 1989) but needs further investigation for L. azorica on São Miguel. Although occasional contact may be possible, our results show that there is essentially no gene flow (Nm<1) going on between the populations of Laa, certainly not between Lae and Law on the island São Miguel. This could be expected, given the limited dispersal capabilities of land snails (eg Schilthuizen and Lombaerts, 1994) and the high degree of habitat fragmentation in eastern São Miguel (Mitchell-Thomé, 1976). The younger region between East and West São Miguel, that is, the Plataforma dos Picos (Figure 1b), is the most urbanised part of the island and it may also lack suitable habitat because of its lower elevation (Forjaz, 1997) and its different microecological conditions. The populations sampled in the western region (Law) are all located in the green belt around the Caldeira of Sete Cidades (Figure 1b), which could explain the lower values for FST and concomitant higher levels of gene flow (Nm>1). On the island Santa Maria, the lack of differentiation (and gene flow estimates >>1) between the two putative populations of Lam, probably indicates that they represent one panmictic unit. The high value for FIS in Lam, also after increasing the sample size by pooling both populations, could be explained by the fact that these animals are most commonly found on the invasive, exotic plant, Hedychium gardneranum. Regularly, large areas are cleared in order to control the spread of this plant (Van Riel et al, 2000) probably provoking the loss of numerous individuals and thereby increasing the oppurtunity of genetic drift. However, estimates of variability and gene flow for Lam, and also Laf, must be seen in perspective, given the small sample sizes for these OTUs.

Rates of allozyme change (DNei/time) calibrated on the upper time limits (2 and 4 My, respectively) for the split between Laa on São Miguel/Flores and Laa/Lam (0.101±0.035 and 0.072±0.022 per My, respectively) are similar to that calculated for Partula from the Society Islands (0.126±0.062 per My; Johnson et al, 2000). However, given the maximum time of 0.55 My for differentiation between Lae and Law, the rate of allozyme change (0.325±0.107 per My) by far exceeds the previous values. Of course, a considerable amount of additional error is introduced in the rate calculations by the fact that colonisation of an island does not necessarily coincide with its emergence. However, the absence of L. a. azorica on the central Azorean islands is less difficult to explain if we assume that colonisation of the western islands took place before the former arose above sea level. Furthermore, graphs based on genetic distances are compatible with colonisation that proceeded from the older, easternmost islands further westward (Figures 1 and 3). Most authors agree that Flores (2 My) is older than most of the islands of the central group (range: ±0.04–1 My, except Terceira: ±2–4 My) (Morton et al, 1998). Higher evolutionary rates and/or divergent selection have been invoked as possible mechanisms to bring about extreme intraspecific genetic divergence in, for example, mitochondrial DNA sequences of some helicid land snails (eg Hayashi and Chiba, 2000; Thomaz et al, 1996). In general, a higher susceptibility for genetic drift and natural selection in land snails can be related to their generally patchy distributions caused by their limited dispersal ability and/or dependence on specific (micro-) ecological conditions (eg Boato, 1988; Schilthuizen and Lombaerts, 1994).

Nevertheless, while there is no a priori reason to assume rate heterogeneity and/or the existence of coadapted gene complexes (eg Davison and Clarke, 2000) within Laa, our results show some evidence for stronger effects of genetic drift in the populations of Lae (and Laf) which are characterised by lower values for A and H (see eg Leberg, 1992). Fast differentiation and loss of genetic variability is often attributed to genetic drift involved with the (re)colonisation of new areas, especially in organisms with limited dispersal capabilities (see eg Ibrahim et al, 1996). Population isolation and bottlenecks may also be related to the destruction of suitable habitat by volcanic and/or human activities. São Miguel is most densely populated and has suffered most from volcanic activity in relatively recent times (1563, 1630, 1652), especially the eastern region (Mitchell-Thomé, 1976; Morton et al, 1998). The devastating effects of human activities on land snail communities (eg Goodfriend et al, 1994; Van Riel et al, 2000) undoubtedly have their repercussions on population genetic diversity (eg Leberg, 1992; Ibrahim et al, 1996). Recently, many studies addressed evolution on (volcanic) oceanic islands using a molecular phylogenetic approach (Emerson, 2002). As geological and biological data accumulate, patterns of variation within single islands are more and more related to vicariant events caused by volcanic activity in relatively recent times. Isolation and eradication of populations by lava flows have been invoked to explain phylogeographic structure in, for example, anoles (Malhotra and Thorpe, 2000), beetles (Emerson et al, 1999) and land snails (Shimizu and Ueshima, 2000). The fusion of two or more islands can explain the existence of different DNA lineages on, for example, the Canarian island Tenerife (Juan et al, 2000). On São Miguel, additional evidence for such a scenario comes from the preliminary results of a genetic analysis of another endemic land gastropod, that is, the slug Plutonia atlantica (Medeiros et al, 2000). Currently, mitochondrial and nuclear DNA sequences of L. azorica are being analysed in order to test the hypotheses with respect to colonisation and diversification, which have been outlined in this paper.