Introduction

Conservation of genetic diversity in natural fish populations has long been a challenge (several articles in STOCS, 1981), particularly since the impact of non-native species introductions has been noticed. Among the questions opened by these introductions is the impact of restocking on the genetic integrity of aboriginal populations. For fish, non-native hatchery-reared individuals are stocked abundantly in rivers for recreational purposes. One long-term effect of these practices could result in the loss of genetic originality of wild populations, by introgression with hatchery-born fish (Hindar et al., 1991).

Different types of genetic markers have been developed to study the interactions of domesticated and native fish. The genetic impacts of restocking on natural populations have been documented with some precision using data from diagnostic enzyme loci in the brown trout, Salmo trutta (Guyomard, 1989; Garcia-Marin et al., 1991). In this taxon, two diagnostic loci allow sorting of the geographical forms identified in France (Krieg & Guyomard, 1985): the southern form found in Mediterranean watersheds, and the Atlantic form found elsewhere and to which most hatchery stocks belong. There is evidence of introgression into the gene pool of wild Mediterranean trout populations in several river drainages (Barbat-Leterrier et al., 1989; Poteaux & Berrebi, 1997). These studies have usually showed that despite the intense restocking practices, the genetic impact of introductions remains rather low.

The origin of hatchery stocks and their introgressive impact on native populations of brown trout can easily be assessed only in the Mediterranean basin, with two diagnostic protein-coding loci and one mtDNA haplotype (Bernatchez et al., 1992), but these genetic markers are too few for a precise estimation. In addition, there is evidence that enzyme loci may be subjected to selective pressures (Poteaux et al., 1998b) likely to modify their introgressive behaviour. Moreover, the polymorphism level available with protein-coding loci may be a limiting factor for the estimation of effective size, a parameter linked to heterozygosity and mutation rate (Crow & Kimura, 1970), and important in assessing the effect of domestic gene flow. For all these reasons, highly variable markers such as microsatellites, which are supposed to be neutral in most instances, may provide a convenient level of polymorphism.

The degree of divergence of Mediterranean trout from Atlantic domesticated forms is quite high for populations of the same species (Nei's distance = 0.1 for allozymes). Thus, we expect to find a reasonable number of informative alleles, whatever the level of polymorphism. Microsatellite loci are highly suitable for population studies because they are PCR-detectable, and because alleles differing by 1 bp can be separated on high-resolution gels. Estoup et al. (1993) have isolated a number of microsatellite loci for a biogeographical study of European trout. Indeed, these authors have shown that some microsatellites possess alleles discriminating between the Atlantic and the Mediterranean forms. However, few individuals per locality were analysed, so detailed population structure could not be studied.

This paper represents a more quantitative attempt to study with microsatellite loci the genetic structure of disturbed trout populations, limited in scope to a small river drainage in southern France. Within this drainage we compared the polymorphism and the genetic structure of a sample from the main river, which is annually restocked, with a sample from a tributary for which restocking has not been reported for several years. A previous study of the same samples with protein-coding loci has shown that variability and introgression level were higher for the restocked sample (Poteaux et al., 1998b). Furthermore, the analysis by RFLP of the control region of mtDNA also confirmed that the level of introgression of the restocked population was twice that of the undisturbed population and twice that given by protein-coding loci (Poteaux et al., 1998a). Here, we verify the power of microsatellite loci for measuring population differentiation and their suitability for estimating introgression from domestic strains.

Materials and methods

Samples

Four samples of French brown trout were electrofished in October 1993: two populations from the Orb system which drains into the Mediterranean – Orb river (N = 94) and Tes river (N = 98); and two hatchery stocks that were usually used to restock this drainage – La Canourgue, in the Lozère region (N = 50) and Brassac, in Tarn (N = 29). Details of the populations sampled and the stocking practices are described in Table 1. We compared the populations from the main river (Orb), which is intensively restocked, to its tributary (Tes) where restocking was stopped at the end of 1987. In both natural populations, two age-classes were determined from both scale-readings and length-frequency distributions: the fry group, designated as cohort 0+, and all the other fish combined, designated as group 1++.

Table 1 Sample sizes of brown trout for the Orb drainage and the hatcheries

Total genomic DNA was isolated from blood in TNES-urea (10 mM Tris–HCl, 10 mM EDTA, 2% SDS, 0.3 M NaCl, 4 M urea, pH = 8), or from muscle in TE (10 mM Tris–HCl, 1 mM EDTA, pH = 7.4) buffer after overnight incubation at 55°C with proteinase K (100 μg/mL at final concentration) and extracted by the phenol–chloroform method. Other organs were collected and frozen for allozyme studies.

Microsatellites

A partial genomic library was constructed from high-molecular weight DNA of domestic trout, according to Garcia de Leon et al. (1995). Recombinant clones of 300–800 bp size were gridded out on duplicate plates and clones containing microsatellites were identified using the library screening procedures described in Garcia de Leon et al. (1995). Positive clones were sequenced using the Sequenase kit (USB). PCR primers were designed to anneal inside the 30–50 bp flanking the microsatellite array, using the program PRIMER, version 0.5 (S. E. Lincoln, M. J. Daly & E. S. Lander, unpublished).

Two of the 17 microsatellite loci obtained, hereafter designated Strutta-12 and Strutta-58, were chosen for population analysis. The details of the primer sequences for each microsatellite locus and its motif are presented in Table 2. Annealing temperature and MgCl2 concentration were optimized for each locus. PCR reactions (15 μL) contained 40–80 ng of total DNA, 0.25 units of Promega Taq polymerase, 140 ng of one unlabelled primer and 140 ng of the other 33P end-labelled primer, 1.5 mM MgCl2, 0.2 mM of each deoxyribonucleotide, 1 × buffer. The PCR programme consisted of an initial denaturation step (94°C for 3 min), followed by 30 cycles of 92°C for 45 s, 55°C for 45 s, and a final extension of 5 min at 72°C. A third locus, MST-73, was taken from a previous study (Estoup et al., 1993) and DNA amplifications were performed following their protocol. This locus was chosen because it discriminates between hatchery trout and wild Spanish trout from the Mediterranean basin.

Table 2 Microsatellite characteristics, structures, primer sequences and number of alleles per locus

PCR products were mixed with 2/3 volume of formamide loading buffer (Sambrook et al., 1989) and denatured at 95°C for 5 min, before electrophoresis in 6% acryl-bisacrylamide/8 M urea sequencing gels. The M13-plasmid sequence was used as an external size standard.

Statistical analyses

Expected heterozygosity values were computed as unbiased estimates (Nei, 1978). All calculations were performed using the computer package GENETIX 3.0 (Belkhir et al., 1996–98). Genetic differentiation was assessed using Wright's F-statistics (Wright, 1951), which were estimated by Weir & Cockerham's (1984) f, θ and F for FIS, FST and FIT, respectively. Tests of significance were carried out by permutations. Briefly, 2500 permuted data sets were generated from the original genotype matrix. For FIS, alleles were permuted within samples; for FST, genotypes (rather than alleles) were permuted among samples. The value of interest (e.g. FIS) was calculated for each permuted file, generating a distribution of FIS values under the null hypothesis. The chance probability of obtaining the original FIS value under the null hypothesis was estimated as the proportion of the distribution having FIS values larger than the original value.

Because significant deficits of heterozygotes are frequently caused by null alleles at the microsatellite loci, we estimated the expected allele frequency of a presumed null allele using x = (HeHo)/(1 + He) (Brookfield, 1996), where x represents the estimated frequency of the null allele, and He and Ho are the expected and observed frequencies of heterozygotes, respectively.

Introgression

We analysed the frequencies of hatchery-specific alleles at three microsatellite loci. Because we do not have a pure wild sample at hand, we assumed that all alleles shared between native samples and hatchery stocks were of domestic origin. Although this assumption is not likely to be always true, we propose to use the sum of the frequencies of shared alleles in natural samples as an index of maximal introgression.

In addition, we compared the microsatellite introgression with that reported in a complete analysis of protein polymorphism on the same samples (Poteaux et al., 1998b) and with the introgression estimated from mtDNA data. With allozymes, it is possible to assess precisely the native allele frequencies of wild populations because many Mediterranean populations have been studied in the past (Krieg & Guyomard, 1985). The frequencies of hatchery-specific alleles at two diagnostic protein-coding loci, Ldh-5 and Tf, can then be used to compute the introgression level as proposed in Barbat-Leterrier et al. (1989): r= (fsfr)/(fhfr). The proportion of introduced genes (r) is computed as the allele-frequency differences between the sample (fs) and the reference population (fr), weighted by the allele-frequency differences between hatchery strain (fh) and the reference (fr). For the mtDNA, the introgression rate simply corresponds to the frequency of the Atlantic haplotype in wild populations (see Poteaux et al., 1998a for details).

Results

Cloning and microsatellite analysis

Twenty-six of the 1450 recombinant clones were identified as positive clones and sequenced. We report here the isolation of 17 microsatellite loci from the brown trout genome (Genbank accession numbers from U60219 to U60230 for 12 of them). We found 12 (CA)n and one (CT)n microsatellite, with n ranging from 15 to over 75, two trinucleotide microsatellites, one chimera, and a (CA) microsatellite associated with a 37-bp minisatellite with at least seven repeats. The mean number of repeats observed in our clones is about 32. No (AT)n arrays were found. Thus (CA) motifs appeared to be the most abundant type in the trout genome, consistent with the observations of Estoup et al. (1993) for trout and with other studies on fish (Rico et al., 1993; Garcia de Leon et al., 1995).

Mendelian inheritance of Strutta-12 and Strutta-58 was verified by analysis of genotype ratios in backcross offspring (Poteaux et al. submitted): all frequencies were in agreement with Mendel's expected proportions in the offspring analysed (334 individuals of two backcross experiments).

Average motif number varies from locus to locus and it appears that the longer the array, the greater the polymorphism (Table 2). In all samples combined, MST-73 (15 motifs on average) had only four alleles, Strutta-58 (40 motifs) had 23 alleles and Strutta-12 (37 motifs) 27 alleles.

Hatchery stocks

Hatchery samples contain no major alleles for Strutta-12 and 58, but many alleles at low frequencies (p< 0.1 in most cases). Some alleles at Strutta-12 and Strutta-58 were detected in the hatchery stocks only (Table 3). We found only three alleles at locus MST-73, the same as those identified by Presa et al. (1994) in their hatchery samples, but at different frequencies.

Table 3 Allele frequencies, expected heterozygosities (He), mean number of alleles (n) and f-values for the brown trout samples

The average heterozygosity was high in both cases (0.77 for Brassac and 0.84 for La Canourgue). Both hatchery samples show significant departure from Hardy– Weinberg expectations (f = 0.07, P< 0.05 and f= 0.13, P< 0.001). Furthermore, domestic stocks significantly differed from each other (θ = 0.04, P< 0.001).

Wild Mediterranean populations

The frequency distributions of microsatellite alleles in the Orb and Tes samples show similar patterns: the major alleles correspond to alleles absent from the hatchery stocks (Table 3). The predominant allele for MST-73, out of a total of four, corresponds to an allele which seems to be diagnostic between the Mediterranean and domesticated forms (our data; Presa, 1995). For the other two loci, the mean number of alleles per sample is high, 16.5 and 17.5 for Orb and Tes, respectively.

Expected heterozygosity and deviations from Hardy–Weinberg equilibrium are presented in Table 3: the global f estimation is significant for both samples, but caused by different loci (Strutta-12 for the Orb river and Strutta-58 for the Tes river). Heterozygosity is higher in the Orb sample than in the Tes sample (0.74 and 0.59, respectively) but it is on average lower than the value for the hatchery samples (0.82). Multilocus f are high and significant for both populations (0.10 and 0.11 for Tes and Orb, respectively) and for all age groups.

Differentiation between samples and age groups

The frequency distributions of the different loci show slight differences between the Tes and Orb samples and between age groups 0+ and 1++ (Table 3). On average, these differences are significant between the Orb cohorts (θ = 0.012, P< 0.01), but not between the Tes cohorts (θ = 0). The Orb and Tes populations were slightly but significantly differentiated (θ = 0.02, P< 0.001), this holding true at the level of age groups (between adults θ = 0.03, P< 0.01).

Genetic effect of restocking

Values of protein-coding and mtDNA introgression indices (Poteaux et al., 1998a) are given in Table 4. Domestic allele frequencies for the protein-coding loci varied from 9 to 38%. They confirmed that both populations were introgressed and the mean introgression index for the Orb population is twice that of its tributary (mean for both loci: 28% and 12.5%, respectively). That is confirmed by the mtDNA data, for which the introgression level is higher than that of the protein-coding loci.

Table 4 Introgression index for protein loci, mtDNA and microsatellite loci for the brown trout samples

For the microsatellites, we can distinguish four kinds of alleles in the allelic distribution (Table 3): (i) alleles present only in the hatchery samples, (ii) alleles present only in the Tes and/or Orb samples, (iii) shared alleles that are frequent in the hatchery samples and rare in wild samples and (iv) shared alleles that are rare in the hatchery samples and frequent in the wild samples.

The allele types (i) and (ii) cannot be used to estimate introgression. If we consider that introgression is evidenced by the alleles observed in wild and hatchery samples, the introgression index we computed is simply the cumulative frequencies of shared alleles (Table 4). The indices then ranged from 13 to 68%. MST-73 gave exactly the same level of introgression as Tf* whereas values for the other two microsatellite loci are higher than those for proteins but close to the mtDNA estimates (Table 4). Nevertheless, the same pattern of variation, that is the frequency of alleles shared with domestic strains is higher in the Orb samples than in the Tes, was observed for Strutta-12 and MST-73, Strutta-58 showing similar values for all samples. We will discuss below the limits of this kind of index and we will question the usefulness of microsatellite loci to answer problems of introgression.

Discussion

The situation under study is characterized by the human influence on wild Mediterranean trout populations. The ancestral composition of French Mediterranean trout populations is now lost and we are left to deduce it by subtracting the effects of introgression of domestic genes. Introgressed populations are reproducing populations. Their structure is of interest because they constitute field-wide experiments of interactions between differentiated gene pools a priori adapted to different environmental conditions, an issue which has important bearings in conservation biology. We report here two separate pieces of evidence for which we must find satisfactory explanations. One is the existence of large heterozygote deficiencies. The other is an apparent difference, and a possible decrease, in the level of introgression when stocking has ceased.

Departure from Hardy–Weinberg equilibrium in hatchery and wild populations

For the hatchery populations, the departure from Hardy–Weinberg expectations shown by both samples could be easily explained by the mixing of different sets of offspring (because of fish farms' practice of introducing progenitors from other hatcheries or from rivers of the Atlantic basin). A previous analysis with microsatellites showed that no deficits were obtained for trout populations from the Moselle River, not disturbed by restocking (Estoup et al., 1998), whereas in the case of our wild restocked populations, we observed deficits of about 10%. Rico et al. (1997) reviewed the possible causes of excess homozygosity. From their list, we will take three hypotheses to explain the heterozygote deficits observed in the Orb and Tes populations.

(i) Null alleles. Here, heterozygote deficits were high and detected at different microsatellite loci for the two samples (Table 3). The expected allele frequency of the null allele in the Tes River, the Orb River and both hatchery samples was 0.038, 0.05, 0.0006 and 0.063, respectively. Because of these relatively low frequencies, this cause cannot explain all the disequilibria.

(ii) A Wahlund effect either in time (overlapping of genetically differentiated generations) or in space (migration between genetically differentiated populations). In this last case, it would mean that these hypothetical populations show between them FST values similar to the FIS observed when they are mixed (≈10% whereas the higher θ value obtained here was only 3%). We doubt that such subpopulations can coexist generation after generation in the same relatively small river drainage. Similarly, if an effect were to be sought because of overlapping generations we would still have to explain how these generations could be so different, and also the different age-groups should show much larger genetic differences than we saw.

(iii) Finally, assortative mating or selection against heterozygotes could cause all or a part of these deficits. Because restocking results in the admixture of differentiated gene pools, one cannot reject a priori the possibility that some assortative mating or differential survival of genotypes plays a part in these heterozygote deficiencies, although we have no direct evidence for this hypothesis.

Actually, we can not conclude unambiguously which cause explains the excess homozygosity observed and no single factor may alone provide a satisfactory explanation.

Putative assessment of introgression with microsatellite data

The diagnostic power of protein-coding loci and mtDNA haplotypes is relatively well established (Krieg & Guyomard, 1985; Giuffra et al., 1996) but, for microsatellites, some wild alleles scored as identical to hatchery alleles introduce an uncertainty on the nature of such shared alleles. They may have three origins: (i) convergent mutation; (ii) common ancestral polymorphism; or (iii) introgression. Here, because few alleles are shared with the hatchery samples, the least polymorphic microsatellite locus gives introgression levels similar to those given by protein-coding loci. At the opposite, the most variable ones give higher introgression levels because of numerous alleles shared and at low frequency. The discrepancy between introgression results from markers and from microsatellite loci suggests that estimates of introgression using molecular methods should still be interpreted with caution. But we disagree with the idea that polymorphic microsatellite loci should only be used to analyse population structure or geographical differentiation, and we suggest that most of the variability observed in these disturbed populations has a foreign origin.

No undisturbed Mediterranean subpopulations still exist in this drainage and we could not compare our introgressed samples with a pure reference to identify diagnostic microsatellite alleles. However, we decided not to include pure reference samples from a distinct river drainage in this study because (i) owing to the high rate of evolution of such markers, large differences between basins are probable in the microsatellite alleles' composition in the wild (e.g. Brunner et al., 1998) and (ii) even if allozymes showed that loci were fixed for wild alleles, we could not be sure that no ancient unknown introgression had occurred and that no foreign microsatellite alleles had been fixed following founder effects or bottleneck events (Poteaux et al. unpubl. data). Nevertheless, to establish comparisons, we used an index which gives a maximum introgression by addition of all shared alleles among domestic and wild populations, therefore assuming that shared alleles are too numerous to be explained by ancestral polymorphism or homoplasy.

The low polymorphism of allozymes (Krieg & Guyomard, 1985; Moran et al., 1993; Poteaux & Berrebi, 1997) and mtDNA (Apostolidis et al., 1996) suggests that these riverine systems probably host trout populations of small effective sizes. Indeed, for the Orb River drainage, direct population estimates based on individual density per hectare and on spawning grounds inventories during several winters in both rivers (Beaudou, 1993) have yielded values on the order of 200–250 progenitors. In our case, with overlapping generations, we can then be confident that the effective population size (Ne) is much smaller than the observed number of spawning fishes. Hence, if Ne is small, we expect a small number of alleles at mutation/drift equilibrium, and shared alleles should then more likely reflect introgression rather than convergence or parallelism. If we compute the heterozygosity expected at mutation/drift equilibrium with the infinite allele model (IAM; Kimura & Crow, 1964), this quantity He ≈ 0.6–0.7 is obtained with a mutation rate higher than 10–3 or for Ne values much larger than expected for this kind of river. However, a mutation rate higher than 10–3 is not possible because no mutation event was detected by the analysis of 334 backcross progeny at these loci (Poteaux et al. submitted). A good part of the observed variability is therefore probably of foreign origin. All this suggests that introgression is a powerful means of increasing the levels of heterozygosity in rivers by introducing novel alleles into trout populations. Our index of ‘maximum introgression’ thus seems reliable enough to monitor fluctuations of introgression.

Differentiation between wild populations

Microsatellite markers gave results consistent with proteins and mtDNA data, in that the restocked Orb population was more polymorphic than the Tes population and, using the ‘maximum introgression’ index, appeared to have a higher frequency of ‘domesticated genes’. It is noteworthy that a significant difference of microsatellite allele frequencies, especially for alleles of domesticated origin, can be detected for all loci between the two rivers. Small et al. (1998) observed a significant structure between coho salmon populations from the upper and lower part of the Fraser River drainage. They supposed that the regional structure observed might be maintained by adaptive differences between the two groups. At the lower scale of our study, there is no obstacle between the upper part of the Tes River and the Orb River, downstream. Because migrations were effective at some periods, this opens the question of the nonhomogenization of both gene pools. Tes samples (both age groups) still showed some alleles originating from hatchery stocks, but at lower frequency than in the Orb. The complete elimination of hatchery alleles in the undisturbed river, if it is to occur, is very unlikely to be observed in our case because restocking ceased only 6 years before the study, that is only two trout generations, and because Orb progenitors can move upstream for mating. Nevertheless, our results mean that 6 years without restocking were sufficient to create, or enhance if it already existed, a significant difference between the two rivers.

Conclusion

Microsatellite markers used here reveal a slight genetic differentiation between tributaries within the same basin. Our study was a first attempt to evaluate with microsatellite loci the impact of hatchery trout on wild populations. Although the introgression of hatchery genes in the wild populations is likely to be overestimated, a better method cannot be proposed because of the lack of pure reference samples. Nevertheless, our relatively simple approach gives coherent results. This opens the way to the study of the fate of introduced genes with a genome coverage larger than previously attainable by proteins and mtDNA.