Introduction

Pollution-safe cultivar, which refers to the use of cultivars that accumulate a very low level of a specific pollutant, have been proposed to be a strategy to ensure the crop remains safe for human consumption, even when grown in contaminated soil1. Owing to rapid urbanisation and industrialisation, heavy metal pollution has become a widespread environmental problem2. Many processes, such as the application of agrochemicals (fertilisers, pesticides and animal manures), sewage irritation and industrial pollution (including from nonferrous metal, ceramics, printing and dyeing, electroplate and chemical adhesive industries), have emitted heavy metals into soils3. McLaughlin et al.4 reported that metals could be transferred into the roots of plants through the soil pore water in the form of dissolved ions. The heavy metals can then accumulate in the edible part of plants and easily enter the human body through the food chain, which results in an increased risk of disease5,6,7. Mercury (Hg) is one of the most toxic heavy metals8, 9. According to the Chinese National Standard for Soil Environmental Quality, the Hg concentration in soils is divided into three classes: in Class I, the Hg concentration is under 0.15 mg kg−1 in soils, which is the natural background value; in Class II, the Hg concentration is under 1 mg kg−1, which is the upper acceptable limit for agricultural soils; and in Class III, the Hg concentration is under 1.5 mg kg−1 in soils, which is the limit for the normal growth of plants10. According to Lin et al.11, the background Hg levels of soils in China range from 0.02 to 0.20 mg kg−1. However, the soil Hg content in many parts of China far exceeds its background value. For example, the soil Hg content at the site of the Wuchuan mine in China is as high as 24 mg kg−19.

A high soil Hg content can impact plant growth and development when absorbed by plants. Seed injury to cereals caused by organomercury can inhibit cell division during seed germination, and the elongation of Oryza sativa seedlings is also inhibited by high Hg concentration12. Hg can cause a large and rapid reduction in the hydraulic conductivity of roots, which results in the inhibition of aquaporin functions13, 14, and higher Hg concentrations have a toxic effect on root growth in alfalfa by inducing oxidative stress15. Additionally, high Hg doses can affect the absorption and evaporation of water, and decrease the chlorophyll content and photosynthetic efficiency16,17,18. Importantly, the soil Hg absorbed by crops can also be transported into the human body through the food chain19. Hg, in the form of methylethylmercury, absorbed by the human body can accumulate in the brain, eventually causing neurotoxic effects and nerve-related diseases, such as autism, attention deficit disorder and mental retardation, and death20, 21.

Many studies have investigated the physiological and biochemical responses to Hg intoxication. Hg also inhibits the 5-amino levulinic acid dehydratase activity in the cells of maize leaves, thereby affecting the synthesis of chlorophyll22, and inhibiting plant growth and development23. Yu et al. first identified three quantitative trait loci (QTLs) for Hg accumulation and tolerance at the seedling stage in rice using a doubled haploid population24. Wang et al. also detected three QTLs for Hg tolerance at the seedling stage in rice using a recombinant inbred population25. Fu et al. found 23 QTLs for Hg accumulation in five maize tissues using a recombinant inbred population26. In addition to these QTLs, genes closely related to Hg accumulation and tolerance have also been reported. In Arabidopsis thaliana, merApe9 transgenic plants are more tolerant to Hg at the seedling stage and during the flowering period27. The overexpression of HO-1 in algae also resulted in a high tolerance to Hg exposure and a reduced Hg accumulation28. The overexpression of BnHO-1 in Brassica napus resulted in a reduced Hg accumulation in the transgenic plants29. MTH1745 transgenic rice has an enhanced Hg tolerance30.

Genome-wide association studies (GWASs) are useful for identifying candidate loci associated with traits in animal and plant species31, 32. For example, the examination of maize oil biosynthesis identified 74 loci significantly associated with kernel oil concentration and fatty acid composition in a GWAS using 1 million single-nucleotide polymorphisms (SNPs) characterised in 368 maize inbred lines33. Furthermore, GWAS and QTL mapping have been found to be complementary and to overcome each other's limitations in Arabidopsis 34, 35. In soya bean, the genotyping by sequencing-GWAS approach has been used to identify loci governing eight agronomic traits and was validated by QTL mapping36.

Maize is an important grain and feed crop for humans and animals. Hg accumulation in plants through the plant–soil system can cause toxicity that affects both plant growth and human health through the food chain12. Thus, maize planted on Hg-polluted soil can absorb and accumulate Hg in its edible parts, posing a potential threat to human health. Although some QTLs and genes regulating the accumulation of, and tolerance to, Hg have been reported previously24,25,26,27,28,29,30, knowledge of the genetic basis for Hg accumulation in maize remains limited. In the present study, an association population consisting of 230 maize inbred lines was evaluated at two locations having different soil Hg concentrations. The main purpose of this study was to detect SNPs that are significantly associated with Hg accumulation in five maize tissues, which may aid in the selection of elite inbred lines with low Hg-accumulation capabilities in the kernels and high accumulation capabilities in the leaves, to maintain maize production.

Results

Performance of the measured traits

In the association population, the Hg content in all five of the maize tissues tested was much higher at the Xixian location compared with the corresponding tissues at Changge (Table 1, Fig. 1). At Xixian, the average Hg contents in the kernels, axes, stems, bracts and leaves were 1.39, 3.04, 4.94, 6.62 and 29.46 µg kg−1, respectively. At Changge, the average Hg contents in the kernels, axes, stems, bracts and leaves were 1.05, 2.79, 3.84, 6.01 and 26.49 µg kg−1, respectively. The Hg concentrations in the different maize tissues showed the same trend, kernels < axes < stems < bracts < leaves, at the two locations.

Table 1 Means and standard deviation (SD) values, variance components and heritability values of maize kernels, axes, stems, bracts and leaves.
Figure 1
figure 1

Histogram of Hg concentrations in five maize tissues in the association population.

The Hg content in each maize tissue varied widely in the association population at the two locations. The Hg concentration in kernels varied from 0.15 to 6.50 µg kg−1 at Xixian and from 0.14 to 3.55 µg kg−1 at Changge. The Hg concentrations in axes, stems, bracts and leaves varied from 1.01 to 6.4 µg kg−1, 1.93 to 14.39 µg kg−1, 4.61 to 12.78 µg kg−1 and 20.50 to 46.30 µg kg−1, respectively, at Xixian; and from 1.82 to 4.86 µg kg−1, 1.31 to 10.35 µg kg−1, 2.37 to 13.40 µg kg−1 and 12.26 to 42.43 µg kg−1, respectively, at Changge.

An analysis of variance indicated that the Hg concentrations in the five measured tissues in the association population were significantly affected by environments, genotypes and genotype × environment interactions (Table 1), indicating that the soil Hg concentration is an important factor affecting the Hg contents in maize tissues. The heritability values of the Hg contents in kernels, axes, stems, bracts and leaves at Xixian were 86.50%, 80.82%, 79.85%, 85.94% and 82.93%, respectively; and 88.55%, 92.50%, 95.38%, 97.39% and 98.67%, respectively, at Changge. The high heritability in each tissue at both locations indicated that much of the phenotypic variance in the population was genetically controlled. Additionally, the correlations of the Hg contents in the five tissues at each location and the means of the Hg concentrations of the corresponding tissues at both locations were calculated using the Pearson correlation coefficient. At Xixian, the Hg concentrations between bracts and stems showed a significant relationship (Table 2), while at Changge, the kernel's Hg concentration had significant relationships with those of both bracts and axes. In the two locations combined, the mean of the bract's Hg content had significant relationships with those of both kernels and stem.

Table 2 Correlation coefficients among different tissues in the maize association population.

Linkage disequilibrium (LD) in the association panel

The genome-wide LD was calculated using the 522,744 SNPs (minor allelic frequency > 0.05), which were used as the input data (Fig. 2). LD decay varied from 50 kb to100 kb across different chromosomes at r = 0.1. The LD reached within 40–50 kb on chromosome 1, 50–60 kb on chromosome 2, 75–100 kb on chromosome 3, and 50–100 kb on the remaining chromosomes. The average LD decay across the entire genome was 50–100 kb (r = 0.1).

Figure 2
figure 2

Linkage disequilibrium decay of each chromosome.

GWAS

A total of 230 inbred lines were characterised phenotypically across different tissues at two different sites. Principal components (PCs) were used to control the population structure. A principal component analysis (PCA) showed that the inbred lines used in this study could be separated well by PC1 and PC2 (Fig. S1). A GWAS was performed for the Hg contents in five maize tissues using a mixed linear model (Figs 3 and 4). A total of 37 significant SNPs that associated with kernels, 12 with axes, 13 with stems, 27 with bracts and 23 with leaves were detected with p < 0.0001, which explained 6.96%–10.56%, 7.19%–15.87%, 7.11%–10.19%, 7.16%–8.71% and 6.91%–9.17% of the phenotypic variation for kernels, axes, stems, bracts and leaves, respectively (Table 3). All of these significant SNPs were distributed over 10 chromosomes. In the association population, the most significant SNPs for kernels, axes, stems, bracts and leaves were chr5. S_196250608, chr1.S_242708944, chr7. S_15426996, chr9. S_105527159 and chr4. S_239906146, respectively, located on chromosome 5, 1, 7, 9 and 4, respectively, explaining 10.56%, 15.86%, 10.19%, 8.71% and 9.17% of the phenotypic variation, respectively. The quantile–quantile plots were determined and indicated that population structure was well controlled by PCA and Kinship of each tissue.

Figure 3
figure 3

Quantile–quantile plot of the Hg contents in five maize tissues from two locations combined, as determined by a genome-wide association analysis.

Figure 4
figure 4

Manhattan plot of the Hg contents in five maize tissues from two locations combined, as determined by a genome-wide association analysis.

Table 3 A total of 111 significantly associated SNPs in maize detected by GWAS.

The overall LD decay for the entire genome in this panel was 100 kb. Given the extent of average LD, we feel confident that a 200 kb window centered on each significant SNP has a good chance to capture the gene of interest.

Discussion

Hg poisoning, as the result of environmental pollution, has become a problem. In rice, the Hg content in different tissues follows the trend: root > stalk > leaf > husk > seed37. In maize, Liu et al.38 reported that the Hg levels in different tissues were different, having the following trend: root > leaf > stalk > grain. Fu et al.26 found a similar distribution of the Hg content, leaves > bracts > stems > axes > kernels, in maize. The same trend for Hg concentrations in different maize tissues was also found in the present study (Table 1). The similar distribution of the Hg content across different maize tissues indicated that a common regulatory mechanism might exist in the Hg-accumulation process.

In crop breeding, QTL mapping for important traits is a most common approach, providing the basis for marker-assisted selection. However, QTL detection is a traditional genotyping method and is laborious and time-consuming, and QTLs are low-density markers that map with low resolutions because of the limited recombination numbers. With the advent of next-generation sequencing technologies, GWASs have become a powerful genetics-based strategy to explore allelic variation with a broader scope. GWASs can save time and labour, increase the detectable range of natural variation and improve the resolution of QTL mapping39, 40. In plants, GWASs have been used to identify many loci for complex traits, including drought tolerance41, and seed oil36 and protein contents41. In the present study, 11 significant SNPs associated with Hg accumulation co-localised with six previously reported QTL intervals26 (Table 4). On chromosome 7, one significant SNP (chr7.S_130427527) associated with the Hg content in kernels was detected within the qAHC7 QTL interval located in bin7.03. A total of five significant SNPs on chromosome 8, which associated with the Hg contents in kernels (chr8.S_128481917), stems (chr8.S_161469759), bracts (chr8.S_163741736) and leaves (chr8.S_128751246 and chr8.S_162579911), co-localised with the previously reported QTL named qBHC8a. The other significant SNP (PZE-108038334) on chromosome 8 co-localised with the qSHC8b QTL located in bin8.03. On chromosome 9, three significantly associated SNPs observed within the region between 105.527 and 115.321 Mb co-localised with the reported QTL qKHC9b/qBHC9. The other significant SNP (PZE-109058129) on chromosome 9 co-localised with the qKHC9a QTL located in bin9.03.

Table 4 Identification of maize loci using a GWAS and previous QTLs.

In the present study, more loci were detected through the GWAS than through linkage mapping, possibly because of the reduced QTL detection efficiency and the smaller number of molecular markers used in linkage mapping. A combination of linkage mapping and GWAS would promote the analysis of complex quantitative traits. The loci identified by the integration of GWAS and QTL mapping would provide useful reference information for studies on the functional verification of Hg accumulation.

Because maize is a food, feed and industrial crop, increasing its grain yield is an important breeding goal42, and is also important to ensure the quality of maize grain as worldwide demand for food increases. Heavy metal pollution in soils has become a major issue globally43, and the heavy metals absorbed by crops could affect grain quality. Zhang et al.44 reported that it was necessary and feasible to select and plant low heavy metal accumulation varieties in contaminated soil, which could reduce the heavy metal content of the edible crop parts. The strategy of breeding pollution-safe cultivars, in which heavy metals in contaminated soil could accumulate at sufficiently low levels in the edible parts of crops for safe consumption1, has been applied. Hg usually accumulates to a significantly higher level in leaves than in kernels25, 39, 40, and this was corroborated in the present study. For example, the mean Hg concentrations in leaves were 29.46 µg kg−1 at Xixian and 26.49 µg kg−1 at Changge, which far exceeds that in kernels (1.39 µg kg−1 at Xixian and 1.05 µg kg−1 at Changge). In addition, the Pearson correlation coefficient indicates that there is no significant relationship between the Hg contents in leaves and kernels. The results imply that inbred lines with high accumulations of Hg in leaves and low accumulations of Hg in kernels could be used as parents for selecting elite hybrids, which would be helpful both for soil remediation and for the assurance of food safety.

Materials and Methods

Plant materials and SNP markers

A total of 298 maize inbred lines, which came from temperate, tropical and subtropical zones, were used in this study. After removing materials with higher heterozygosity and loss rates, 230 inbred lines were selected to constitute the association population, among which 151 inbred lines came from the temperate zone, and 79 came from tropical and subtropical zones (Supplementary Table S1)45. The selected inbred lines were genotyped in this study using two genotyping platforms (RNA sequencing and a SNP array) containing 556,809 SNPs according to the method described by Yang et al.45 The SNP data is available from http://www.maizego.org/Resources.html.

Plant treatments and soil conditions

The field trials were conducted in 2012 in Xixian (E 114°72′, N 32°35′) and Changge (E 113°34′, N 34°09′) Counties, which are located in northern China, with average temperatures of 15.2 and 14.3 °C, respectively, and rainfalls of 873.8 and 462.8 mm, respectively. The maize association population was grown in a randomised complete block design with three replicates at each location. Each plot included 15 plants planted 0.67 m apart in a single row 4 m long, allowing a final plant density of 67,500 plants per hectare. At the Xixian location, because of irrigation with Hg-rich surface water, the soil Hg concentration (457.57 ± 31.30 µg kg−1, pH 6.5) was much higher than at Changge (345.40 ± 22.24 µg kg−1, pH 6.5). The soil Hg concentrations at the two locations were higher than in Class I, according to the Chinese national standards related to Hg soil concentration rankings10.

Determining the Hg concentrations in maize tissues

Five consecutive plants from each plot were harvested for further analyses when they reached physiological maturity. After the collected plant materials were dried, they were dissected into five parts: kernels, axes, stems, bracts and leaves. Each part of the plant was ground into a fine powder using a mortar and pestle. Powdered samples (0.5 g) were digested with 5 mL HNO3/HClO4 (80/20 v/v) in polypropylene tubes using a heating block (AIM500 Digestion System, A.I. Scientific, Australia). Then, the Hg concentrations in the different plant materials were determined using atomic fluorescence spectrometry (AFS-3000, Beijing Haiguang Analytical Instrument Co., Beijing, China) (Supplementary Table S2). Data were analysed using a two-way analysis of variance with the IBM SPSS Statistics package, and broad-sense heritability was calculated according to the method developed by Knapp et al.46.

GWAS

SNPs with more than 12% missing data and a minor allele frequency < 5% were excluded, leaving 522,744 SNPs for further analyses. The LD between SNPs on each chromosome was estimated with r2 using TASSEL 5.047. The PCs and the kinship matrix were also determined using TASSEL 5.0. A mixed linear model with the obtained SNPs, PCs, kinship and the means of the Hg contents was established for the GWAS. The relative distribution of −log10 p-values was observed for each SNP association and compared individually with the expected distribution using a quantile–quantile and manhattan plot. The adjusted p-value threshold of significance for each trait was corrected. SNP loci in significant LD regions were identified by revealing the significant contributions to phenotypic variation of the agronomic traits with the highest magnitude of marker trait-association and lowest adjusted p-values (threshold p < 1 × 10−4).

Analysis of candidate genes

The reported genome sequence of maize B73 provides a useful reference database for candidate gene analyses. According to Lawrence et al.48, probes of ~120 bp containing the SNPs associated with Hg accumulation in different maize tissues were used for comparisons with the maize B73 genome. Based on the LD decay, a 200-kb window for the significant SNPs (100 kb upstream and downstream of the lead SNP) was selected to identify the candidate genes. Genes within the region were identified based on the positions of the closest flanking significant SNPs (p < 1 × 10−4). The candidate genes were obtained using the BLASTN algorithm (http://blast.ncbi.nlm.nih.gov/) and annotated against the Gene Ontology database (http://www.geneontology.org/) for functional annotations.