Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Communications Earth & Environment
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. communications earth & environment
  3. articles
  4. article
Machine learning uncovers dominant fractions of heavy metal(loid)s in global soils
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 31 January 2026

Machine learning uncovers dominant fractions of heavy metal(loid)s in global soils

  • Tao Hu1,2,
  • Mengting Wu1,
  • Qiusong Chen1,
  • Liyuan Chai2 &
  • …
  • Chongchong Qi  ORCID: orcid.org/0000-0001-5189-16141,2 

Communications Earth & Environment , Article number:  (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Environmental chemistry
  • Solid Earth sciences

Abstract

Global heavy metal(loid) (HM) pollution in soils threatens food security and human health, yet HM mobility—governed by chemical fractions—remains poorly understood at large scales. Here we develop an eXtreme Gradient Boosting model for classifying soil HM dominant fractions using a globally compiled dataset of 9489 field observations. We show that organic carbon and pH positively affect high-mobility of HMs in soil. Using mercury as a case study, we identify both known and previously unreported high-mobility hotspots. The global map indicates that mercury has high mobility across 17.85% of global regions. Combining population and cropland distribution maps with model predictions shows that an estimated 15.1 million people and 100.9 million hectares of farmland are situated in high-mobility regions, with Asia being disproportionately impacted. This study facilitates the efficient identification of dominant HM fractions on a large scale, providing a critical foundation for developing targeted HM stabilization/solidification strategies.

Similar content being viewed by others

Global and regional patterns of soil metal(loid) mobility and associated risks

Article Open access 26 March 2025

Inclusion of key soil parameters in the modified contamination factor (MCF) model as a tool for assessing heavy metal pollution in agricultural soils

Article Open access 02 December 2025

Heavy metal enrichments in rural environment soils, pollution dynamics, source identification, and the role of subsurface mineralizations

Article Open access 14 October 2025

Data availability

The data and code generated in this study are available on Figshare (https://doi.org/10.6084/m9.figshare.29554202)72. The Supplementary Data file, including all source data underlying the figures, is also available via Figshare. Global soil properties data were obtained from SoilGrids version 2.0 (https://soilgrids.org/)34. Global soil Hg concentration is available at Figshare (https://doi.org/10.6084/m9.figshare.26335654.v1)33. Global crop data is available at Figshare (https://doi.org/10.6084/m9.figshare.22491997.v5)44. Global population data is available at Zenodo (https://doi.org/10.5281/zenodo.11179644)45.

References

  1. Bagnall, D. K., Shanahan, J. F., Flanders, A., Morgan, C. L. S. & Honeycutt, C. W. Soil health considerations for global food security. Agron. J. 113, 4581–4589 (2021).

    Google Scholar 

  2. Hou, D. et al. Global soil pollution by toxic metals threatens agriculture and human health. Science 388, 316–321 (2025).

    Google Scholar 

  3. Wei, R. et al. Heavy metal concentrations in rice that meet safety standards can still pose a risk to human health. Commun. Earth Environ. 4, 84 (2023).

    Google Scholar 

  4. Hou, D. et al. Metal contamination and bioremediation of agricultural soils for food safety and sustainability. Nat. Rev. Earth Environ. 1, 366–381 (2020).

    Google Scholar 

  5. Coban, O., De Deyn, G. B. & van der Ploeg, M. Soil microbiota as game-changers in restoration of degraded lands. Science 375, abe0725 (2022).

    Google Scholar 

  6. Yang, Q. et al. A review of soil heavy metal pollution from industrial and agricultural regions in China: Pollution and risk assessment. Sci. Total Environ. 642, 690–700 (2018).

    Google Scholar 

  7. Liu, X. et al. Molecular-level understanding of metal ion retention in clay-rich materials. Nat. Rev. Earth Environ. 3, 461–476 (2022).

    Google Scholar 

  8. Imseng, M. et al. Fate of Cd in Agricultural Soils: A Stable Isotope Approach to Anthropogenic Impact, Soil Formation, and Soil-Plant Cycling. Environ. Sci. Technol. 52, 1919–1928 (2018).

    Google Scholar 

  9. Alloway, B. J. Heavy Metals in Soils: Trace Metals and Metalloids in Soils and Their Bioavailability (Springer, Berlin, 2012).

  10. Zhang, Y. et al. Experimental and molecular simulation study of HCl-modified SiO2 nanofluid in coal wettability. Powder Technol 469, 121887 (2026).

    Google Scholar 

  11. Song, X. et al. Uncovering soil heavy metal pollution hotspots and influencing mechanisms through machine learning and spatial analysis. Environ. Pollut. 370, 125901 (2025).

    Google Scholar 

  12. Zhang, S. et al. Escalating arsenic contamination throughout Chinese soils. Nat. Sustain. 7, 766–775 (2024).

    Google Scholar 

  13. Wang, X. et al. Climate and Vegetation As Primary Drivers for Global Mercury Storage in Surface Soil. Environ. Sci. Technol. 53, 10665–10675 (2019).

    Google Scholar 

  14. Jia, J. et al. Fractionation, source, and ecological risk assessment of heavy metals in cropland soils across a 100-year reclamation chronosequence in an estuary, South China. Sci. Total Environ. 807, 151725 (2022).

    Google Scholar 

  15. Qi, C. et al. Global and regional patterns of soil metal(loid) mobility and associated risks. Nat. Commun. 16, 2947 (2025).

    Google Scholar 

  16. Wieczorek, J., Baran, A. & Bubak, A. Mobility, bioaccumulation in plants, and risk assessment of metals in soils. Sci. Total Environ. 882, 163574 (2023).

    Google Scholar 

  17. Liu, H. et al. Dynamic Behaviors of Newly Deposited Atmospheric Heavy Metals in the Soil-Pak Choi System. Environ. Sci. Technol. 56, 12734–12744 (2022).

    Google Scholar 

  18. Yang, H. et al. Predicting Heavy Metal Adsorption on Soil with Machine Learning and Mapping Global Distribution of Soil Adsorption Capacities. Environ. Sci. Technol. 55, 14316–14328 (2021).

    Google Scholar 

  19. Qi, C. et al. Spectral-Based Machine Learning Enables Rapid and Large-Scale Adsorption Capacity Prediction of Heavy Metals in Soil. ACS ES&T Eng 4, 2657–2667 (2024).

    Google Scholar 

  20. Angon, P. B. et al. Sources, effects and present perspectives of heavy metals contamination: Soil, plants and human food chain. Heliyon 10, e28357 (2024).

    Google Scholar 

  21. Xu, W., Yuan, J. & Zeng, G. Introduction of heavy metals contamination in the water and soil: a review on source, toxicity and remediation methods. Green Chem. Lett. Rev 17, 2404235 (2024).

    Google Scholar 

  22. Li, G., Lu, N., Wei, Y. & Zhu, D. Relationship between Heavy Metal Content in Polluted Soil and Soil Organic Matter and pH in Mining Areas. IOP Conf. Ser.:Mater. Sci. Eng. 394, 052081 (2018).

    Google Scholar 

  23. Król, A., Mizerna, K. & Bożym, M. An assessment of pH-dependent release and mobility of heavy metals from metallurgical slag. J. Hazard. Mater. 384, 121502 (2020).

    Google Scholar 

  24. Basti, S., Sahu, C., Pati, S. S. & Sahu, S. K. Impact of organic carbon on heavy metals of river sediments and aquatic ecosystems: A review. Environ. Qual. Manag. 34, e22253 (2024).

    Google Scholar 

  25. Qu, C. et al. Heavy metal behaviour at mineral-organo interfaces: Mechanisms, modelling and influence factors. Environ. Int. 131, 104995 (2019).

    Google Scholar 

  26. Wu, Y. et al. pH-dependent overestimation of heavy metals mobility risk in soils: Implications for soil quality assessment. J. Hazard. Mater. 501, 140792 (2026).

    Google Scholar 

  27. Caporale, A. G. & Violante, A. Chemical Processes Affecting the Mobility of Heavy Metals and Metalloids in Soil Environments. Curr. Pollut. Rep. 2, 15–27 (2016).

    Google Scholar 

  28. Kyziol, J. Effect of physical properties and cation exchange capacity on sorption of heavy metals onto peats. Pol. J. Environ. Stud. 11 (2002).

  29. Inglezakis, V., Zorpas, A., Loizidou, M. & Grigoropoulou, H. The effect of competitive cations and anions on ion exchange of heavy metals. Sep. Purif. Technol. 46, 202–207 (2005).

    Google Scholar 

  30. William Kajjumba, G., Emik, S., Öngen, A., Kurtulus Özcan, H. & Aydın, S. Modelling of adsorption kinetic processes—Errors, theory and application. In Advanced Sorption Process Applications (ed. Edebali, S.) (IntechOpen, 2019). https://doi.org/10.5772/intechopen.80495.

  31. Gu, S., Kang, X., Wang, L., Lichtfouse, E. & Wang, C. Clay mineral adsorbents for heavy metal removal from wastewater: a review. Environ. Chem. Lett. 17, 629–654 (2019).

    Google Scholar 

  32. Martin, L. A., Wissocq, A., Benedetti, M. F. & Latrille, C. Thallium (Tl) sorption onto illite and smectite: Implications for Tl mobility in the environment. Geochim. Cosmochim. Acta 230, 1–16 (2018).

    Google Scholar 

  33. Guo, W. et al. Warming-Induced Vegetation Greening May Aggravate Soil Mercury Levels Worldwide. Environ. Sci. Technol. 58, 15078–15089 (2024).

    Google Scholar 

  34. Poggio, L. et al. SoilGrids 2.0: producing soil information for the globe with quantified spatial uncertainty. Soil 7, 217–240 (2021).

    Google Scholar 

  35. Soares, L. C., Egreja Filho, F. B., Linhares, L. A., Windmoller, C. C. & Yoshida, M. I. Accumulation and oxidation of elemental mercury in tropical soils. Chemosphere 134, 181–191 (2015).

    Google Scholar 

  36. O’Connor, D. et al. Mercury speciation, transformation, and transportation in soils, atmospheric flux, and implications for risk management: A critical review. Environ. Int. 126, 747–761 (2019).

    Google Scholar 

  37. Chen, L. et al. Global Distribution of Mercury in Foliage Predicted by Machine Learning. Environ. Sci. Technol. 58, 15629–15637 (2024).

    Google Scholar 

  38. Xiao, C. et al. Understanding the Global Distribution of Groundwater Sulfate and Assessing Population at Risk. Environ. Sci. Technol. 58, 21002–21014 (2024).

    Google Scholar 

  39. Ayotte, J. D., Medalie, L., Qi, S. L., Backer, L. C. & Nolan, B. T. Estimating the High-Arsenic Domestic-Well Population in the Conterminous United States. Environ. Sci. Technol. 51, 12443–12454 (2017).

    Google Scholar 

  40. Orgiazzi, A., Ballabio, C., Panagos, P., Jones, A. & Fernández-Ugalde, O. LUCAS Soil, the largest expandable soil dataset for Europe: a review. Eur. J. Soil Sci. 69, 140–153 (2018).

    Google Scholar 

  41. Tørseth, K. et al. Introduction to the European Monitoring and Evaluation Programme (EMEP) and observed atmospheric composition change during 1972–2009. Atmos. Chem. Phys. 12, 5447–5481 (2012).

    Google Scholar 

  42. Lobo Vicente, J. et al. HBM4EU results support the Chemicals’ Strategy for Sustainability and the Zero-Pollution Action Plan. Int. J. Hyg. Environ. Health 248, 114111 (2023).

    Google Scholar 

  43. Panagos, P., Jones, A., Lugato, E. & Ballabio, C. A Soil Monitoring Law for Europe. Glob. Chall. 9, 2400336 (2025).

    Google Scholar 

  44. Tang, F. H. M. et al. CROPGRIDS: a global geo-referenced dataset of 173 crops. Sci. Data 11, 413 (2024).

    Google Scholar 

  45. Liu, L., Cao, X., Li, S. & Jie, N. A 31-year (1990–2020) global gridded population dataset generated by cluster analysis and statistical learning. Sci. Data 11, 124 (2024).

    Google Scholar 

  46. Malasani, C. R. et al. Modeling of mercury deposition in India: evaluating emission inventories and anthropogenic impacts. Environ. Sci.:Processes Impacts 26, 1999–2009 (2024).

    Google Scholar 

  47. Vudamala, K. et al. Comparative Assessment of Heavy Metal Contamination and Pollution Trends in Indian Estuarine Sediments: Insights from East and West Coasts of India. ACS ES&T Water 5, 2760–2777 (2025).

    Google Scholar 

  48. Jetashree, Wicke, B. & Liang, S. Parsing India’s mercury crisis: A socioeconomic take reveals the defining role of domestic consumerism. J. Cleaner Prod. 463, 142742 (2024).

    Google Scholar 

  49. Kumar, A. et al. Mercury in the natural environment: Biogeochemical cycles and associated health risks. J. Geochem. Explor. 267, 107594 (2024).

    Google Scholar 

  50. Sutou, A. & Wang, J. Influence-Balanced XGBoost: Improving XGBoost for Imbalanced Data Using Influence Functions. IEEE Access 12, 193473–193486 (2024).

    Google Scholar 

  51. Rauret, G. et al. Improvement of the BCR three step sequential extraction procedure prior to the certification of new sediment and soil reference materials. J. Environ. Monit. 1, 57–61 (1999).

    Google Scholar 

  52. Kumar, A. K., Mai, N. N., Tian, K. & Xia, Y. Isochronous Temporal Metric for Neighbourhood Analysis in Classification Tasks. SN Comput Sci 4, 807 (2023).

    Google Scholar 

  53. Jiang, Y. et al. Topological representations of crystalline compounds for the machine-learning prediction of materials properties. npj Comput. Mater. 7, 28 (2021).

    Google Scholar 

  54. Shaheen, S. M. & Rinklebe, J. Geochemical fractions of chromium, copper, and zinc and their vertical distribution in floodplain soil profiles along the Central Elbe River, Germany. Geoderma 228-229, 142–159 (2014).

    Google Scholar 

  55. Bogusz, A. & Oleszczuk, P. Sequential extraction of nickel and zinc in sewage sludge- or biochar/sewage sludge-amended soil. Sci. Total Environ. 636, 927–935 (2018).

    Google Scholar 

  56. Ye, J. et al. New magnetic proxies to reveal source and bioavailability of heavy metals in contaminated soils. J. Hazard. Mater. 479, 135665 (2024).

    Google Scholar 

  57. Chen, M. et al. Unstable geogenic arsenic in reclaimed coastal soils poses environmental risks. Commun. Earth Environ. 6, 633 (2025).

    Google Scholar 

  58. Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).

    Google Scholar 

  59. Theng, D. & Bhoyar, K. K. Feature selection techniques for machine learning: a survey of more than two decades of research. Knowl. Inf. Syst. 66, 1575–1637 (2024).

    Google Scholar 

  60. Sharma, A. & Singh, M. Batch reinforcement learning approach using recursive feature elimination for network intrusion detection. Eng. Appl. Artif. Intel. 136, 109013 (2024).

    Google Scholar 

  61. Chennamaneni, P. R., Echambadi, R., Hess, J. D. & Syam, N. Diagnosing harmful collinearity in moderated regressions: A roadmap. Int. J. Res. Mark. 33, 172–182 (2016).

    Google Scholar 

  62. Luo, J., Yuan, Y. & Xu, S. Improving GBDT performance on imbalanced datasets: An empirical study of class-balanced loss functions. Neurocomputing 634, 129896 (2025).

    Google Scholar 

  63. Zhang, N. et al. Geochemical-integrated machine learning approach predicts the distribution of cadmium speciation in European and Chinese topsoils. Commun. Earth Environ. 6, 548 (2025).

    Google Scholar 

  64. Qi, X. et al. Machine learning and SHAP value interpretation for predicting comorbidity of cardiovascular disease and cancer with dietary antioxidants. Redox Biology 79, 103470 (2025).

    Google Scholar 

  65. Hu, T. et al. Classification of arsenic contamination in soil across the EU by vis-NIR spectroscopy and machine learning. Int. J. Appl. Earth. Obs. 134, 104158 (2024).

    Google Scholar 

  66. Driscoll, C. T., Mason, R. P., Chan, H. M., Jacob, D. J. & Pirrone, N. Mercury as a Global Pollutant: Sources, Pathways, and Effects. Environ. Sci. Technol. 47, 4967–4983 (2013).

    Google Scholar 

  67. Zhou, J., Obrist, D., Dastoor, A., Jiskra, M. & Ryjkov, A. Vegetation uptake of mercury and impacts on global cycling. Nat. Rev. Earth Environ. 2, 269–284 (2021).

    Google Scholar 

  68. Dastoor, A. et al. Arctic mercury cycling. Nat. Rev. Earth Environ. 3, 270–286 (2022).

    Google Scholar 

  69. Landis, J. D. et al. Quantifying soil accumulation of atmospheric mercury using fallout radionuclide chronometry. Nat. Commun. 15, 5430 (2024).

    Google Scholar 

  70. Peng, D. et al. Human perturbations to mercury in global rivers. Sci. Adv. 11, eadw0471.

  71. van den Hoogen, J. et al. Soil nematode abundance and functional group composition at a global scale. Nature 572, 194–198 (2019).

    Google Scholar 

  72. Hu, T. et al. Figshare. https://doi.org/10.6084/m9.figshare.29554202 (2025).

Download references

Acknowledgements

This work was supported by National Natural Science Foundation of China (No. 22376221 and No. 22176223), Natural Science Foundation of Hunan Province, China (No. 2024JJ2074), and Young Elite Scientists Sponsorship Program by CAST (No. 2023QNRC001). This work was partly supported by the High Performance Computing Center of Central South University.

Author information

Authors and Affiliations

  1. School of Resources and Safety Engineering, Central South University, Changsha, 410083, China

    Tao Hu, Mengting Wu, Qiusong Chen & Chongchong Qi

  2. School of Environmental Science and Engineering, Tianjin University, Tianjin, 300072, China

    Tao Hu, Liyuan Chai & Chongchong Qi

Authors
  1. Tao Hu
    View author publications

    Search author on:PubMed Google Scholar

  2. Mengting Wu
    View author publications

    Search author on:PubMed Google Scholar

  3. Qiusong Chen
    View author publications

    Search author on:PubMed Google Scholar

  4. Liyuan Chai
    View author publications

    Search author on:PubMed Google Scholar

  5. Chongchong Qi
    View author publications

    Search author on:PubMed Google Scholar

Contributions

T.H., and C.C.Q. designed the project. T.H., C.C.Q., and M.T.W. constructed the modeling methodology and performed the modeling. L.Y.C., and Q.S.C. participated in results analysis and discussion. T.H. wrote the initial draft of the manuscript, and all authors edited the manuscript.

Corresponding author

Correspondence to Chongchong Qi.

Ethics declarations

Competing interests

The authors declare no competing interests

Peer review

Peer review information

Communications Earth & Environment thanks Abiot Molla and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Sadia Ilyas and Somaparna Ghosh. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Transparent Peer Review file

Supplementary Information

Supplementary Data

Description of Additional Supplementary File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hu, T., Wu, M., Chen, Q. et al. Machine learning uncovers dominant fractions of heavy metal(loid)s in global soils. Commun Earth Environ (2026). https://doi.org/10.1038/s43247-026-03221-8

Download citation

  • Received: 21 August 2025

  • Accepted: 14 January 2026

  • Published: 31 January 2026

  • DOI: https://doi.org/10.1038/s43247-026-03221-8

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • Reviews & Analysis
  • News & Comment
  • Collections
  • Follow us on Twitter
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims & Scope
  • Journal Information
  • Open Access Fees and Funding
  • Journal Metrics
  • Editors
  • Editorial Board
  • Calls for Papers
  • Referees
  • Editorial Values Statement
  • Editorial policies
  • Conferences
  • Contact

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Communications Earth & Environment (Commun Earth Environ)

ISSN 2662-4435 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing