Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
Ensemble machine learning strategies for mineral prospectivity mapping under data scarcity
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 15 February 2026

Ensemble machine learning strategies for mineral prospectivity mapping under data scarcity

  • Poorya Amirajlo1,
  • Hossein Hassani1,
  • Amin Beiranvand Pour1 &
  • …
  • Narges Habibkhah1 

Scientific Reports , Article number:  (2026) Cite this article

  • 360 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Engineering
  • Mathematics and computing

Abstract

In mineral prospectivity mapping (MPM), the scarcity of labeled data and severe class imbalance often undermine the stability and reliability of machine learning models. This study advances a reliability-centered framework that prioritizes calibration and reproducibility over marginal accuracy gains when training data are limited. Two ensemble configurations, Light Gradient Boosting Machine combined with AdaBoost, and Support Vector Machine combined with Gaussian Naive Bayes, were systematically evaluated using three hyperparameter optimization strategies: Grid Search, Random Search, and Bayesian Optimization. The Synthetic Minority Oversampling Technique (SMOTE) and five-fold cross-validation were applied to counteract data imbalance and improve model robustness. Results from the Dehaq Pb–Zn district, within the Sanandaj–Sirjan Zone of Iran, reveal that while Bayesian Optimization can yield slightly higher Receiver Operating Characteristic (ROC)-Area Under the Curve (AUC) scores, Grid Search consistently delivers more stable, well-calibrated probabilities under extreme data scarcity. The SVM + GNB ensemble tuned via Grid Search demonstrated superior balance between discrimination and reliability, achieving an AUC of 0.90 with the most consistent calibration curve, whereas the Bayesian Optimization configuration marginally reached the highest AUC (0.95). These findings highlight that stability and probability calibration are decisive for trustworthy mineral prospectivity modeling in data-scarce environments. The proposed framework provides a reproducible and interpretable pathway for translating machine learning predictions into reliable exploration decisions, supporting risk-aware targeting in early-stage mineral exploration.

Data availability

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

References

  1. Bonham-Carter, G. Geographic Information Systems for Geoscientists: Modelling with GIS (Elsevier) (1994).

  2. Carranza, E. J. M. Geochemical Anomaly and Mineral Prospectivity Mapping in GIS. 11 (Elsevier) (2008).

  3. Parsa, M. & Carranza, E. J. M. Modulating the impacts of stochastic uncertainties linked to deposit locations in data-driven predictive mapping of mineral prospectivity. Nat. Resour. Res. 30 (5), 3081–3097 (2021).

    Google Scholar 

  4. Carranza, E. J. M. & Laborte, A. G. Random forest predictive modeling of mineral prospectivity with small number of prospects and data with missing values in abra (Philippines). Comput. Geosci. 74, 60–70 (2015).

    Google Scholar 

  5. Chen, Y., Wang, S., Zhao, Q. & Sun, G. Detection of multivariate geochemical anomalies using the bat-optimized isolation forest and bat-optimized elliptic envelope models. J. Earth Sci. 32 (2), 415–426 (2021).

    Google Scholar 

  6. Mao, X. et al. Bayesian decomposition modelling: an interpretable nonlinear approach for mineral prospectivity mapping. Math. Geosci. 55 (7), 897–942 (2023).

    Google Scholar 

  7. Chen, Y. & Wu, W. Mapping mineral prospectivity using an extreme learning machine regression. Ore Geol. Rev. 80, 200–213 (2017).

    Google Scholar 

  8. Sun, T. et al. Data-driven predictive modelling of mineral prospectivity using machine learning and deep learning methods: A case study from Southern Jiangxi Province, China. Minerals 10 (2), 102 (2020).

    Google Scholar 

  9. Chen, G., Cheng, Q. & Puetz, S. Data-driven discovery in geosciences: opportunities and challenges. Math. Geosci. 55 (3), 287–293 (2023).

    Google Scholar 

  10. Xiong, Y. & Zuo, R. GIS-based rare events logistic regression for mineral prospectivity mapping. Comput. Geosci. 111, 18–25 (2018).

    Google Scholar 

  11. Abedi, M., Norouzi, G. H. & Bahroudi, A. Support vector machine for multi-classification of mineral prospectivity areas. Comput. Geosci. 46, 272–283 (2012).

    Google Scholar 

  12. Zuo, R. & Carranza, E. J. M. Support vector machine: A tool for mapping mineral prospectivity. Comput. Geosci. 37 (12), 1967–1975 (2011).

    Google Scholar 

  13. Parsa, M. & Maghsoudi, A. Assessing the effects of mineral systems-derived exploration targeting criteria for random Forests-based predictive mapping of mineral prospectivity in Ahar-Arasbaran area, Iran. Ore Geol. Rev. 138, 104399 (2021).

    Google Scholar 

  14. Talebi, H., Peeters, L. J. M., Otto, A. & Tolosana-Delgado, R. A truly spatial random forests algorithm for geoscience data analysis and modelling. Math. Geosci. 54(1), 1–22 (2022).

    Google Scholar 

  15. Maepa, F., Smith, R. S. & Tessema, A. Support vector machine and artificial neural network modelling of orogenic gold prospectivity mapping in the Swayze greenstone belt, Ontario, Canada. Ore Geol. Rev. 130, 103968 (2021).

    Google Scholar 

  16. Li, Q., Chen, G. & Luo, L. Mineral prospectivity mapping using attention-based convolutional neural network. Ore Geol. Rev. 156, 105381 (2023).

    Google Scholar 

  17. Xu, Y. et al. Mineral prospectivity mapping by deep learning method in Yawan-Daqiao area, Gansu. Ore Geol. Rev. 138, 104316 (2021).

    Google Scholar 

  18. Yang, N., Zhang, Z., Yang, J. & Hong, Z. Mineral prospectivity prediction by integration of convolutional autoencoder network and random forest. Nat. Resour. Res. 31 (3), 1103–1119 (2022).

    Google Scholar 

  19. Zuo, R. & Xu, Y. Graph deep learning model for mapping mineral prospectivity. Math. Geosci. 55 (1), 1–21 (2023).

    Google Scholar 

  20. Cheng, Q. Mapping singularities with stream sediment geochemical data for prediction of undiscovered mineral deposits in Gejiu, Yunnan Province, China. Ore Geol. Rev. 32 (1–2), 314–324 (2007).

    Google Scholar 

  21. Van Engelen, J. E. & Hoos, H. H. A survey on semi-supervised learning. Mach. Learn. 109 (2), 373–440 (2020).

    Google Scholar 

  22. Bruzzone, L., Chi, M. & Marconcini, M. A novel transductive SVM for semisupervised classification of remote-sensing images. IEEE Trans. Geosci. Remote Sens. 44 (11), 3363–3373 (2006).

    Google Scholar 

  23. Camps-Valls, G., Marsheva, T. V. B. & Zhou, D. Semi-supervised graph-based hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 45 (10), 3044–3054 (2007).

    Google Scholar 

  24. Zhu, X., Ghahramani, Z. & Lafferty, J. D. Semi-supervised learning using gaussian fields and harmonic functions. In: Proceedings of the 20th International Conference on Machine Learning (ICML-03). 912–919. (2003).

  25. Miller, D. J. & Uyar, H. A mixture of experts classifier with learning based on both labelled and unlabelled data. Adv Neural Inf. Process. Syst ;9. (1996).

  26. Salimans, T. et al. Improved techniques for training Gans. Adv Neural Inf. Process. Syst ;29. (2016).

  27. Fatehi, M. & Asadi, H. H. Data integration modeling applied to drill hole planning through semi-supervised learning: A case study from the dalli Cu-Au porphyry deposit in the central Iran. J. Afr. Earth Sci. 128, 147–160 (2017).

    Google Scholar 

  28. Wang, J., Zuo, R. & Xiong, Y. Mapping mineral prospectivity via semi-supervised random forest. Nat. Resour. Res. 29, 189–202 (2020).

    Google Scholar 

  29. Li, Q., Chen, G. & Wang, D. Mineral prospectivity mapping using semi-supervised machine learning. Math. Geosci. 57 (2), 275–305 (2025).

    Google Scholar 

  30. Chen, G., Kusky, T., Luo, L., Li, Q. & Cheng, Q. Hadean tectonics: insights from machine learning. Geology 51 (8), 718–722 (2023).

    Google Scholar 

  31. Tao, J., Yuan, F., Zhang, N. & Chang, J. Three-dimensional prospectivity modeling of Honghai volcanogenic massive sulfide Cu–Zn deposit, Eastern Tianshan, Northwestern China using weights of evidence and fuzzy logic. Math. Geosci. 53, 131–162 (2021).

    Google Scholar 

  32. Chen, M. & Xiao, F. Projection pursuit random forest for mineral prospectivity mapping. Math. Geosci. 55 (7), 963–987 (2023).

    Google Scholar 

  33. Chen, G. et al. Mineral prospectivity mapping based on wavelet neural network and Monte Carlo simulations in the Nanling W-Sn metallogenic Province. Ore Geol. Rev. 143, 104765 (2022).

    Google Scholar 

  34. Bigdeli, A., Maghsoudi, A. & Ghezelbash, R. Application of self-organizing map (SOM) and K-means clustering algorithms for portraying geochemical anomaly patterns in Moalleman district, NE Iran. J. Geochemical Explor. 233, 106923 (2022).

    Google Scholar 

  35. Zuo, R., Xiong, Y., Wang, J. & Carranza, E. J. M. Deep learning and its application in geochemical mapping. Earth Sci. Rev. 192, 1–14 (2019).

    Google Scholar 

  36. Zuo, R. G., Peng, Y., Li, T. & Xiong, Y. Challenges of geological prospecting big data mining and integration using deep learning algorithms. Earth Sci. 46 (1), 350–358 (2021).

    Google Scholar 

  37. McMillan, M., Haber, E., Peters, B. & Fohring, J. Mineral prospectivity mapping using a VNet convolutional neural network. Lead. Edge. 40 (2), 99–105 (2021).

    Google Scholar 

  38. Yang, N., Zhang, Z., Yang, J. & Hong, Z. Applications of data augmentation in mineral prospectivity prediction based on convolutional neural networks. Comput. Geosci. 161, 105075 (2022).

    Google Scholar 

  39. Yin, B., Zuo, R. & Sun, S. Mineral prospectivity mapping using deep self-attention model. Nat. Resour. Res. 32 (1), 37–56 (2023).

    Google Scholar 

  40. Luo, Z., Farahbakhsh, E., Müller, R. D. & Zuo, R. Multivariate statistical analysis and bespoke deviation network modeling for geochemical anomaly detection of rare Earth elements. Appl Geochem. 174, 106146 (2024).

    Google Scholar 

  41. Hosseini-Dinani, H. & Yazdi, M. Multi-dataset analysis to assess mineral potential of MVT-type zinc-lead deposits in Malayer-Isfahan metallogenic belt, Iran. Arab. J. Geosci. 14, 1–23 (2021).

    Google Scholar 

  42. Hajihosseinlou, M., Maghsoudi, A. & Ghezelbash, R. Intelligent mapping of geochemical anomalies: adaptation of DBSCAN and mean-shift clustering approaches. J. Geochemical Explor. 258, 107393 (2024).

    Google Scholar 

  43. Yousefi, M. & Hronsky, J. M. A. Translation of the function of hydrothermal mineralization-related focused fluid flux into a mappable exploration criterion for mineral exploration targeting. Appl. Geochem. 149, 105561 (2023).

    Google Scholar 

  44. Khalifani, F. M. et al. Generation of an efficient structural evidence layer for mineral exploration targeting. J. Afr. Earth Sci. 160, 103609 (2019).

    Google Scholar 

  45. Kreuzer, M., Fenske, N., Schnelzer, M. & Walsh, L. Lung cancer risk at low radon exposure rates in German uranium miners. Br. J. Cancer. 113 (9), 1367–1369 (2015).

    Google Scholar 

  46. Yousefi, M. & Nykänen, V. Data-driven logistic-based weighting of geochemical and geological evidence layers in mineral prospectivity mapping. J. Geochemical Explor. 164, 94–106 (2016).

    Google Scholar 

  47. Amirajlo, P., Hassani, H., Beiranvand Pour, A. & Habibkhah, N. Detection of multivariate geochemical anomalies using machine learning (ML) algorithms in Dehaq Pb-Zn mineralization, Sanandaj-Sirjan zone, Isfahan, Iran. Earth Sci. Inf. 18 (1), 124 (2025).

    Google Scholar 

Download references

Acknowledgements

We would appreciate the Department of Mining Engineering at Amirkabir University of Technology (Tehran Polytechnic). This study is part of the research activity carried out during doctoral research (By the first author) at the Department of Mining Engineering at Amirkabir University of Technology (Tehran Polytechnic).

Author information

Authors and Affiliations

  1. Department of Mining Engineering, Amirkabir University of Technology (Tehran Polytechnic), 1591634311, Tehran, Iran

    Poorya Amirajlo, Hossein Hassani, Amin Beiranvand Pour & Narges Habibkhah

Authors
  1. Poorya Amirajlo
    View author publications

    Search author on:PubMed Google Scholar

  2. Hossein Hassani
    View author publications

    Search author on:PubMed Google Scholar

  3. Amin Beiranvand Pour
    View author publications

    Search author on:PubMed Google Scholar

  4. Narges Habibkhah
    View author publications

    Search author on:PubMed Google Scholar

Contributions

Conceptualization, P.A.; Methodology, P.A.; Software, P. A.; Validation, P. A., H.H., N.H; Writing – Original Draft Preparation, P.A., H.H; Writing – Review & Editing, P.A., H.H., A.B.P.; Visualization, N.H.; Supervision, H.H.; Project administration, P.A. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Hossein Hassani.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Amirajlo, P., Hassani, H., Pour, A.B. et al. Ensemble machine learning strategies for mineral prospectivity mapping under data scarcity. Sci Rep (2026). https://doi.org/10.1038/s41598-026-40125-1

Download citation

  • Received: 04 November 2025

  • Accepted: 10 February 2026

  • Published: 15 February 2026

  • DOI: https://doi.org/10.1038/s41598-026-40125-1

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Mineral prospectivity mapping
  • Ensemble learning
  • Data scarcity
  • Model calibration
  • Grid Search
  • SMOTE
  • Bayesian Optimization
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics