Abstract
In mineral prospectivity mapping (MPM), the scarcity of labeled data and severe class imbalance often undermine the stability and reliability of machine learning models. This study advances a reliability-centered framework that prioritizes calibration and reproducibility over marginal accuracy gains when training data are limited. Two ensemble configurations, Light Gradient Boosting Machine combined with AdaBoost, and Support Vector Machine combined with Gaussian Naive Bayes, were systematically evaluated using three hyperparameter optimization strategies: Grid Search, Random Search, and Bayesian Optimization. The Synthetic Minority Oversampling Technique (SMOTE) and five-fold cross-validation were applied to counteract data imbalance and improve model robustness. Results from the Dehaq Pb–Zn district, within the Sanandaj–Sirjan Zone of Iran, reveal that while Bayesian Optimization can yield slightly higher Receiver Operating Characteristic (ROC)-Area Under the Curve (AUC) scores, Grid Search consistently delivers more stable, well-calibrated probabilities under extreme data scarcity. The SVM + GNB ensemble tuned via Grid Search demonstrated superior balance between discrimination and reliability, achieving an AUC of 0.90 with the most consistent calibration curve, whereas the Bayesian Optimization configuration marginally reached the highest AUC (0.95). These findings highlight that stability and probability calibration are decisive for trustworthy mineral prospectivity modeling in data-scarce environments. The proposed framework provides a reproducible and interpretable pathway for translating machine learning predictions into reliable exploration decisions, supporting risk-aware targeting in early-stage mineral exploration.
Data availability
The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.
References
Bonham-Carter, G. Geographic Information Systems for Geoscientists: Modelling with GIS (Elsevier) (1994).
Carranza, E. J. M. Geochemical Anomaly and Mineral Prospectivity Mapping in GIS. 11 (Elsevier) (2008).
Parsa, M. & Carranza, E. J. M. Modulating the impacts of stochastic uncertainties linked to deposit locations in data-driven predictive mapping of mineral prospectivity. Nat. Resour. Res. 30 (5), 3081–3097 (2021).
Carranza, E. J. M. & Laborte, A. G. Random forest predictive modeling of mineral prospectivity with small number of prospects and data with missing values in abra (Philippines). Comput. Geosci. 74, 60–70 (2015).
Chen, Y., Wang, S., Zhao, Q. & Sun, G. Detection of multivariate geochemical anomalies using the bat-optimized isolation forest and bat-optimized elliptic envelope models. J. Earth Sci. 32 (2), 415–426 (2021).
Mao, X. et al. Bayesian decomposition modelling: an interpretable nonlinear approach for mineral prospectivity mapping. Math. Geosci. 55 (7), 897–942 (2023).
Chen, Y. & Wu, W. Mapping mineral prospectivity using an extreme learning machine regression. Ore Geol. Rev. 80, 200–213 (2017).
Sun, T. et al. Data-driven predictive modelling of mineral prospectivity using machine learning and deep learning methods: A case study from Southern Jiangxi Province, China. Minerals 10 (2), 102 (2020).
Chen, G., Cheng, Q. & Puetz, S. Data-driven discovery in geosciences: opportunities and challenges. Math. Geosci. 55 (3), 287–293 (2023).
Xiong, Y. & Zuo, R. GIS-based rare events logistic regression for mineral prospectivity mapping. Comput. Geosci. 111, 18–25 (2018).
Abedi, M., Norouzi, G. H. & Bahroudi, A. Support vector machine for multi-classification of mineral prospectivity areas. Comput. Geosci. 46, 272–283 (2012).
Zuo, R. & Carranza, E. J. M. Support vector machine: A tool for mapping mineral prospectivity. Comput. Geosci. 37 (12), 1967–1975 (2011).
Parsa, M. & Maghsoudi, A. Assessing the effects of mineral systems-derived exploration targeting criteria for random Forests-based predictive mapping of mineral prospectivity in Ahar-Arasbaran area, Iran. Ore Geol. Rev. 138, 104399 (2021).
Talebi, H., Peeters, L. J. M., Otto, A. & Tolosana-Delgado, R. A truly spatial random forests algorithm for geoscience data analysis and modelling. Math. Geosci. 54(1), 1–22 (2022).
Maepa, F., Smith, R. S. & Tessema, A. Support vector machine and artificial neural network modelling of orogenic gold prospectivity mapping in the Swayze greenstone belt, Ontario, Canada. Ore Geol. Rev. 130, 103968 (2021).
Li, Q., Chen, G. & Luo, L. Mineral prospectivity mapping using attention-based convolutional neural network. Ore Geol. Rev. 156, 105381 (2023).
Xu, Y. et al. Mineral prospectivity mapping by deep learning method in Yawan-Daqiao area, Gansu. Ore Geol. Rev. 138, 104316 (2021).
Yang, N., Zhang, Z., Yang, J. & Hong, Z. Mineral prospectivity prediction by integration of convolutional autoencoder network and random forest. Nat. Resour. Res. 31 (3), 1103–1119 (2022).
Zuo, R. & Xu, Y. Graph deep learning model for mapping mineral prospectivity. Math. Geosci. 55 (1), 1–21 (2023).
Cheng, Q. Mapping singularities with stream sediment geochemical data for prediction of undiscovered mineral deposits in Gejiu, Yunnan Province, China. Ore Geol. Rev. 32 (1–2), 314–324 (2007).
Van Engelen, J. E. & Hoos, H. H. A survey on semi-supervised learning. Mach. Learn. 109 (2), 373–440 (2020).
Bruzzone, L., Chi, M. & Marconcini, M. A novel transductive SVM for semisupervised classification of remote-sensing images. IEEE Trans. Geosci. Remote Sens. 44 (11), 3363–3373 (2006).
Camps-Valls, G., Marsheva, T. V. B. & Zhou, D. Semi-supervised graph-based hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 45 (10), 3044–3054 (2007).
Zhu, X., Ghahramani, Z. & Lafferty, J. D. Semi-supervised learning using gaussian fields and harmonic functions. In: Proceedings of the 20th International Conference on Machine Learning (ICML-03). 912–919. (2003).
Miller, D. J. & Uyar, H. A mixture of experts classifier with learning based on both labelled and unlabelled data. Adv Neural Inf. Process. Syst ;9. (1996).
Salimans, T. et al. Improved techniques for training Gans. Adv Neural Inf. Process. Syst ;29. (2016).
Fatehi, M. & Asadi, H. H. Data integration modeling applied to drill hole planning through semi-supervised learning: A case study from the dalli Cu-Au porphyry deposit in the central Iran. J. Afr. Earth Sci. 128, 147–160 (2017).
Wang, J., Zuo, R. & Xiong, Y. Mapping mineral prospectivity via semi-supervised random forest. Nat. Resour. Res. 29, 189–202 (2020).
Li, Q., Chen, G. & Wang, D. Mineral prospectivity mapping using semi-supervised machine learning. Math. Geosci. 57 (2), 275–305 (2025).
Chen, G., Kusky, T., Luo, L., Li, Q. & Cheng, Q. Hadean tectonics: insights from machine learning. Geology 51 (8), 718–722 (2023).
Tao, J., Yuan, F., Zhang, N. & Chang, J. Three-dimensional prospectivity modeling of Honghai volcanogenic massive sulfide Cu–Zn deposit, Eastern Tianshan, Northwestern China using weights of evidence and fuzzy logic. Math. Geosci. 53, 131–162 (2021).
Chen, M. & Xiao, F. Projection pursuit random forest for mineral prospectivity mapping. Math. Geosci. 55 (7), 963–987 (2023).
Chen, G. et al. Mineral prospectivity mapping based on wavelet neural network and Monte Carlo simulations in the Nanling W-Sn metallogenic Province. Ore Geol. Rev. 143, 104765 (2022).
Bigdeli, A., Maghsoudi, A. & Ghezelbash, R. Application of self-organizing map (SOM) and K-means clustering algorithms for portraying geochemical anomaly patterns in Moalleman district, NE Iran. J. Geochemical Explor. 233, 106923 (2022).
Zuo, R., Xiong, Y., Wang, J. & Carranza, E. J. M. Deep learning and its application in geochemical mapping. Earth Sci. Rev. 192, 1–14 (2019).
Zuo, R. G., Peng, Y., Li, T. & Xiong, Y. Challenges of geological prospecting big data mining and integration using deep learning algorithms. Earth Sci. 46 (1), 350–358 (2021).
McMillan, M., Haber, E., Peters, B. & Fohring, J. Mineral prospectivity mapping using a VNet convolutional neural network. Lead. Edge. 40 (2), 99–105 (2021).
Yang, N., Zhang, Z., Yang, J. & Hong, Z. Applications of data augmentation in mineral prospectivity prediction based on convolutional neural networks. Comput. Geosci. 161, 105075 (2022).
Yin, B., Zuo, R. & Sun, S. Mineral prospectivity mapping using deep self-attention model. Nat. Resour. Res. 32 (1), 37–56 (2023).
Luo, Z., Farahbakhsh, E., Müller, R. D. & Zuo, R. Multivariate statistical analysis and bespoke deviation network modeling for geochemical anomaly detection of rare Earth elements. Appl Geochem. 174, 106146 (2024).
Hosseini-Dinani, H. & Yazdi, M. Multi-dataset analysis to assess mineral potential of MVT-type zinc-lead deposits in Malayer-Isfahan metallogenic belt, Iran. Arab. J. Geosci. 14, 1–23 (2021).
Hajihosseinlou, M., Maghsoudi, A. & Ghezelbash, R. Intelligent mapping of geochemical anomalies: adaptation of DBSCAN and mean-shift clustering approaches. J. Geochemical Explor. 258, 107393 (2024).
Yousefi, M. & Hronsky, J. M. A. Translation of the function of hydrothermal mineralization-related focused fluid flux into a mappable exploration criterion for mineral exploration targeting. Appl. Geochem. 149, 105561 (2023).
Khalifani, F. M. et al. Generation of an efficient structural evidence layer for mineral exploration targeting. J. Afr. Earth Sci. 160, 103609 (2019).
Kreuzer, M., Fenske, N., Schnelzer, M. & Walsh, L. Lung cancer risk at low radon exposure rates in German uranium miners. Br. J. Cancer. 113 (9), 1367–1369 (2015).
Yousefi, M. & Nykänen, V. Data-driven logistic-based weighting of geochemical and geological evidence layers in mineral prospectivity mapping. J. Geochemical Explor. 164, 94–106 (2016).
Amirajlo, P., Hassani, H., Beiranvand Pour, A. & Habibkhah, N. Detection of multivariate geochemical anomalies using machine learning (ML) algorithms in Dehaq Pb-Zn mineralization, Sanandaj-Sirjan zone, Isfahan, Iran. Earth Sci. Inf. 18 (1), 124 (2025).
Acknowledgements
We would appreciate the Department of Mining Engineering at Amirkabir University of Technology (Tehran Polytechnic). This study is part of the research activity carried out during doctoral research (By the first author) at the Department of Mining Engineering at Amirkabir University of Technology (Tehran Polytechnic).
Author information
Authors and Affiliations
Contributions
Conceptualization, P.A.; Methodology, P.A.; Software, P. A.; Validation, P. A., H.H., N.H; Writing – Original Draft Preparation, P.A., H.H; Writing – Review & Editing, P.A., H.H., A.B.P.; Visualization, N.H.; Supervision, H.H.; Project administration, P.A. All authors have read and agreed to the published version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Amirajlo, P., Hassani, H., Pour, A.B. et al. Ensemble machine learning strategies for mineral prospectivity mapping under data scarcity. Sci Rep (2026). https://doi.org/10.1038/s41598-026-40125-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-40125-1