Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Reports
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific reports
  3. articles
  4. article
Diagnosing method conditioned bias in mineral resource estimation using a mutual information and entropy uncertainty indicator
Download PDF
Download PDF
  • Article
  • Open access
  • Published: 20 May 2026

Diagnosing method conditioned bias in mineral resource estimation using a mutual information and entropy uncertainty indicator

  • Xiaoqing He1,
  • Yuhan Huang1,
  • Bin Wu2,
  • Yingchun Ao3 &
  • …
  • Haijun Jiao4 

Scientific Reports (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Mathematics and computing
  • Solid Earth sciences

Abstract

Conventional evaluation of mineral resource interpolation emphasizes predictive accuracy and reporting compliance, but these criteria do not directly characterize the stability of estimation outputs with respect to method choice. This study introduces an Mutual Information and Entropy Based Interpolation Uncertainty Indicator (MUI) as a diagnostic measure of method-conditioned instability. MUI is defined from normalized entropy and normalized mutual information over controlled realization ensembles, and evaluates the coherence and stability of interpolation outputs under a prescribed realization protocol. In the case study, comparable multi-realization ensembles were constructed for inverse distance weighting (IDW) and ordinary kriging (OK) for Cu estimation in the skarn and hornfels domains of a heterogeneous porphyry-skarn deposit. The results show that OK produced a less coherent and less stable realization ensemble than IDW, with the contrast most evident in skarn. At the same time, conventional benchmarking against infill drilling data indicates that both methods remained comparatively accurate in both skarn and hornfels. These results show that realization stability and interpolation accuracy describe different aspects of estimation behavior. MUI therefore provides a useful complement to conventional validation by capturing method-dependent differences that standard accuracy-based frameworks do not directly express. Although MUI does not explicitly quantify total methodological uncertainty, it can serve as a practical diagnostic of potential method-induced bias. Interpreted together with infill-drilling benchmarking and constrained resampling, the results further provide a safety-oriented assessment of interpolation behavior in terms of accuracy, robustness, and explainability.

Similar content being viewed by others

Reservoir quality and its controlling diagenetic factors in the Bentiu Formation, Northeastern Muglad Basin, Sudan

Article Open access 16 September 2021

Evolution of strain field and crack prediction in cemented paste backfill specimens based on digital image correlation and computer vision recognition model

Article Open access 28 March 2025

Petrogenesis and geodynamics of the Early Miocene Kashan granitoids in the central Urumieh-Dokhtar Magmatic Arc, Iran

Article Open access 11 May 2026

Funding

This research was supported by National Natural Science Foundation of China (No.42073038).

Author information

Authors and Affiliations

  1. China University of Geosciences Beijing, Beijing, 100083, China

    Xiaoqing He & Yuhan Huang

  2. Norinmining Co., Ltd., Beijing, 100053, China

    Bin Wu

  3. China National Gold Group Co., Ltd., Beijing, 100010, China

    Yingchun Ao

  4. Tibet Huatailong Mining Development Co., Ltd., Tibet Lhasa, 850000, China

    Haijun Jiao

Authors
  1. Xiaoqing He
    View author publications

    Search author on:PubMed Google Scholar

  2. Yuhan Huang
    View author publications

    Search author on:PubMed Google Scholar

  3. Bin Wu
    View author publications

    Search author on:PubMed Google Scholar

  4. Yingchun Ao
    View author publications

    Search author on:PubMed Google Scholar

  5. Haijun Jiao
    View author publications

    Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Yuhan Huang.

Ethics declarations

Ethics approval and consent to participate

This study did not involve human participants or animals.

Patient consent

Not applicable, as this study did not involve patients.

Permission to reproduce material from other sources

No materials from other sources were reproduced in this study.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendices

Appendix A: MUI calculation and realization generation

Appendix A.1 Multi-realization construction of target methods

Spatial sample data are defined over the estimation domain, and a set of candidate interpolation or geostatistical modeling methods is defined for evaluation. Each method is treated as a distinct estimator whose compatibility with the statistical-spatial structure of the data is to be assessed.

For each method, a realization ensemble is constructed under a fixed realization protocol. Deterministic methods generate realizations through controlled perturbations of parameters, data subsets, or initialization conditions, whereas stochastic methods produce realizations intrinsically through simulation procedures. All realizations are mapped onto a common spatial support to ensure comparability.

The choice of spatial grid (support) influences the sensitivity of entropy-based measures and is therefore treated as a fixed condition throughout the analysis.

Appendix A.2 discrete state representation and entropy

Continuous outputs are transformed into a finite set of discrete states to enable consistent estimation of information-theoretic quantities.

Let \({X}_{r}\left(u\right)\) denote the value at location \(u\) for realization \(r\), and let \(S=\left\{1,...,K\right\}\) denote the set of discrete states obtained through a predefined discretization scheme (e.g., equal-width binning). The discretized state is defined as:

$${s_r}\left( u \right)=\emptyset \left( {{X_r}\left( u \right)} \right)~ \in ~S$$
(1)

where \(\varnothing (.)\) is the discretization operator applied consistently across all realizations. For each spatial location \(u\), the empirical probability distribution over state is estimated from the realization ensemble:

$${\hat {p}_u}\left( k \right)=\frac{1}{R}\sum\limits_{{r=1}}^{R} {1\left\{ {{s_r}\left( u \right)=k} \right\},~k \in ~S}$$
(2)

The local Shannon entropy at location \(u\) is defined as:

$$H\left( u \right)= - \sum\limits_{{k=1}}^{K} {{{\hat {p}}_u}\left( k \right)log{{\hat {p}}_u}\left( k \right)}$$
(3)

To enable comparability across different discretization choices, entropy is normalized:

$${H^*}\left( u \right)=\frac{{H\left( u \right)}}{{logK}}$$
(4)

The domain-averaged normalized entropy is then:

$${\bar {H}^*}=\frac{1}{{\left| U \right|}}\mathop \sum \limits_{{u \in U}} {H^*}\left( u \right)$$
(5)

This quantity represents the overall level of state dispersion across realizations.

Appendix A.3 Mutual information across realizations

To quantify structural agreement, mutual information is computed between realization fields. Let \({X}_{r}\) and \({X}_{r{\prime}}\) denote two realizations over the domain. Their joint empirical distribution is defined as:

$${\hat {p}_{r,r'}}\left( {k,l} \right)=\frac{1}{{\left| U \right|}}\sum\limits_{{u \in U}} {1\left\{ {{s_r}\left( u \right)=k,~{s_{r'}}\left( u \right)=l} \right\}}$$
(6)

The marginal distributions are:

$${\hat {p}_r}\left( k \right)=\frac{1}{{\left| U \right|}}\sum\limits_{u} {1\left\{ {{s_r}\left( u \right)=k} \right\},} ~~{\hat {p}_{r'}}\left( l \right)=\frac{1}{{\left| U \right|}}\sum\limits_{u} {1\left\{ {{s_{r'}}\left( u \right)=l} \right\}}$$
(7)

The mutual information between realizations \(r,r^{\prime}\) is:

$$I\left( {{X_r};{X_{r'}}} \right)=\sum\limits_{{k=1}}^{K} {\sum\limits_{{l=1}}^{K} {{{\hat {p}}_{r,r'}}\left( {k,l} \right)log\frac{{{{\hat {p}}_{r,r'}}\left( {k,l} \right)}}{{{{\hat {p}}_r}\left( k \right){{\hat {p}}_{r'}}\left( l \right)}}} }$$
(8)

To ensure comparability, mutual information is normalized:

$$M\left( {r,r'} \right)=\frac{{I\left( {{X_r};{X_{r'}}} \right)}}{{max\left\{ {H\left( {{X_r}} \right),H\left( {{X_{r'}}} \right)} \right\}}}$$
(9)

where

$$H\left( {{X_r}} \right)= - \sum\limits_{k} {{{\hat {p}}_r}\left( k \right)log{{\hat {p}}_r}\left( k \right)}$$
(10)

This normalization ensures \(0\le M(r,r^{\prime})\le 1\) and provides a consistent scale for comparing structural agreement across realizations. While alternative normalization schemes exist, this formulation is adopted for stability and interpretability in discretized empirical settings.

The average normalized mutual information across all realization pairs is:

$$\bar {M}=\frac{2}{{R\left( {R - 1} \right)}}\sum\limits_{{r \leqslant r'}} {M\left( {r,r'} \right)}$$
(11)

Appendix A.4 Definition of the mutual information-entropy-based interpolation uncertainty indicator (MUI)

Based on the two complementary components defined above, The Mutual Information-Entropy-Based Interpolation Uncertainty Indicator (MUI) is defined as:

$$MUI={\bar {H}^*}\left( {1 - \bar {M}} \right)$$
(12)

Based on the definition, \(0\le MUI\le 1\).

A.5 Limiting Cases

If realizations are identical: \({\overline{H}}^{*}=0, \bar{M}=1 \Rightarrow MUI=0\);

If realizations are highly dispersed but structurally consistent:

$${\bar {H}^{*~}}\;high,\bar {M}\;high \Rightarrow MUI\;moderate,$$

If realizations are both dispersed and structurally inconsistent:

$${\bar {H}^*}\;high,~\bar {M}\;~low~ \Rightarrow ~MUI\;high;$$

MUI is conditional on discretization, spatial support, and realization design.

A higher MUI value indicates weaker compatibility between method assumptions and data characteristics under the prescribed realization protocol. It should therefore be interpreted as a diagnostic signal of potential method-induced bias, rather than as a direct measurement of methodological bias. Conversely, a lower MUI reflects stronger convergence among realizations and higher method-data compatibility.

Appendix B Method-Conditioned MUI Behavior (Heuristic Approximate)

This appendix provides method-conditioned, first-order approximations of MUI behavior under simplified assumptions. These expressions are not closed-form derivations but serve as interpretive tools grounded in the formal definition:

$$MUI={\bar {H}^*}\left( {1 - \bar {M}} \right)$$

All method-specific approximations in this appendix are derived by examining how a given method class tends to affect the normalized state-dispersion and the normalized structural-consistency. The use of MUI under this assumption helps identify when a method is well-suited to the data spatial structure.

Appendix B.1 Geometry-based methods

Geometry-based methods (e.g., nearest-neighbor, triangulation and polygon partitioning) are assumed to have uncertainty mainly arising from geometric interpretation.

However, the methods do not generate realizations intrinsically. Realization ensembles are constructed through perturbations of sample configuration or partition boundaries.

Under stable partitioning, state assignments remain invariant (Simplified conditions of spatially uniform sampling and stable partition geometry):

$${\hat {p}_u}\left( k \right) \to 1 \Rightarrow 1~~{\bar {H}^*} \to 0,~\bar {M} \Rightarrow 1 \Rightarrow ~MU{I_{geo}} \to ~0$$

Under irregular sampling or unstable boundaries (clustered sampling pattern, irregular boundaries, or local support is poorly constrained):

$${\bar {H}^*} \uparrow ,~\bar {M} \downarrow \Rightarrow MU{I_{geo}} \uparrow$$

Also, under these conditions, a first-order heuristic approximation can be interpreted as:

$$MU{I_{geo}} \propto {\Delta _{perturbation}} \cdot {\Delta _{clustering~or~geometry}}$$

MUI scales with geometric sensitivity to data configuration and partition stability.

Appendix B.2 Variogram-based and kriging methods

Variogram-based methods (e.g., simple kriging, ordinary kriging, indicator kriging) impose an explicit covariance structure on the data.

Realizations are constructed through perturbations of variogram parameters, search neighborhoods, or data subsets.

Let \({\delta}_{\gamma}\) denote the mismatch between the covariance model and the spatial organization of the data.

Under well-specified conditions small \({\delta}_{\gamma}\):

$$\bar {M} \to 1,~{\bar {H}^*} \to \bar {H}_{0}^{*}$$

\({\overline{H}}_{0}^{*}\) is the baseline normalized entropy under well-aligned conditions.

For small mismatch, first-order approximations can be written as:

$$1 - \bar {M} \approx {c_1}{\delta _\gamma }$$
$${\bar {H}^*} \approx \bar {H}_{0}^{*}+{c_2}{\delta _\gamma }$$

where \({c}_{1}\), \({c}_{2}>0\) are method and dataset-dependent proportionality constants:

$$MU{I_{kig}} \approx \left( {\bar {H}_{0}^{*}+{c_2}{\delta _\gamma }} \right){c_1}{\delta _\gamma }$$

In the near-ideal regime \({{\updelta}}_{{\upgamma}}\ll 1\).

$$MU{I_{kig}} \approx \bar {H}_{0}^{*}{c_1}{\delta _\gamma }.$$

This indicates that MUI increases linearly with covariance or anisotropy mismatch in the near-ideal regime.

Stronger mismatch from stationarity, anisotropy, or variogram stability will produce nonlinear propagation in both entropy and structural inconsistency.

Appendix B.3 Simulation-based methods

Simulation-based methods (e.g., sequential Gaussian simulation and conditional smoothing) generate realization ensembles intrinsically to represent conditional spatial variability. Since the realizations are generated from a Gaussian process, the variability in method outputs is generally constrained within a certain range.

Let \({\rho}_{cond}\in \left[\text{0,1}\right]\) denote the effective conditioning imposed by data and model constraints, the realizations are expected to exhibit high entropy due to the inherent stochastic nature of the method, but the structural agreement (mutual information) remains stable unless conditioning becomes weak.

$${\bar {H}^*} \to high$$

Under well-conditioned simulation:

$$\bar {M} \approx {\rho _{cond}},{\rho _{cond}} \to 1 \Rightarrow MU{I_{sim}} \approx {\bar {H}^*}{c_3}\left( {1 - {\rho _{cond}}} \right)$$

Under weak conditioning (sparse data or unstable covariance):

$${\rho _{cond}} \downarrow \Rightarrow MU{I_{sim}} \uparrow$$

Thus, for simulation-based methods, MUI is controlled by the extent to which dispersion is accompanied by loss of structural coherence, rather than dispersion alone.

Appendix B.4 Bayesian methods

Bayesian approaches generate realizations through posterior sampling governed by a prior–likelihood structure.

Let \({{\updelta}}_{{\uppi}}\) denote the effective mismatch between prior assumptions and data-supported structure.

Under prior–data consistency:

$$\bar {M} \to 1,~{\bar {H}^*} \to \bar {H}_{0}^{*}$$

For small mismatch:

$$1 - \bar {M}={c_4}{\delta _\pi },~~{\bar {H}^*} \approx \bar {H}_{0}^{*}+{c_5}{\delta _\pi } \Rightarrow MU{I_{bayes}} \approx \left( {\bar {H}_{0}^{*}+{c_5}{\delta _\pi }} \right){c_4}{\delta _\pi }$$

For weak mismatch, \({\delta}_{\pi}\ll 1\), this reduces to.

$$MU{I_{bayes}} \approx \bar {H}_{0}^{*}{c_4}{\delta _\pi }$$

This shows that MUI increases when posterior realizations lose structural coherence due to prior–data inconsistency or insufficient data support.

Appendix B.5 Data-Driven or Machine-Learning Methods

In data-driven methods, spatial structure is learned implicitly from data rather than imposed through explicit covariance models.

Let \({\delta}_{fit}\) denote instability arising from training sensitivity, feature dependence, or overfitting.

Under stable training:

$$\bar {M} \to 1,~{\bar {H}^*} \to \bar {H}_{0}^{*}$$

Under instability:

$$1 - \bar {M} \approx {c_6}{\delta _{fit}},~~{\bar {H}^*} \approx \bar {H}_{0}^{*}+{c_7}{\delta _{fit}} \Rightarrow MU{I_{ML}} \approx \left( {\bar {H}_{0}^{*}+{c_7}{\delta _{fit}}} \right){c_6}{\delta _{fit}}$$

Thus, MUI scales with training instability and sensitivity to perturbations in data representation.

Appendix B.6 Scope of the approximations

These method-conditioned heuristic approximations derived from the unified MUI formulation provide a structured interpretation of how estimation mechanisms influence entropy and mutual information. Conversely, the use of MUI under methods assumptions helps identify whether a method is well-suited to the data spatial structure.

Their role is interpretive and diagnostic: to guide understanding of MUI behavior across method classes and to support practical evaluation in technical and commercial settings.

Appendix C: Preprocessing and MUI interaction

Preprocessing alters the statistical and spatial structure of input data and therefore influences MUI through both entropy and mutual information components. Changes in MUI under preprocessing should be interpreted as the combined effect of improved method–data alignment and potential modification of underlying variability.

Appendix C.1 Clustering and denoising of geological data

Let \(X=\left\{{x}_{1}, {x}_{2},...{x}_{n}\right\}\) be the set of samples with grade \(g\left(x\right)\).

Clustering reduces intra-group variance and compresses the range of discretized states.

For a realization ensemble\(\left\{{s}_{r}\right(u\left)\right\}\), the empirical distribution \({\widehat{p}}_{u}\left(k\right)\) is:

$${\hat {p}_u}\left( k \right)=\frac{1}{R}\sum\limits_{{r=1}}^{R} {1\left\{ {{s_r}\left( u \right)=k} \right\}}$$
$${\hat {p}_u}\left( k \right) \to peaked \Rightarrow H\left( u \right) \downarrow$$

This leads to a reduction in entropy and consequently in MUI.

However, excessive clustering or outlier removal may suppress genuine geological heterogeneity, artificially stabilizing the realization ensemble. In such cases, reduced MUI reflects information loss rather than improved structural consistency.

Appendix C.2 Spatial discretization and mesh segmentation

Spatial discretization defines the support over which probability distributions are constructed. Let \(r\) denote grid resolution. Changes in \(r\) affect empirical state distributions:

Finer grids → reducing the variance within each cell → lowering entropy.

Coarser grids → increased aggregation → higher entropy.

Thus:

$$MU{I_{new}}=\frac{{\sum\nolimits_{{u \in U}} {{H_u}} }}{{logK}}\left( {1 - \bar {M}} \right)$$

This dependence reflects scale conditioning rather than methodological deficiency.

Excessively fine discretization under limited sampling density may produce artificially low entropy by isolating sparse observations, while overly coarse discretization may inflate entropy by aggregating heterogeneous regions. Both cases distort the interpretation of method-output stability.

Appendix C.3 Covariance and model calibration

Adjusting covariance structures in kriging-based methods can alter the model’s response to spatial data variations. Overfitting covariance parameters to minimize MUI can result in smoother estimates with reduced variability, which misrepresents geological heterogeneity.

The mutual information \(I({X}_{r};{X}_{r{\prime}})\) between realizations r and r’ can be impacted by the optimization of covariance parameters:

$$I\left( {{X_r};{X_{r'}}} \right)=H\left( {{X_r}} \right)+H\left( {{X_{r'}}} \right) - H\left( {{X_r},{X_{r'}}} \right)$$

Overfitting covariance parameters may reduce marginal entropy but also reduce joint consistency, potentially lowering \(\bar{M}\) in a non-intuitive manner.

Thus, minimizing MUI through parameter tuning does not necessarily imply improved model validity; it may reflect over-smoothing or reduced structural variability.

Appendix C.4 Interpretation

Changes in MUI induced by preprocessing reflect two possibilities, improved compatibility between method assumptions and data structure, or suppression or distortion of inherent variability. Therefore, reductions in MUI should not be interpreted as direct evidence of reduced uncertainty. Instead, MUI should be evaluated jointly with preprocessing choices to distinguish genuine stabilization from artificially induced consistency.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

He, X., Huang, Y., Wu, B. et al. Diagnosing method conditioned bias in mineral resource estimation using a mutual information and entropy uncertainty indicator. Sci Rep (2026). https://doi.org/10.1038/s41598-026-53627-9

Download citation

  • Received: 06 January 2026

  • Accepted: 13 May 2026

  • Published: 20 May 2026

  • DOI: https://doi.org/10.1038/s41598-026-53627-9

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • Mineral reserve estimation
  • Methodological bias
  • Mutual information
  • Interpolation uncertainty
  • Method stability
  • Technical review
  • Due diligence
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Subjects
  • Follow us on Facebook
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • About Scientific Reports
  • Contact
  • Journal policies
  • Guide to referees
  • Calls for Papers
  • Editor's Choice
  • Journal highlights
  • Open Access Fees and Funding

Publish with us

  • For authors
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Reports (Sci Rep)

ISSN 2045-2322 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics