Diagnosing method conditioned bias in mineral resource estimation using a mutual information and entropy uncertainty indicator

He, Xiaoqing; Huang, Yuhan; Wu, Bin; Ao, Yingchun; Jiao, Haijun

doi:10.1038/s41598-026-53627-9

Download PDF

Article
Open access
Published: 20 May 2026

Diagnosing method conditioned bias in mineral resource estimation using a mutual information and entropy uncertainty indicator

Xiaoqing He¹,
Yuhan Huang¹,
Bin Wu²,
Yingchun Ao³ &
…
Haijun Jiao⁴

Scientific Reports (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

Abstract

Conventional evaluation of mineral resource interpolation emphasizes predictive accuracy and reporting compliance, but these criteria do not directly characterize the stability of estimation outputs with respect to method choice. This study introduces an Mutual Information and Entropy Based Interpolation Uncertainty Indicator (MUI) as a diagnostic measure of method-conditioned instability. MUI is defined from normalized entropy and normalized mutual information over controlled realization ensembles, and evaluates the coherence and stability of interpolation outputs under a prescribed realization protocol. In the case study, comparable multi-realization ensembles were constructed for inverse distance weighting (IDW) and ordinary kriging (OK) for Cu estimation in the skarn and hornfels domains of a heterogeneous porphyry-skarn deposit. The results show that OK produced a less coherent and less stable realization ensemble than IDW, with the contrast most evident in skarn. At the same time, conventional benchmarking against infill drilling data indicates that both methods remained comparatively accurate in both skarn and hornfels. These results show that realization stability and interpolation accuracy describe different aspects of estimation behavior. MUI therefore provides a useful complement to conventional validation by capturing method-dependent differences that standard accuracy-based frameworks do not directly express. Although MUI does not explicitly quantify total methodological uncertainty, it can serve as a practical diagnostic of potential method-induced bias. Interpreted together with infill-drilling benchmarking and constrained resampling, the results further provide a safety-oriented assessment of interpolation behavior in terms of accuracy, robustness, and explainability.

Reservoir quality and its controlling diagenetic factors in the Bentiu Formation, Northeastern Muglad Basin, Sudan

Article Open access 16 September 2021

Evolution of strain field and crack prediction in cemented paste backfill specimens based on digital image correlation and computer vision recognition model

Article Open access 28 March 2025

Petrogenesis and geodynamics of the Early Miocene Kashan granitoids in the central Urumieh-Dokhtar Magmatic Arc, Iran

Article Open access 11 May 2026

Funding

This research was supported by National Natural Science Foundation of China (No.42073038).

Author information

Authors and Affiliations

China University of Geosciences Beijing, Beijing, 100083, China
Xiaoqing He & Yuhan Huang
Norinmining Co., Ltd., Beijing, 100053, China
Bin Wu
China National Gold Group Co., Ltd., Beijing, 100010, China
Yingchun Ao
Tibet Huatailong Mining Development Co., Ltd., Tibet Lhasa, 850000, China
Haijun Jiao

Authors

Xiaoqing He
View author publications
Search author on:PubMed Google Scholar
Yuhan Huang
View author publications
Search author on:PubMed Google Scholar
Bin Wu
View author publications
Search author on:PubMed Google Scholar
Yingchun Ao
View author publications
Search author on:PubMed Google Scholar
Haijun Jiao
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Yuhan Huang.

Ethics declarations

Ethics approval and consent to participate

This study did not involve human participants or animals.

Patient consent

Not applicable, as this study did not involve patients.

Permission to reproduce material from other sources

No materials from other sources were reproduced in this study.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A: MUI calculation and realization generation

Appendix A.1 Multi-realization construction of target methods

Spatial sample data are defined over the estimation domain, and a set of candidate interpolation or geostatistical modeling methods is defined for evaluation. Each method is treated as a distinct estimator whose compatibility with the statistical-spatial structure of the data is to be assessed.

For each method, a realization ensemble is constructed under a fixed realization protocol. Deterministic methods generate realizations through controlled perturbations of parameters, data subsets, or initialization conditions, whereas stochastic methods produce realizations intrinsically through simulation procedures. All realizations are mapped onto a common spatial support to ensure comparability.

The choice of spatial grid (support) influences the sensitivity of entropy-based measures and is therefore treated as a fixed condition throughout the analysis.

Appendix A.2 discrete state representation and entropy

Continuous outputs are transformed into a finite set of discrete states to enable consistent estimation of information-theoretic quantities.

Let ${X}_{r}\left(u\right)$ denote the value at location $u$ for realization $r$, and let $S=\left\{1,...,K\right\}$ denote the set of discrete states obtained through a predefined discretization scheme (e.g., equal-width binning). The discretized state is defined as:

$${s_r}\left( u \right)=\emptyset \left( {{X_r}\left( u \right)} \right)~ \in ~S$$

(1)

where $\varnothing (.)$ is the discretization operator applied consistently across all realizations. For each spatial location $u$, the empirical probability distribution over state is estimated from the realization ensemble:

$${\hat {p}_u}\left( k \right)=\frac{1}{R}\sum\limits_{{r=1}}^{R} {1\left\{ {{s_r}\left( u \right)=k} \right\},~k \in ~S}$$

(2)

The local Shannon entropy at location $u$ is defined as:

$$H\left( u \right)= - \sum\limits_{{k=1}}^{K} {{{\hat {p}}_u}\left( k \right)log{{\hat {p}}_u}\left( k \right)}$$

(3)

To enable comparability across different discretization choices, entropy is normalized:

$${H^*}\left( u \right)=\frac{{H\left( u \right)}}{{logK}}$$

(4)

The domain-averaged normalized entropy is then:

$${\bar {H}^*}=\frac{1}{{\left| U \right|}}\mathop \sum \limits_{{u \in U}} {H^*}\left( u \right)$$

(5)

This quantity represents the overall level of state dispersion across realizations.

Appendix A.3 Mutual information across realizations

To quantify structural agreement, mutual information is computed between realization fields. Let ${X}_{r}$ and ${X}_{r{\prime}}$ denote two realizations over the domain. Their joint empirical distribution is defined as:

$${\hat {p}_{r,r'}}\left( {k,l} \right)=\frac{1}{{\left| U \right|}}\sum\limits_{{u \in U}} {1\left\{ {{s_r}\left( u \right)=k,~{s_{r'}}\left( u \right)=l} \right\}}$$

(6)

The marginal distributions are:

$${\hat {p}_r}\left( k \right)=\frac{1}{{\left| U \right|}}\sum\limits_{u} {1\left\{ {{s_r}\left( u \right)=k} \right\},} ~~{\hat {p}_{r'}}\left( l \right)=\frac{1}{{\left| U \right|}}\sum\limits_{u} {1\left\{ {{s_{r'}}\left( u \right)=l} \right\}}$$

(7)

The mutual information between realizations $r,r^{\prime}$ is:

$$I\left( {{X_r};{X_{r'}}} \right)=\sum\limits_{{k=1}}^{K} {\sum\limits_{{l=1}}^{K} {{{\hat {p}}_{r,r'}}\left( {k,l} \right)log\frac{{{{\hat {p}}_{r,r'}}\left( {k,l} \right)}}{{{{\hat {p}}_r}\left( k \right){{\hat {p}}_{r'}}\left( l \right)}}} }$$

(8)

To ensure comparability, mutual information is normalized:

$$M\left( {r,r'} \right)=\frac{{I\left( {{X_r};{X_{r'}}} \right)}}{{max\left\{ {H\left( {{X_r}} \right),H\left( {{X_{r'}}} \right)} \right\}}}$$

(9)

where

$$H\left( {{X_r}} \right)= - \sum\limits_{k} {{{\hat {p}}_r}\left( k \right)log{{\hat {p}}_r}\left( k \right)}$$

(10)

This normalization ensures $0\le M(r,r^{\prime})\le 1$ and provides a consistent scale for comparing structural agreement across realizations. While alternative normalization schemes exist, this formulation is adopted for stability and interpretability in discretized empirical settings.

The average normalized mutual information across all realization pairs is:

$$\bar {M}=\frac{2}{{R\left( {R - 1} \right)}}\sum\limits_{{r \leqslant r'}} {M\left( {r,r'} \right)}$$

(11)

Appendix A.4 Definition of the mutual information-entropy-based interpolation uncertainty indicator (MUI)

Based on the two complementary components defined above, The Mutual Information-Entropy-Based Interpolation Uncertainty Indicator (MUI) is defined as:

$$MUI={\bar {H}^*}\left( {1 - \bar {M}} \right)$$

(12)

Based on the definition, $0\le MUI\le 1$.

A.5 Limiting Cases

If realizations are identical: ${\overline{H}}^{*}=0, \bar{M}=1 \Rightarrow MUI=0$;

If realizations are highly dispersed but structurally consistent:

$${\bar {H}^{*~}}\;high,\bar {M}\;high \Rightarrow MUI\;moderate,$$

If realizations are both dispersed and structurally inconsistent:

$${\bar {H}^*}\;high,~\bar {M}\;~low~ \Rightarrow ~MUI\;high;$$

MUI is conditional on discretization, spatial support, and realization design.

A higher MUI value indicates weaker compatibility between method assumptions and data characteristics under the prescribed realization protocol. It should therefore be interpreted as a diagnostic signal of potential method-induced bias, rather than as a direct measurement of methodological bias. Conversely, a lower MUI reflects stronger convergence among realizations and higher method-data compatibility.

Appendix B Method-Conditioned MUI Behavior (Heuristic Approximate)

This appendix provides method-conditioned, first-order approximations of MUI behavior under simplified assumptions. These expressions are not closed-form derivations but serve as interpretive tools grounded in the formal definition:

$$MUI={\bar {H}^*}\left( {1 - \bar {M}} \right)$$

All method-specific approximations in this appendix are derived by examining how a given method class tends to affect the normalized state-dispersion and the normalized structural-consistency. The use of MUI under this assumption helps identify when a method is well-suited to the data spatial structure.

Appendix B.1 Geometry-based methods

Geometry-based methods (e.g., nearest-neighbor, triangulation and polygon partitioning) are assumed to have uncertainty mainly arising from geometric interpretation.

However, the methods do not generate realizations intrinsically. Realization ensembles are constructed through perturbations of sample configuration or partition boundaries.

Under stable partitioning, state assignments remain invariant (Simplified conditions of spatially uniform sampling and stable partition geometry):

$${\hat {p}_u}\left( k \right) \to 1 \Rightarrow 1~~{\bar {H}^*} \to 0,~\bar {M} \Rightarrow 1 \Rightarrow ~MU{I_{geo}} \to ~0$$

Under irregular sampling or unstable boundaries (clustered sampling pattern, irregular boundaries, or local support is poorly constrained):

$${\bar {H}^*} \uparrow ,~\bar {M} \downarrow \Rightarrow MU{I_{geo}} \uparrow$$

Also, under these conditions, a first-order heuristic approximation can be interpreted as:

$$MU{I_{geo}} \propto {\Delta _{perturbation}} \cdot {\Delta _{clustering~or~geometry}}$$

MUI scales with geometric sensitivity to data configuration and partition stability.

Appendix B.2 Variogram-based and kriging methods

Variogram-based methods (e.g., simple kriging, ordinary kriging, indicator kriging) impose an explicit covariance structure on the data.

Realizations are constructed through perturbations of variogram parameters, search neighborhoods, or data subsets.

Let ${\delta}_{\gamma}$ denote the mismatch between the covariance model and the spatial organization of the data.

Under well-specified conditions small ${\delta}_{\gamma}$:

$$\bar {M} \to 1,~{\bar {H}^*} \to \bar {H}_{0}^{*}$$

${\overline{H}}_{0}^{*}$ is the baseline normalized entropy under well-aligned conditions.

For small mismatch, first-order approximations can be written as:

$$1 - \bar {M} \approx {c_1}{\delta _\gamma }$$

$${\bar {H}^*} \approx \bar {H}_{0}^{*}+{c_2}{\delta _\gamma }$$

where ${c}_{1}$, ${c}_{2}>0$ are method and dataset-dependent proportionality constants:

$$MU{I_{kig}} \approx \left( {\bar {H}_{0}^{*}+{c_2}{\delta _\gamma }} \right){c_1}{\delta _\gamma }$$

In the near-ideal regime ${{\updelta}}_{{\upgamma}}\ll 1$.

$$MU{I_{kig}} \approx \bar {H}_{0}^{*}{c_1}{\delta _\gamma }.$$

This indicates that MUI increases linearly with covariance or anisotropy mismatch in the near-ideal regime.

Stronger mismatch from stationarity, anisotropy, or variogram stability will produce nonlinear propagation in both entropy and structural inconsistency.

Appendix B.3 Simulation-based methods

Simulation-based methods (e.g., sequential Gaussian simulation and conditional smoothing) generate realization ensembles intrinsically to represent conditional spatial variability. Since the realizations are generated from a Gaussian process, the variability in method outputs is generally constrained within a certain range.

Let ${\rho}_{cond}\in \left[\text{0,1}\right]$ denote the effective conditioning imposed by data and model constraints, the realizations are expected to exhibit high entropy due to the inherent stochastic nature of the method, but the structural agreement (mutual information) remains stable unless conditioning becomes weak.

$${\bar {H}^*} \to high$$

Under well-conditioned simulation:

$$\bar {M} \approx {\rho _{cond}},{\rho _{cond}} \to 1 \Rightarrow MU{I_{sim}} \approx {\bar {H}^*}{c_3}\left( {1 - {\rho _{cond}}} \right)$$

Under weak conditioning (sparse data or unstable covariance):

$${\rho _{cond}} \downarrow \Rightarrow MU{I_{sim}} \uparrow$$

Thus, for simulation-based methods, MUI is controlled by the extent to which dispersion is accompanied by loss of structural coherence, rather than dispersion alone.

Appendix B.4 Bayesian methods

Bayesian approaches generate realizations through posterior sampling governed by a prior–likelihood structure.

Let ${{\updelta}}_{{\uppi}}$ denote the effective mismatch between prior assumptions and data-supported structure.

Under prior–data consistency:

$$\bar {M} \to 1,~{\bar {H}^*} \to \bar {H}_{0}^{*}$$

For small mismatch:

$$1 - \bar {M}={c_4}{\delta _\pi },~~{\bar {H}^*} \approx \bar {H}_{0}^{*}+{c_5}{\delta _\pi } \Rightarrow MU{I_{bayes}} \approx \left( {\bar {H}_{0}^{*}+{c_5}{\delta _\pi }} \right){c_4}{\delta _\pi }$$

For weak mismatch, ${\delta}_{\pi}\ll 1$, this reduces to.

$$MU{I_{bayes}} \approx \bar {H}_{0}^{*}{c_4}{\delta _\pi }$$

This shows that MUI increases when posterior realizations lose structural coherence due to prior–data inconsistency or insufficient data support.

Appendix B.5 Data-Driven or Machine-Learning Methods

In data-driven methods, spatial structure is learned implicitly from data rather than imposed through explicit covariance models.

Let ${\delta}_{fit}$ denote instability arising from training sensitivity, feature dependence, or overfitting.

Under stable training:

$$\bar {M} \to 1,~{\bar {H}^*} \to \bar {H}_{0}^{*}$$

Under instability:

$$1 - \bar {M} \approx {c_6}{\delta _{fit}},~~{\bar {H}^*} \approx \bar {H}_{0}^{*}+{c_7}{\delta _{fit}} \Rightarrow MU{I_{ML}} \approx \left( {\bar {H}_{0}^{*}+{c_7}{\delta _{fit}}} \right){c_6}{\delta _{fit}}$$

Thus, MUI scales with training instability and sensitivity to perturbations in data representation.

Appendix B.6 Scope of the approximations

These method-conditioned heuristic approximations derived from the unified MUI formulation provide a structured interpretation of how estimation mechanisms influence entropy and mutual information. Conversely, the use of MUI under methods assumptions helps identify whether a method is well-suited to the data spatial structure.

Their role is interpretive and diagnostic: to guide understanding of MUI behavior across method classes and to support practical evaluation in technical and commercial settings.

Appendix C: Preprocessing and MUI interaction

Preprocessing alters the statistical and spatial structure of input data and therefore influences MUI through both entropy and mutual information components. Changes in MUI under preprocessing should be interpreted as the combined effect of improved method–data alignment and potential modification of underlying variability.

Appendix C.1 Clustering and denoising of geological data

Let $X=\left\{{x}_{1}, {x}_{2},...{x}_{n}\right\}$ be the set of samples with grade $g\left(x\right)$.

Clustering reduces intra-group variance and compresses the range of discretized states.

For a realization ensemble$\left\{{s}_{r}\right(u\left)\right\}$, the empirical distribution ${\widehat{p}}_{u}\left(k\right)$ is:

$${\hat {p}_u}\left( k \right)=\frac{1}{R}\sum\limits_{{r=1}}^{R} {1\left\{ {{s_r}\left( u \right)=k} \right\}}$$

$${\hat {p}_u}\left( k \right) \to peaked \Rightarrow H\left( u \right) \downarrow$$

This leads to a reduction in entropy and consequently in MUI.

However, excessive clustering or outlier removal may suppress genuine geological heterogeneity, artificially stabilizing the realization ensemble. In such cases, reduced MUI reflects information loss rather than improved structural consistency.

Appendix C.2 Spatial discretization and mesh segmentation

Spatial discretization defines the support over which probability distributions are constructed. Let $r$ denote grid resolution. Changes in $r$ affect empirical state distributions:

Finer grids → reducing the variance within each cell → lowering entropy.

Coarser grids → increased aggregation → higher entropy.

Thus:

$$MU{I_{new}}=\frac{{\sum\nolimits_{{u \in U}} {{H_u}} }}{{logK}}\left( {1 - \bar {M}} \right)$$

This dependence reflects scale conditioning rather than methodological deficiency.

Excessively fine discretization under limited sampling density may produce artificially low entropy by isolating sparse observations, while overly coarse discretization may inflate entropy by aggregating heterogeneous regions. Both cases distort the interpretation of method-output stability.

Appendix C.3 Covariance and model calibration

Adjusting covariance structures in kriging-based methods can alter the model’s response to spatial data variations. Overfitting covariance parameters to minimize MUI can result in smoother estimates with reduced variability, which misrepresents geological heterogeneity.

The mutual information $I({X}_{r};{X}_{r{\prime}})$ between realizations r and r’ can be impacted by the optimization of covariance parameters:

$$I\left( {{X_r};{X_{r'}}} \right)=H\left( {{X_r}} \right)+H\left( {{X_{r'}}} \right) - H\left( {{X_r},{X_{r'}}} \right)$$

Overfitting covariance parameters may reduce marginal entropy but also reduce joint consistency, potentially lowering $\bar{M}$ in a non-intuitive manner.

Thus, minimizing MUI through parameter tuning does not necessarily imply improved model validity; it may reflect over-smoothing or reduced structural variability.

Appendix C.4 Interpretation

Changes in MUI induced by preprocessing reflect two possibilities, improved compatibility between method assumptions and data structure, or suppression or distortion of inherent variability. Therefore, reductions in MUI should not be interpreted as direct evidence of reduced uncertainty. Instead, MUI should be evaluated jointly with preprocessing choices to distinguish genuine stabilization from artificially induced consistency.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

He, X., Huang, Y., Wu, B. et al. Diagnosing method conditioned bias in mineral resource estimation using a mutual information and entropy uncertainty indicator. Sci Rep (2026). https://doi.org/10.1038/s41598-026-53627-9

Download citation

Received: 06 January 2026
Accepted: 13 May 2026
Published: 20 May 2026
DOI: https://doi.org/10.1038/s41598-026-53627-9

Subjects

Abstract

Similar content being viewed by others

Reservoir quality and its controlling diagenetic factors in the Bentiu Formation, Northeastern Muglad Basin, Sudan

Evolution of strain field and crack prediction in cemented paste backfill specimens based on digital image correlation and computer vision recognition model

Petrogenesis and geodynamics of the Early Miocene Kashan granitoids in the central Urumieh-Dokhtar Magmatic Arc, Iran

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Patient consent

Permission to reproduce material from other sources

Competing interests

Additional information

Publisher’s note

Appendices

Appendices

Appendix A: MUI calculation and realization generation

Appendix A.1 Multi-realization construction of target methods

Appendix A.2 discrete state representation and entropy

Appendix A.3 Mutual information across realizations

Appendix A.4 Definition of the mutual information-entropy-based interpolation uncertainty indicator (MUI)

A.5 Limiting Cases

Appendix B Method-Conditioned MUI Behavior (Heuristic Approximate)

Appendix B.1 Geometry-based methods

Appendix B.2 Variogram-based and kriging methods

Appendix B.3 Simulation-based methods

Appendix B.4 Bayesian methods

Appendix B.5 Data-Driven or Machine-Learning Methods

Appendix B.6 Scope of the approximations

Appendix C: Preprocessing and MUI interaction

Appendix C.1 Clustering and denoising of geological data

Appendix C.2 Spatial discretization and mesh segmentation

Appendix C.3 Covariance and model calibration

Appendix C.4 Interpretation

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links