Introduction

There is a vast amount of literature about the Born or Rytov approximation/series for solving the Helmholtz equation that arises in many fields of research. Applications include studies of weakly scattering media1,2,3, seismic waveforms in geophysics4,5, and optics in highly scattering media6,7. In this work, we focus on the latter, particularly on diffuse optics of biological tissues, where these two approximations have been applied for linearizing the integral equations obtained for solving the diffusion equation (DE) (the DE can always be reduced to a Helmholtz equation by a change of variables6). These integral equations link the source of perturbation (e.g., a change in the optical properties) to the change in fluence rate inside the medium or the change in exiting flux (or reflectance) measured by an optical detector on the surface of the medium. A linearization of these equations allows for a straightforward inversion of the integral equations for recovering the change in the optical properties. This is a fundamental step also for nonlinear methods of spatial reconstruction. Two major applications in diffuse optics of biological tissues are near infrared spectroscopy (NIRS) [or diffuse optical tomography (DOT)]6,8,9 and fluorescence imaging10,11. The advantages of the Rytov over the Born approximation have been discussed in several works6,8,9. They can be associated with the fact that while the Born approximation is linear in the change of the optical properties, the Rytov approximation contains higher powers of these changes, and for this reason it can describe some nonlinear effects.

More recently, with the advent of graphic processing units (GPU), Monte Carlo (MC) simulations have become increasingly important for simulating light propagation in tissues12,13,14. MC simulations allow for solving the radiative transfer equation (RTE)15 for any geometry and spatial distribution of the optical properties without approximations. Due to the increased speed in computation of GPU, MC simulations can be efficiently used as a forward problem solver embedded in inversion procedures16. Also, MC simulations allow for the fast computation of sensitivity functions (i.e., Jacobians) under general conditions of validity of the RTE16. In this regard, a coupled forward-adjoint method has been proposed for improving the variance of the sensitivity functions obtained with conventional methods17,18. For the case of sensitivity to absorption changes, the conventional methods consist of mean pathlength calculations based on the intersection of detected photon trajectories with a region of interest (for which the Jacobian is calculated). In fact, for media verifying the microscopic Beer–Lambert law, the Jacobian of both the fluence rate and the reflectance with respect to the absorption coefficient of a region of interest have the physical meaning of mean pathlength of “detected photons” in the region of interest (see for example Eq. (11) in section "A random multivariate approach to perturbation theory of photon transport"). The coupled forward-adjoint method is rigorous and can be used in total generality within the validity of the RTE. However, due to the prohibitive memory requirement for typical problems in NIRS and DOT, the method has been used only for applications where one considers relatively short source-detector separations (less than about 5 mm) where small tissue volumes are probed (e.g., skin)17. For NIRS-DOT applications, whenever diffusion conditions are satisfied, a coupled forward-adjoint method based on the Rytov approximation has been proposed for the computation of Jacobians with Monte Carlo simulations19,20. If the measurement of interest is the fluence rate at a field point (\(\mathbf{r}\)) due to a source point at \({\mathbf{r}}_{0}\), under the Rytov approximation, the continuous wave (CW) Jacobian for localized absorption perturbations with volume \(dV\) at location \({\mathbf{r}}^{\mathbf{{\prime}}}\), is obtained from the computation of three Green’s functions: (1) the Green’s function of the fluence rate at \({\mathbf{r}}^{\mathbf{{\prime}}}\) due to a point source at \({\mathbf{r}}_{0}\), \({G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\mathbf{{\prime}}}\right)\); (2) the Green’s function of the fluence rate at the field point \(\mathbf{r}\) due to a point source at \({\mathbf{r}}^{\mathbf{{\prime}}}\), \({G}_{{\phi }_{0}}\left({\mathbf{r}}^{\mathbf{{\prime}}}\to \mathbf{r}\right)\); (3) the Green’s function of the fluence rate at the field point \(\mathbf{r}\) due to a point source at \({\mathbf{r}}_{0}\), \({G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)\). For fluence rate calculations, the use of the reciprocity relationship (\({G}_{{\phi }_{0}}\left({\mathbf{r}}^{\mathbf{{\prime}}}\to \mathbf{r}\right)={G}_{{\phi }_{0}}\left(\mathbf{r}\to \mathbf{r}\mathbf{^{\prime}}\right)\)) makes this approach particularly effective. In fact, the Jacobian at all voxels requires only two MC simulations: (1) the direct simulation defined by placing an isotropic point source at \({\mathbf{r}}_{0}\) to calculate \({G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\prime}\right)\) for all \({\mathbf{r}}^{\prime}\); (2) an adjoint simulation obtained by placing a point source at \(\mathbf{r}\) to calculate \({G}_{{\phi }_{0}}\left(\mathbf{r}\to {\mathbf{r}}^{\prime}\right)\) for all \({\mathbf{r}}^{\mathbf{{\prime}}}\). The Jacobian is obtained by \(\frac{{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\prime}\right){G}_{{\phi }_{0}}\left(\mathbf{r}\to {\mathbf{r}}^{\prime}\right)}{{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)}dV\), where \(dV\) is the volume of the absorption perturbation at \({\mathbf{r}}^{\mathbf{{\prime}}}\). In this case, the only hypothesis needed for using the Rytov approximation is that diffusion conditions are satisfied. However, we notice that by using MC-based Green’s functions of the fluence rate, we are considering a hybrid method that can be effective for the calculation of Jacobians for a wide range of the baseline optical properties; most likely more effective than for the case when DE based Green’s functions are used.

For the calculation of Jacobians for reflectance measurements, the last two steps of the Jacobian calculation for the fluence rate must be modified in the following way21: (2a) the reflectance per unit source power calculated at the detector (\({\mathbf{r}}_{b}\)) due to a point source at the voxel \({\mathbf{r}}^{\mathbf{{\prime}}}\), \({R}_{0}\left({\mathbf{r}}^{\mathbf{{\prime}}}\to {\mathbf{r}}_{b}\right)\); (3a) the reflectance per unit source power calculated at the detector due to a point source at \({\mathbf{r}}_{0}\), \({R}_{0}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)\). In this case, the Jacobian is obtained as \(\frac{{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\prime}\right){R}_{0}\left({\mathbf{r}}^{\mathbf{{\prime}}}\to {\mathbf{r}}_{b}\right)}{{R}_{0}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)}dV\). In step (2a) a generalized reciprocity relationship based on the properties of RTE17,22 can be used to replace \({R}_{0}\left({\mathbf{r}}^{\mathbf{{\prime}}}\to {\mathbf{r}}_{b}\right)\) with the calculation of the Green’s function of the fluence rate at \({\mathbf{r}}^{\prime}\) due to an adjoint source at \({\mathbf{r}}_{b}\) (i.e.,\({G}_{{\phi }_{0}}\left({\mathbf{r}}_{b}\to {\mathbf{r}}^{\prime}\right))\), which is specified by the detector’s position, size, and emittance angle (which must match the acceptance angle in the direct calculation; see section "Relationship between the correct forward-adjoint method and the present work")17. This calculation should also consider the refractive index mismatch between diffusive and outer medium. Again, this allows for the calculation of the Jacobian at all voxels with only two simulations. Adjoint method calculations for reflectance measurements are also known in diffusion theory (DT) by use of a Robin source6. Also, in most literature where the DE is used23,24,25,26 the Jacobians for reflectance and fluence rate measurements are considered equivalent because of the adoption (not always openly disclosed) of the partial current boundary condition (PCBC)6,15: \({R}_{0}\left(\mathbf{r}\to {\mathbf{r}}_{b}\right)=\frac{{G}_{{\phi }_{0}}\left(\mathbf{r}\to {\mathbf{r}}_{b}\right)}{2A\left({n}_{r}\right)}\), where \(A\left({n}_{r}\right)\) is a constant that depends on the local refractive index mismatch (\({n}_{r}\)) between diffusive and outer medium at \({\mathbf{r}}_{b}\).

In this work, we investigate the calculation of Jacobians for absorbing perturbations, by showing several comparisons using both DT and MC simulations. MC simulations are used for both the calculation of fluence rate in the Rytov approximation, and for the calculation of the mean pathlengths for reflectance measurements (i.e., the correct Jacobians) in regions of interest. The main purpose of this work is to test, for a wide range of optical properties, the performance of the Rytov approximation for the calculation of Jacobians when the measurement of interest is the reflectance. A good part of our effort is dedicated to pointing out the physical meaning of the Rytov and Born approximations. This viewpoint provides a more intuitive picture on the photon transport process. For this purpose, we reframe the general perturbative approach to photon migration by using a definition of the fluence rate based on a random multivariate probability density function derived from the microscopic Beer–Lambert law (section "A random multivariate approach to perturbation theory of photon transport"). The concept of fluence rate partial pathlength moment is introduced and an explicit formula for a general moment is provided under diffusion conditions. The Born and Rytov approximations of the DE are introduced in section "Born and Rytov approximations and their physical meaning". It is immediately recognized how these expressions depend on the fluence rate partial pathlength (\({\langle {l}_{i}\rangle }_{\phi }\)) in a region of interest. In section "Calculation of reflectance based on Born and Rytov approximations" we provide an original formula for the reflectance under the Rytov approximation. The difference between the fluence rate and the reflectance partial pathlengths (\({\langle {l}_{i}\rangle }_{R}\)) are discussed more in detail. In section "Relationship between the correct forward-adjoint method and the present work" we briefly review the rigorous coupled forward-adjoint method17 and we provide the links to the present work. In sections "Diffusion theory calculations” and "Monte Carlo simulations” the details of diffusion and MC calculations are provided, respectively. The results are described in section "Results". Section "Comparison of MC and diffusion theory results for the calculation of reflectance pathlengths" is dedicated to the comparison of reflectance partial pathlengths obtained independently with two MC codes and with DT. Section "Comparison of MCX and diffusion theory results for the calculation of fluence rate and reflectance pathlengths" is dedicated to the comparison of the Jacobians obtained with the Rytov approximation for both DT and MC calculations. The reflectance partial pathlength obtained with MC (the true parameter of interest) is also shown for comparison. In section "Comparison of fluence rate pathlengths in presence of a highly absorbing thin layer" the same comparisons of section "Comparison of MCX and diffusion theory results for the calculation of fluence rate and reflectance pathlengths" are shown for a medium containing a highly absorbing thin layer. Section "Calculations of reflectance perturbation due to a strong localized absorption change" shows two examples of reflectance change for a wide range of absorption change that spans the linear and nonlinear regime. In this section we compare MC and DT calculations under the Born, Rytov and higher order (up to the fourth order) Born series. In section "Discussions" the results are discussed, and in section "Conclusions and future work” we summarize the main results in the conclusions.

Theory

A random multivariate approach to perturbation theory of photon transport

The physical basis of this section is the microscopic Beer–Lambert law21,27,28 (mBLL) which allows for a random multivariate framework of photon transport in a medium with an arbitrary spatial distribution of the scattering and absorption coefficients. This is the approach used in Monte Carlo methods for solving the RTE15. Here we replicate the method used in our previous publication21 but instead of focusing on the reflectance at a boundary point, we focus on the fluence rate at an arbitrary point inside a turbid medium. The theory is described for the continuous-wave domain (CW) but it can be developed also for time-domain and frequency-domain.

We divide a medium (of arbitrary non-concave geometry) into \(N\) regions (of arbitrary shape) characterized by absorption and scattering coefficients \({\mu }_{ai}, {\mu }_{si} (i=\text{1,2},...,N)\), respectively. When light is emitted by a point source at \({\mathbf{r}}_{0}\), the fluence rate at a point \(\mathbf{r}\) can be defined by using a random multivariate probability density function (pdf) \({f}_{0}\left({l}_{1},{l}_{2},\dots ,{l}_{N}\right)\) normalized such that:

$${G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)={\int }_{{\mathbb{R}}^{N+}}{f}_{0}\left({l}_{1},{l}_{2},\dots ,{l}_{N}\right)d{l}_{1}d{l}_{2}\cdots d{l}_{N} $$
(1)

where \({G}_{{\phi }_{0}}\) is the Green’s function of the fluence rate (\(\left[{G}_{{\phi }_{0}}\right]=\text{L}^{-2}\)), and \({f}_{0}\left({l}_{1},{l}_{2},\dots ,{l}_{N}\right)\) is the pdf (\(\left[{f}_{0}\right]=\text{L}^{\text{-N}-2}\)) that a photon emitted at \({\mathbf{r}}_{0}\) and found at \(\mathbf{r}\) spends the pathlengths \({l}_{1},{l}_{2},\dots ,{l}_{N}\) in the regions \(\text{1,2},...,N\), respectively. Note that strictly speaking \({f}_{0}\) is not a pdf, since it is not normalized to “1”. However, this fact is irrelevant for developing our arguments. For conciseness, in Eq. (1), we consider the dependence of \({f}_{0}\) on \({\mathbf{r}}_{0}\) and \(\mathbf{r}\) as implicit. For an arbitrary perturbation of the absorption coefficient \({\Delta \mu }_{ai}(i=1,...,N)\), based on the microscopic Beer–Lambert law27, the new pdf will be:

$${f}_{p}\left({l}_{1},{l}_{2},\dots ,{l}_{N}\right)={f}_{0}\left({l}_{1},{l}_{2},\dots ,{l}_{N}\right){e}^{-{\sum }_{i=1}^{N}{l}_{i}{\Delta \mu }_{ai}}$$
(2)

The integration of \({f}_{p}\) in \({\mathbb{R}}^{N+}\) yields \({G}_{{\phi }_{p}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)\), i.e., the Green’s function of the perturbed fluence rate:

$${G}_{{\phi }_{p}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)={\int }_{{\mathbb{R}}^{N+}}{f}_{p}\left({l}_{1},{l}_{2},\dots ,{l}_{N}\right)d{l}_{1}d{l}_{2}...d{l}_{N}$$
(3)

By using a multivariate Taylor expansion of the exponential, we obtain:

$${G}_{{\phi }_{p}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)={G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)\sum_{{k}_{1},{k}_{2},\dots ,{k}_{N}=0}^{\infty }{\left(-1\right)}^{{k}_{1}+{k}_{2}+\dots {+k}_{N}}\frac{{\langle {l}_{1}^{{k}_{1}}{l}_{2}^{{k}_{2}}\dots {l}_{N}^{{k}_{N}}\rangle }_{\phi }}{{k}_{1}!{k}_{2}!\dots {k}_{N}!}{{\Delta \mu }_{a1}}^{{k}_{1}}{{\Delta \mu }_{a2}}^{{k}_{2}}\dots {{\Delta \mu }_{aN}}^{{k}_{N}}$$
(4)

where \({\langle {l}_{1}^{{k}_{1}}{l}_{2}^{{k}_{2}}\dots {l}_{N}^{{k}_{N}}\rangle }_{\phi }\) is the mixed pathlength moment of order \({k}_{1}+{k}_{2}+\dots +{k}_{N}\) defined by the relation:

$${\langle {l}_{1}^{{k}_{1}}{l}_{2}^{{k}_{2}}\dots {l}_{N}^{{k}_{N}}\rangle }_{\phi }=\frac{{\int }_{{\mathbb{R}}^{N+}}{l}_{1}^{{k}_{1}}{l}_{2}^{{k}_{2}}...{l}_{N}^{{k}_{N}}{f}_{0}\left({l}_{1},{l}_{2},\dots ,{l}_{N}\right)d{l}_{1}d{l}_{2}\dots d{l}_{N}}{{\int }_{{\mathbb{R}}^{N+}}{f}_{0}\left({l}_{1},{l}_{2},\dots ,{l}_{N}\right)d{l}_{1}d{l}_{2}\dots d{l}_{N}}$$
(5)

The mixed pathlength moment of order \({k}_{1}+{k}_{2}+\dots +{k}_{N}\) can be also defined as:

$${\langle {l}_{1}^{{k}_{1}}{l}_{2}^{{k}_{2}}\dots {l}_{N}^{{k}_{N}}\rangle }_{\phi }=\frac{{\left(-1\right)}^{{k}_{1}+{k}_{2}+\dots {+k}_{N}}}{{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)}\frac{{\partial }^{{k}_{1}+{k}_{2}+\dots {+k}_{N}}{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)}{{\partial \mu }_{{a}_{1}}^{{k}_{1}}\dots {\partial \mu }_{{a}_{N}}^{{k}_{N}}}$$
(6)

We can find an expression for the moments by solving the perturbative CW RTE21 which relates the change of absorption \(\Delta {\mu }_{a}\left(\mathbf{r}\right)\) to the change in the radiance between an initial (\({\mathfrak{L}}_{0}\)) and a perturbed state (\({\mathfrak{L}}_{p}\)), \(\Delta \mathfrak{L}\left(\mathbf{r},\widehat{\mathbf{s}}\right)={\mathfrak{L}}_{p}\left(\mathbf{r},\widehat{\mathbf{s}}\right)-{\mathfrak{L}}_{0}\left(\mathbf{r},\widehat{\mathbf{s}}\right)\):

$$\widehat{\mathbf{s}}\cdot \nabla \left[\Delta \mathfrak{L}\left(\mathbf{r},\widehat{\mathbf{s}}\right)\right]+{\mu }_{t}\left(\mathbf{r}\right)\Delta \mathfrak{L}\left(\mathbf{r},\widehat{\mathbf{s}}\right)={\mu }_{s}\left(\mathbf{r}\right){\int }_{4\pi }p\left(\widehat{\mathbf{s}}\boldsymbol{^{\prime}}\to \widehat{\mathbf{s}}\right)\Delta \mathfrak{L}\left(\mathbf{r},\widehat{\mathbf{s}}\boldsymbol{^{\prime}}\right)d{\widehat{\mathbf{s}}}^{\boldsymbol{{\prime}}}-\Delta {\mu }_{a}\left(\mathbf{r}\right){\mathfrak{L}}_{p}\left(\mathbf{r},\widehat{\mathbf{s}}\right)$$
(7)

where \(\mathfrak{L}\left(\mathbf{r},\widehat{\mathbf{s}}\right)\) is the radiance, i.e., the number of photons travelling along the direction \(\widehat{\mathbf{s}}\) per unit time, per unit area orthogonal to \(\widehat{\mathbf{s}}\), per unit solid angle (\(\left[\mathfrak{L}\right]=\text{L}^{-2}\text{Sr}^{-1}\text{T}^{-1}\)), which depends on both spatial (\(\mathbf{r}\)) and angular (\(\widehat{\mathbf{s}}\)) variables; \(p\left(\widehat{\mathbf{s}}\boldsymbol{^{\prime}}\to \widehat{\mathbf{s}}\right)\) is the phase function (\(\left[p\right]=\text{Sr}^{-1}\)). Also, \({\mu }_{t}={\mu }_{a}+{\mu }_{s}\) where, \({\mu }_{t}\), \({\mu }_{s}\) and \({\mu }_{a}\) are the extinction, scattering and absorption coefficients, respectively. If the RTE for the initial and perturbed state have a source term \(S\left(\mathbf{r},\widehat{\mathbf{s}}\right)=\delta \left(\mathbf{r}-{\mathbf{r}}_{0}\right)\delta \left(\widehat{\mathbf{s}}-{\widehat{\mathbf{s}}}_{0}\right)\), the solution of Eq. (7) is known as the Dyson equation for the Green’s function of the radiance29:

$${G}_{{\mathfrak{L}}_{p}}\left({\mathbf{r}}_{0},{\widehat{\mathbf{s}}}_{0}\to \mathbf{r},\widehat{\mathbf{s}}\right)={G}_{{\mathfrak{L}}_{0}}\left({\mathbf{r}}_{0},{\widehat{\mathbf{s}}}_{0}\to \mathbf{r},\widehat{\mathbf{s}}\right)-{\int }_{V}d\mathbf{r}\mathbf{^{\prime}}{\int }_{4\pi }\Delta {\mu }_{a}\left(\mathbf{r}\mathbf{^{\prime}}\right){G}_{{\mathfrak{L}}_{p}}\left({\mathbf{r}}_{0},{\widehat{\mathbf{s}}}_{0}\to \mathbf{r}\mathbf{^{\prime}},\widehat{\mathbf{s}}\boldsymbol{^{\prime}}\right){G}_{{\mathfrak{L}}_{0}}\left(\mathbf{r}\mathbf{^{\prime}},\widehat{\mathbf{s}}\boldsymbol{^{\prime}}\to \mathbf{r},\widehat{\mathbf{s}}\right)d{\widehat{\mathbf{s}}}^{\boldsymbol{{\prime}}}$$
(8)

where \({G}_{\mathfrak{L}}\left({\mathbf{r}}_{0},{\widehat{\mathbf{s}}}_{0}\to \mathbf{r},\widehat{\mathbf{s}}\right)\) is the Green’s function of the radiance that is found at \(\mathbf{r}\) in the direction \(\widehat{\mathbf{s}}\) when light is emitted at \({\mathbf{r}}_{0}\) in the direction \({\widehat{\mathbf{s}}}_{0}\) (\(\left[{G}_{\mathfrak{L}}\right]=\text{L}^{-2}\text{Sr}^{-1}\)). We note that the same method can also be used for the perturbative DE21,30.

The solution of the Dyson equation is expressed by the Neumann series31,32,33. Each order of the Neumann series can be integrated over the solid angle \(\widehat{\mathbf{s}}\) which provides the corresponding order of approximation for \({G}_{{\phi }_{p}}\left({\mathbf{r}}_{0},{\widehat{\mathbf{s}}}_{0}\to \mathbf{r}\right)\) (i.e., the Green’s function of the fluence rate at \(\mathbf{r}\)). Comparison with Eq. (4) allows one to find the expression of the moments. We will provide the expression for a general mixed moment for the Dyson equation of the DE (see Eq. 13). For the RTE, it will suffice to find the first order moment, which corresponds to the first order of the Neumann series. For an isotropic source \(S\left(\mathbf{r},\widehat{\mathbf{s}}\right)=\frac{1}{4\pi }\delta \left(\mathbf{r}-{\mathbf{r}}_{0}\right)\), the corresponding Green’s function for the radiance will be \({G}_{{\mathfrak{L}}_{iso}}\left({\mathbf{r}}_{0}\to \mathbf{r},\widehat{\mathbf{s}}\right)=\frac{1}{4\pi }{\int }_{4\pi }{G}_{\mathfrak{L}}\left({\mathbf{r}}_{0},\widehat{\mathbf{s}}\boldsymbol{^{\prime}}\to \mathbf{r},\widehat{\mathbf{s}}\right)d{\widehat{\mathbf{s}}}^{\boldsymbol{{\prime}}}\). For a uniform change of absorption in the region \(V\), the first order term of the Neumann series yields (from Eq. 8):

$${G}_{{\mathfrak{L}}_{p,iso}}\left({\mathbf{r}}_{0}\to \mathbf{r},\widehat{\mathbf{s}}\right)={G}_{{\mathfrak{L}}_{0,iso}}\left({\mathbf{r}}_{0}\to \mathbf{r},\widehat{\mathbf{s}}\right)-\Delta {\mu }_{a}{\int }_{V}d\mathbf{r}\mathbf{^{\prime}}{\int }_{4\pi }{G}_{{\mathfrak{L}}_{0,iso}}\left({\mathbf{r}}_{0}\to \mathbf{r}\mathbf{^{\prime}},\widehat{\mathbf{s}}\boldsymbol{^{\prime}}\right){G}_{{\mathfrak{L}}_{0}}\left(\mathbf{r}\mathbf{^{\prime}},\widehat{\mathbf{s}}\boldsymbol{^{\prime}}\to \mathbf{r},\widehat{\mathbf{s}}\right)d{\widehat{\mathbf{s}}}^{\boldsymbol{{\prime}}}$$
(9)

Integration of Eq. (9) over the angular variable \(\widehat{\mathbf{s}}\) yields:

$$\Delta {G}_{\phi }\left({\mathbf{r}}_{0}\to \mathbf{r}\right)=-\Delta {\mu }_{a}{\int }_{V}d\mathbf{r}\mathbf{^{\prime}}{\int }_{4\pi }d\widehat{{\varvec{s}}}{\int }_{4\pi }{G}_{{\mathfrak{L}}_{0,iso}}\left({\mathbf{r}}_{0}\to \mathbf{r}\mathbf{^{\prime}},\widehat{\mathbf{s}}\boldsymbol{^{\prime}}\right){G}_{{\mathfrak{L}}_{0}}\left(\mathbf{r}\mathbf{^{\prime}},\widehat{\mathbf{s}}\boldsymbol{^{\prime}}\to \mathbf{r},\widehat{\mathbf{s}}\right)d{\widehat{\mathbf{s}}}^{\boldsymbol{{\prime}}}$$
(10)

where \(\Delta {G}_{\phi }\left({\mathbf{r}}_{0}\to \mathbf{r}\right)={G}_{{\phi }_{p}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)-{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)\).

By using Eqs. (6) and (10) we obtain:

$$ \left\langle {l_{i} } \right\rangle _{\phi } = \mathop {\lim }\limits_{{\Delta \mu _{{ai \to 0}} }} - \frac{1}{{G_{{\phi _{0} }} \left( {{\mathbf{r}}_{0} \to {\mathbf{r}}} \right)}}\frac{{\Delta G_{\phi } \left( {{\mathbf{r}}_{0} \to {\mathbf{r}}} \right)}}{{\Delta \mu _{{ai}} }} = \frac{{\mathop \int \nolimits_{{V_{i} }} d{\mathbf{r^{\prime}}}\mathop \int \nolimits_{{4\pi }} d{\hat{\boldsymbol{s}}}\mathop \int \nolimits_{{4\pi }} G_{{\mathfrak{L}}_{{0,iso}} }} \left( {{\mathbf{r}}_{0} \to {\mathbf{r^{\prime}}},{\mathbf{\hat{s}}}^{\prime} } \right)G_{{{\mathfrak{L}}_{0} }} \left( {{\mathbf{r^{\prime}}},{\mathbf{\hat{s}}}^{\prime} \to {\mathbf{r}},{\mathbf{\hat{s}}}} \right)d{\mathbf{\hat{s}}}^{{{\prime}}}}{{G_{{\phi _{0} }} \left( {{\mathbf{r}}_{0} \to {\mathbf{r}}} \right)}} $$
(11)

where \({\langle {l}_{i}\rangle }_{\phi }\) is the average pathlength spent in the region of interest when photons are emitted from \({\mathbf{r}}_{0}\) and later found at \(\mathbf{r}\). We name this type of partial pathlength as fluence rate partial pathlength. It is also the absorption Jacobian of the logarithmic Green’s function, \(\text{ln}\left(\frac{{G}_{\phi }\left({\mathbf{r}}_{0}\to \mathbf{r}\right)}{{\Psi }_{ref}}\right)\), where \({\Psi }_{ref}\) is an arbitrary reference intensity per unit source power (\(\left[{\Psi }_{ref}\right]=\text{L}^{-2}\)). If we can assume that under certain conditions of highly diffusive media \({G}_{{\mathfrak{L}}_{0}}\left({\mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\to \mathbf{r},\widehat{\mathbf{s}}\right)\approx {G}_{{\mathfrak{L}}_{0},iso}\left({\mathbf{r}}^{\boldsymbol{{\prime}}}\to \mathbf{r},\widehat{\mathbf{s}}\right)\), or \({G}_{{\mathfrak{L}}_{0,iso}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\right)\approx \frac{1}{4\pi }{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\mathbf{^{\prime}}\right)\) (or both) then we can decouple the two Green’s functions in Eq. (11) and carry out the two angular integrations separately. In this way we obtain:

$${\langle {l}_{i}\rangle }_{\phi }=\frac{{\int }_{{V}_{i}}{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\boldsymbol{{\prime}}}\right){G}_{{\phi }_{0}}\left({\mathbf{r}}^{\boldsymbol{{\prime}}}\to \mathbf{r}\right)d{\mathbf{r}}^{\boldsymbol{{\prime}}}}{{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)}$$
(12)

We shall see in the next section that this expression is also obtained from the Rytov approximation for the DE [We note that more appropriately in DT \({G}_{{\mathfrak{L}}_{0,iso}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\right)\approx \frac{1}{4\pi }{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\mathbf{{\prime}}}\right)+\frac{3}{4\pi }{\mathbf{J}}_{0}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\mathbf{{\prime}}}\right)\cdot {\widehat{\mathbf{s}}}^{\prime}\), where \({\mathbf{J}}_{0}\) is the net flux vector15. If we plug this expression into Eq. (11) and we make no assumption for \({G}_{{\mathfrak{L}}_{0}}\left({\mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\to \mathbf{r},\widehat{\mathbf{s}}\right),\) we will arrive to Eq. (12) with the additional term \(-\frac{3}{4\pi {G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)}{\int }_{{V}_{i}}{\mathbf{J}}_{0}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\mathbf{{\prime}}}\right)\cdot {\mathbf{J}}_{0}\left(\mathbf{r}\to {\mathbf{r}}^{\mathbf{{\prime}}}\right)d{\mathbf{r}}^{\boldsymbol{{\prime}}}\) on the right side. This term is supposed to be negligible with respect to the fluence term].

When diffusion conditions are met, we can solve the perturbative diffusion equation to calculate \({G}_{{\phi }_{p}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)\) by use of the Neumann series. The general expression of the mixed pathlength moment of order \({n=k}_{1}+{k}_{2}+..+{k}_{m}\) involving \(m\) regions (\(m\le N\)) is given by Sassaroli et al.21:

$${\langle {l}_{1}^{{k}_{1}}{l}_{2}^{{k}_{2}}...{l}_{m}^{{k}_{m}}\rangle }_{\phi }=\frac{{k}_{1}!{k}_{2}!\dots {k}_{m}!}{{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)}{\sum }_{\wp \left({i}_{1},{i}_{2},\dots ,{i}_{n}\right)}{\int }_{{V}_{{i}_{1}}}{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{{i}_{1}}\right){G}_{{\phi }_{0}}\left({\mathbf{r}}_{{i}_{1}}\to {\mathbf{r}}_{{i}_{2}}\right)d{\mathbf{r}}_{{i}_{1}}{\int }_{{V}_{{i}_{2}}}{G}_{{\phi }_{0}}\left({\mathbf{r}}_{{i}_{2}}\to {\mathbf{r}}_{{i}_{3}}\right)d{\mathbf{r}}_{{i}_{2}}\times \dots \times {\int }_{{V}_{{i}_{n-1}}}{G}_{{\phi }_{0}}\left({\mathbf{r}}_{{i}_{n-1}}\to {\mathbf{r}}_{{i}_{n}}\right)d{\mathbf{r}}_{{i}_{n-1}}{\int }_{{V}_{{i}_{n}}}{G}_{{\phi }_{0}}\left({\mathbf{r}}_{{i}_{n}}\to \mathbf{r}\right)d{\mathbf{r}}_{{i}_{n}}$$
(13)

where \(\wp \left({i}_{1},{i}_{2},\dots ,{i}_{n}\right)\) is the permutation of \(n\) integers with repetitions. The first \({k}_{1}\) integers \({i}_{1},{i}_{2},\dots ,{i}_{{k}_{1}}\) are assigned to region “1” (\({i}_{1}{=i}_{2}=\dots ={i}_{{k}_{1}}=1\)), the next \({k}_{2}\) indices are assigned to region “2” (\({i}_{{k}_{1}+1}{=i}_{{k}_{1}+2}=\dots ={i}_{{k}_{1}+{k}_{2}}=2\)), etc. Therefore, the number of permutations is \(\frac{n!}{{k}_{1}!{k}_{2}!\dots {k}_{m}!}\). Further, \({G}_{{\phi }_{0}}\left({\mathbf{r}}_{i}\to {\mathbf{r}}_{j}\right)\) is the Green’s function for the fluence rate for a source at \({\mathbf{r}}_{i}\) and a field point at \({\mathbf{r}}_{j}\). The mixed moment of Eq. (13) is a measure of the interaction between different points of the medium. It can be thought as proportional to the probability that a photon emitted at \({\mathbf{r}}_{0}\) and found at \(\mathbf{r}\) will visit \({k}_{1}\) points of region 1, \({k}_{2}\) points of region 2 etc. The normalization is provided by the probability that a photon emitted at \({\mathbf{r}}_{0}\) and found at \(\mathbf{r}\) will visit \(n\) points (\({n=k}_{1}+{k}_{2}+..+{k}_{m}\)) of the whole medium. Another way to rephrase the previous statement is in terms of the number of scattering events. We note that Eq. (12) is a particular case of Eq. (13), as expected. We also note that Eq. (13) is formally valid also for the moments obtained under the validity of the RTE, if we substitute \({G}_{{\phi }_{0}}\left({\mathbf{r}}_{{i}_{j}}\to {\mathbf{r}}_{{i}_{j+1}}\right)\) with \({G}_{{\mathfrak{L}}_{0}}\left({\mathbf{r}}_{{i}_{j}},{\widehat{\mathbf{s}}}_{{i}_{j}}\to {\mathbf{r}}_{{i}_{j+1}},{\widehat{\mathbf{s}}}_{{i}_{j+1}}\right)\) and integration is carried out over both spatial and angular domains.

In this work we choose to focus on the simple situation where one region is subject to a uniform absorption change \({\Delta \mu }_{ai}\). Therefore, from Eq. (4) we have:

$${G}_{{\phi }_{p}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)={G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)\sum_{k=0}^{\infty }{\left(-1\right)}^{k}\frac{{\langle {l}_{i}^{k}\rangle }_{\phi }}{k!}{{\Delta \mu }_{ai}}^{k}$$
(14)

where \({\langle {l}_{i}^{k}\rangle }_{\phi }\) is the self-moment of order \(k\), which is proportional to the probability that a photon emitted at \({\mathbf{r}}_{0}\) and found at \(\mathbf{r}\) visits \(k\) points of the region of interest. We note that the expression of \({\langle {l}_{i}^{k}\rangle }_{\phi }\) is included in the general expression of a mixed moment (Eq. 13). For reflectance calculations, \({R}_{p}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)\) has the same form of Eq. (14) but with \({G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)\) replaced by \({R}_{0}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)\), and \({\langle {l}_{i}^{k}\rangle }_{\phi }\) replaced by \({\langle {l}_{i}^{k}\rangle }_{R}\)21. The general expression of the mixed moment of Eq. (13) is also valid for reflectance calculations with the only substitution of the last integrand \({G}_{{\phi }_{0}}\left({\mathbf{r}}_{{i}_{n}}\to \mathbf{r}\right)\) with \({R}_{0}\left({\mathbf{r}}_{{i}_{n}}\to {\mathbf{r}}_{b}\right)\), where \({\mathbf{r}}_{b}\) is a boundary point where the detector is located.

Born and Rytov approximations and their physical meaning

Equation (14) is valid for arbitrary changes of the absorption coefficient, however, more commonly the perturbative diffusion equation is solved according to the Born or Rytov approximations, which assume small changes in the absorption coefficient. The Born approximation assumes that \(\Delta {G}_{\phi }\left({\mathbf{r}}_{0}\to \mathbf{r}\right)={G}_{{\phi }_{p}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)-{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)\ll {G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)\)15. In other words, the change in the Green’s function should be much smaller than the unperturbed value. This is equivalent to retaining the first order expansion of Eq. (14) (i.e., infinitesimal changes of the absorption coefficient):

$${G}_{{\phi }_{p}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)\approx {G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)\left[1-{\langle {l}_{i}\rangle }_{\phi }\Delta {\mu }_{ai}\right]$$
(15)

The Rytov approximation seeks a solution to the diffusion equation of the type:

$${G}_{{\phi }_{p}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)={G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right){e}^{-\psi \left({\mathbf{r}}_{0},\mathbf{r}\right)}$$
(16)

After some calculations (Supplementary Information S1) we arrive to the expression for the exponent:

$$\psi \left({\mathbf{r}}_{0},\mathbf{r}\right)=\frac{1}{{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)}{\int }_{{V}^{\prime}}{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\mathbf{^{\prime}}\right)\left\{{\Delta \mu }_{a}\left(\mathbf{r}\mathbf{^{\prime}}\right)-\left[D\left(\mathbf{r}\mathbf{^{\prime}}\right){\Vert \nabla \psi \left({\mathbf{r}}_{0},\mathbf{r}\mathbf{^{\prime}}\right)\Vert }^{2}\right]\right\}{G}_{{\phi }_{0}}\left({\mathbf{r}}^{\mathbf{{\prime}}}\to \mathbf{r}\right)d{\mathbf{r}}^{\mathbf{{\prime}}}$$
(17)

The assumption of the Rytov approximation is to neglect \(D\left(\mathbf{r}\mathbf{^{\prime}}\right){\Vert \nabla \psi \left({\mathbf{r}}_{0},\mathbf{r}\mathbf{^{\prime}}\right)\Vert }^{2}\) in the integrand, therefore we obtain (for \({\Delta \mu }_{a}\left(\mathbf{r}\mathbf{^{\prime}}\right)={\Delta \mu }_{ai}\)):

$$\psi \left({\mathbf{r}}_{0},\mathbf{r}\right)=\frac{{\Delta \mu }_{ai}}{{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)}{\int }_{{V}^{\prime}}{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\mathbf{^{\prime}}\right){G}_{{\phi }_{0}}\left(\mathbf{r}\mathbf{^{\prime}}\to \mathbf{r}\right)d{\mathbf{r}}^{\mathbf{{\prime}}}={\langle {l}_{i}\rangle }_{\phi }\Delta {\mu }_{ai}$$
(18)

The formula derived by the Rytov approximation for the exponent yields the product of the fluence rate partial pathlength and the change in the absorption coefficient.

Now we observe that Eq. (16) can be rewritten as:

$${G}_{{\phi }_{p}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)={G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right){e}^{-{\langle {l}_{i}\rangle }_{\phi }\Delta {\mu }_{ai}}={G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right){\sum }_{k=0}^{\infty }{\left(-1\right)}^{k}\frac{{\langle {l}_{i}\rangle }_{\phi }^{k}}{k!}{{\Delta \mu }_{ai}}^{k}$$
(19)

Three important considerations follow: (1) the Rytov approximation (unlike the Born approximation) is not linear with \(\Delta {\mu }_{ai}\), therefore it can describe some nonlinear effects due to absorption changes (see section "Calculations of reflectance perturbation due to a strong localized absorption change"); (2) by comparing Eqs. (19) and (14) the physical approximation inherent in the Rytov approximation is that \({\langle {l}_{i}^{k}\rangle }_{\phi }\) can be substituted by \({\langle {l}_{i}\rangle }_{\phi }^{k}\). This assumption is usually a poor one and leads to miscalculations for large values of \(\Delta {\mu }_{ai}\) (see section "Calculations of reflectance perturbation due to a strong localized absorption change"). (3) We can easily verify that the Born and Rytov approximations coincide when \({\Delta \mu }_{ai}\to 0\) i.e., when only the first term of Eq. (14) is retained.

In conclusion, the Rytov approximation has the natural physical interpretation that its exponent is the product of the fluence rate partial pathlength times the change of the absorption coefficient in a region of interest. Therefore Eq. (19) shows an elegant link between the Rytov approximation and Beer–Lambert law for clear media when a change of the absorption coefficient occurs in a region of the medium: \({I}_{d}={I}_{0}{e}^{-{L}_{i}\Delta {\mu }_{ai}},\) where \({I}_{0}\) and \({I}_{d}\) are the detected intensities before and after the absorption change (\(\Delta {\mu }_{ai}\)), respectively, and \({L}_{i}\) is the direct pathlength of photons crossing the region. We can see that they are formally the same equation with the substitution of \({L}_{i}\) with \({\langle {l}_{i}\rangle }_{\phi }\). Equation (19) also implies that for the calculation of perturbations of the fluence rate due to absorption changes, higher order terms of the series (Eq. 19) depend on \({\langle {l}_{i}\rangle }_{\phi }^{k}\) instead of \({\langle {l}_{i}^{k}\rangle }_{\phi }\) (Eq. 14).

Calculation of reflectance based on Born and Rytov approximations

Now we address the calculation of the reflectance under the Born and Rytov approximations by using Fick’s law: \({R}_{p}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)=-D\left({\mathbf{r}}_{b}\right)\nabla {{G}_{{\phi }_{p}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)}_{|\mathbf{r}={\mathbf{r}}_{b}}\cdot \widehat{\mathbf{n}}\left({\mathbf{r}}_{b}\right)\), where \(\widehat{\mathbf{n}}\left({\mathbf{r}}_{b}\right)\) is the unit outward vector at the boundary point \({\mathbf{r}}_{b}\). This means that we assume a detector at \({\mathbf{r}}_{b}\) with a 90° acceptance angle. When the fluence rate is obtained from the Born approximation (Eq. 15), by carrying out the gradient and considering the expression for \({\langle {l}_{i}\rangle }_{\phi }\) (Eq. 12) we obtain:

$${R}_{p}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)={R}_{0}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)\left[1-{\langle {l}_{i}\rangle }_{R}\Delta {\mu }_{ai}\right]$$
(20)

Therefore, the Born approximation for the reflectance has the same form as for the fluence rate, except the mean pathlength in the region of interest is considered for photons exiting the medium (i.e., \({\langle {l}_{i}\rangle }_{R}\)). The expression for \({\langle {l}_{i}\rangle }_{R}\) is easily obtained by applying Fick’s law to Eq. (15):

$${\langle {l}_{i}\rangle }_{R}=\frac{1}{{R}_{0}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)}{\int }_{{V}^{\prime}}{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\mathbf{{\prime}}}\right){R}_{0}\left({\mathbf{r}}^{\mathbf{{\prime}}}\to {\mathbf{r}}_{b}\right)d{\mathbf{r}}^{\mathbf{{\prime}}}$$
(21)

here, \({R}_{0}\left({\mathbf{r}}^{\mathbf{{\prime}}}\to {\mathbf{r}}_{b}\right)\) is the reflectance for a point source at \({\mathbf{r}}^{\mathbf{{\prime}}}\) calculated at \({\mathbf{r}}_{b}\). When the fluence rate is obtained from Rytov approximation, after some calculations (Supplementary Information S2) we obtain:

$${R}_{p}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)={R}_{0}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right){e}^{-\psi \left({\mathbf{r}}_{0},\mathbf{r}\right)}\left[1-\Delta {\mu }_{ai}\left({\langle {l}_{i}\rangle }_{R}-{\langle {l}_{i}\rangle }_{\phi }\right)\right]$$
(22)

where, \(\psi \left({\mathbf{r}}_{0},\mathbf{r}\right)\) is given by Eq. (18). Therefore, the Rytov approximation for the reflectance has the same form as for the fluence rate only if \({\langle {l}_{i}\rangle }_{R}={\langle {l}_{i}\rangle }_{\phi }\). Even though the expressions of these mean pathlengths are distinct, when \({\mathbf{r}=\mathbf{r}}_{b}\) their values can be very close to each other. In fact, for the calculation of \({\langle {l}_{i}\rangle }_{\phi }\) we consider the average pathlengths of photons emitted at \({\mathbf{r}}_{0}\) that cross the region of interest and are found at \({\mathbf{r}}_{b}\). A subset of these photons will exit the medium at the same point, and it can be anticipated that the average pathlength calculated from this subset (which is \({\langle {l}_{i}\rangle }_{R}\)) yields similar values in several situations of interest in diffuse optics. Even from the standpoint of MC simulations these two pathlengths have distinct definitions. If we consider a photon visiting a field point \(\mathbf{r}\) as “detected” in terms of fluence rate, the main difference between fluence rate and reflectance pathlengths is that a photon can be detected multiple times for fluence rate measurements, but only one time for reflectance measurements (Supplementary Information S3). These considerations also highlight, by directly reasoning with the physics of photon migration, the approximate nature of the PCBC used for solving DE. According to this boundary condition we would have \({\langle {l}_{i}\rangle }_{R}={\langle {l}_{i}\rangle }_{\phi }\), and the form of Rytov approximation for the reflectance perturbation would be the same as the Rytov approximation itself (Eq. 16). This is the most common approach adopted in the literature.

We also note that the expressions for the perturbed reflectance obtained under Born approximation (Eq. 20) and Rytov approximation (Eq. 22) coincide when \({\Delta \mu }_{ai}\to 0\). In fact, Eq. (22) can be rewritten as:

$$\text{ln}\left(\frac{{R}_{p}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)}{{R}_{0}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)}\right)=-\psi \left({\mathbf{r}}_{s},\mathbf{r}\right)+\text{ln}\left\{\left[1-\Delta {\mu }_{ai}\left({\langle {l}_{i}\rangle }_{R}-{\langle {l}_{i}\rangle }_{\phi }\right)\right]\right\}$$
(23)

When \({\Delta \mu }_{ai}\to 0\) and \({R}_{p}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)\to {R}_{0}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)\), by taking the Taylor expansion of the logarithms of Eq. (22) we have that:

$$\frac{{R}_{p}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)}{{R}_{0}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)}-1=-{\langle {l}_{i}\rangle }_{R}\Delta {\mu }_{ai}$$
(24)

which is Eq. (20). Of course, this is true also for the perturbed fluence rates obtained under the Born and Rytov approximations, which in the limit \({\Delta \mu }_{ai}\to 0\) must coincide.

Relationship between the correct forward-adjoint method and the present work

The main advantage of the coupled forward-adjoint method is to reduce the variance of the estimate of the absorption Jacobians in general situations where RTE is valid17,18. The method is also efficient because it allows one to provide a map of the absorption sensitivities in a region of interest by using only two MC simulations for each source-detector pair. The method relies on the reciprocity theorem in the RTE34 \({G}_{{\mathfrak{L}}_{0}}\left({\mathbf{r}}_{1},{\widehat{\mathbf{s}}}_{1}\to {\mathbf{r}}_{2},{\widehat{\mathbf{s}}}_{2}\right)={G}_{{\mathfrak{L}}_{0}}\left({\mathbf{r}}_{2},{-\widehat{\mathbf{s}}}_{2}\to {\mathbf{r}}_{1},{-\widehat{\mathbf{s}}}_{1}\right)\) which describes the reversibility of light paths. For a medium with varying refractive index the theorem has a modified form35. To see the importance of the adjoint method, let us consider a case where the physical quantity of interest is the fluence rate at a point \(\mathbf{r}\) (detector) when photons are emitted isotropically at \({\mathbf{r}}_{0}\) (source). The traditional method to calculate the absorption Jacobian at a region of interest is to propagate the photons from the source and calculate the pathlength travelled by each photon in the region of interest (\({\mathbf{r}}^{\prime}\)). This number is stored in memory and will be used only if a photon reaches the detector (\(\mathbf{r}\)). Therefore, depending on the details of the simulation (i.e., size of the region of interest, size of source and detector, source-detector distance, optical properties of the medium) much of the computation will not be used because only a small fraction of the photons that intersect the region of interest are detected. To alleviate this problem, we can use the reciprocity theorem in Eq. (11) which we rewrite as follows:

$${\langle {l}_{i}\rangle }_{\phi }=\frac{{\int }_{{V}_{i}}d{\mathbf{r}}^{\boldsymbol{{\prime}}}{\int }_{4\pi }d\widehat{{\varvec{s}}}{\int }_{4\pi }{G}_{{\mathfrak{L}}_{0,iso}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\right){G}_{{\mathfrak{L}}_{0}}\left({\mathbf{r},-\widehat{\mathbf{s}}\to \mathbf{r}}^{\boldsymbol{{\prime}}},-{\widehat{\mathbf{s}}}^{\prime}\right)d{\widehat{\mathbf{s}}}^{\boldsymbol{{\prime}}}}{{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)}=\frac{4\pi {\int }_{{V}_{i}}d{\mathbf{r}}^{\boldsymbol{{\prime}}}{\int }_{4\pi }{G}_{{\mathfrak{L}}_{0,iso}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\right){G}_{{\mathfrak{L}}_{0,iso}}\left({\mathbf{r}\to \mathbf{r}}^{\boldsymbol{{\prime}}},-{\widehat{\mathbf{s}}}^{\prime}\right)d{\widehat{\mathbf{s}}}^{\boldsymbol{{\prime}}}}{{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)}$$
(25)

and thus:

$${\langle {l}_{i}\rangle }_{\phi }=\frac{4\pi {\int }_{{V}_{i}}d{\mathbf{r}}^{\boldsymbol{{\prime}}}{\int }_{4\pi }{G}_{{\mathfrak{L}}_{0,iso}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\right){G}_{{\mathfrak{L}}_{0,iso}}^{\star }\left({\mathbf{r}\to \mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\right)d{\widehat{\mathbf{s}}}^{\boldsymbol{{\prime}}}}{{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)}$$
(26)

where \({G}_{{\mathfrak{L}}_{0,iso}}^{\star }\left({\mathbf{r}\to \mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\right)\) is the solution of the adjoint RTE equation for an isotropic point source at \(\mathbf{r}\)17,18. This equation governs the radiance when the roles of source and detectors are swapped. We can see from Eqs. (25) and (26) that we achieve our goal by multiplying two Green’s functions’ fields: one from the source at \({\mathbf{r}}_{0}\) to the region of interest at \({\mathbf{r}}^{\prime}\) in the direct simulation (\({G}_{{\mathfrak{L}}_{0,iso}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\right)\)), and the other from the detector at \(\mathbf{r}\) to the region of interest at \({\mathbf{r}}^{\prime}\) in the adjoint simulation (\({G}_{{\mathfrak{L}}_{0,iso}}^{\star }\left({\mathbf{r}\to \mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\right)\)). In this situation any photon that reaches the region of interest in the two simulations counts for the calculation of the Jacobian. We note that Eq. (26), apart from the normalization, has the same form as in the work of Gardner, 2014 Eq. (4)18. Also, with our approach, the interrogation density function proposed by Hayakawa, 2007 Eq. (5.1)17 has a straightforward physical interpretation as the ratio of partial over total mean pathlengths: \({\langle {l}_{i}\rangle }_{\phi }/{\langle L\rangle }_{\phi }\) and \({\langle {l}_{i}\rangle }_{R}/{\langle L\rangle }_{R}\) for fluence rate and reflectance measurements, respectively. These ratios are naturally normalized to 1 when we integrate over the whole medium. We note that for the calculation of \({\langle {l}_{i}\rangle }_{R}\) one must also consider the refractive index mismatch between the diffusive and the outer medium. The general formula for \({\langle {l}_{i}\rangle }_{R}\) for an isotropic point source at \({\mathbf{r}}_{0}\) is:

$${\langle {l}_{i}\rangle }_{R}=\frac{{\int }_{{V}_{i}}d{\mathbf{r}}^{\boldsymbol{{\prime}}}{\int }_{\Omega }\left[1-{r}_{io}\left(\widehat{\mathbf{s}}\cdot \widehat{\mathbf{n}}\right)\right]\widehat{\mathbf{s}}\cdot \widehat{\mathbf{n}}d\widehat{{\varvec{s}}}{\int }_{4\pi }{G}_{{\mathfrak{L}}_{0,iso}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\right){G}_{{\mathfrak{L}}_{0}}\left({\mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\to {\mathbf{r}}_{b},\widehat{\mathbf{s}}\right)d{\widehat{\mathbf{s}}}^{\boldsymbol{{\prime}}}}{{\int }_{\Omega }\left[1-{r}_{io}\left(\widehat{\mathbf{s}}\cdot \widehat{\mathbf{n}}\right)\right]{G}_{{\mathfrak{L}}_{0,iso}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b},\widehat{\mathbf{s}}\right)\widehat{\mathbf{s}}\cdot \widehat{\mathbf{n}}d\widehat{{\varvec{s}}}}$$
(27)

where \(\Omega \) is the acceptance angle of the detector in the medium, \(\widehat{\mathbf{n}}\) the unit vector directed outward to the medium at the location of the detector (\(\widehat{\mathbf{s}}\cdot \widehat{\mathbf{n}}\ge 0\)), and \({r}_{io}\) is the reflection coefficient for unpolarized light for photons propagating from the diffusive to the outer medium. Here for simplicity, we have considered a point detector (at \({\mathbf{r}}_{b}\)). Even for reflectance calculations a map of the absorption sensitivity can be obtained by using only a direct and an adjoint MC simulation. Before describing these two simulations, we note that in general the two angular integrals of Eq. (27) cannot be decoupled. Under diffusion conditions the decoupling is possible (see section "A random multivariate approach to perturbation theory of photon transport"), and Eq. (27) reduces to Eq. (21). In this situation the direct simulation is used for the calculation of \({G}_{{\mathfrak{L}}_{0,iso}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\right)\approx \frac{1}{4\pi }{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\mathbf{^{\prime}}\right)\) and the denominator of Eq. (27), i.e., \({R}_{0}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)\). The adjoint simulation is used for estimating \({R}_{0}\left({\mathbf{r}}^{\mathbf{{\prime}}}\to {\mathbf{r}}_{b}\right)\) (Eq. 21), which is obtained by the Green’s function \({G}_{{\mathfrak{L}}_{0}}\left({\mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\to {\mathbf{r}}_{b},\widehat{\mathbf{s}}\right)\) of Eq. (27), by considering that \({G}_{{\mathfrak{L}}_{0}}\left({\mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\to {\mathbf{r}}_{b},\widehat{\mathbf{s}}\right)\approx {G}_{{\mathfrak{L}}_{0,iso}}\left({\mathbf{r}}^{\boldsymbol{{\prime}}}\to {\mathbf{r}}_{b},\widehat{\mathbf{s}}\right)\). The appropriate adjoint simulation to calculate \({R}_{0}\left({\mathbf{r}}^{\mathbf{{\prime}}}\to {\mathbf{r}}_{b}\right)\) at all points \({\mathbf{r}}^{\mathbf{{\prime}}}\), is again obtained by using the reciprocity theorem: \({G}_{{\mathfrak{L}}_{0}}\left({\mathbf{r}}^{\boldsymbol{{\prime}}},{\widehat{\mathbf{s}}}^{\prime}\to {\mathbf{r}}_{b},\widehat{\mathbf{s}}\right)=\) \({G}_{{\mathfrak{L}}_{0}}\left({\mathbf{r}}_{b},-\widehat{\mathbf{s}}\to {\mathbf{r}}^{\boldsymbol{{\prime}}},-{\widehat{\mathbf{s}}}^{\prime}\right)\). Integration over \({\widehat{\mathbf{s}}}^{\prime}\) in Eq. (27) yields \({G}_{{\phi }_{0}}\left({\mathbf{r}}_{b},-\widehat{\mathbf{s}}\to {\mathbf{r}}^{\boldsymbol{{\prime}}}\right)\). Therefore, we can provide the following “recipe” for the adjoint MC simulation: we inject the photons in the medium at \({\mathbf{r}}_{b}\) isotropically within the acceptance angle \(\Omega \); each photon is assigned a weight \(w=\frac{1}{4\pi }\left[1-{r}_{io}\left(\widehat{\mathbf{s}}\cdot \widehat{\mathbf{n}}\right)\right]\widehat{\mathbf{s}}\cdot \widehat{\mathbf{n}}\), and its contribution to the fluence rate at \({\mathbf{r}}^{\boldsymbol{{\prime}}}\) is calculated. In the expression of \(w\) the effect of the refractive index mismatch is considered in the factor \({w}_{r}=\left[1-{r}_{io}\left(\widehat{\mathbf{s}}\cdot \widehat{\mathbf{n}}\right)\right]\). Alternatively, one can use random numbers to decide if an emitted photon is either reflected or transmitted in the medium. In both cases, the number of emitted photons is used for normalization. We point out that the adjoint simulation does not require the knowledge of the exact angular distribution of the detected radiance in the direct simulation, but only the knowledge of the spatial and angular features of the detector.

Methods

Both diffusion theory (DT) calculations and MC simulations were obtained for nine combinations of the optical properties in a semi-infinite medium geometry. The values of the absorption coefficient were: \({\mu }_{a}=\left(0.005, 0.01, 0.02\right){\text{ mm}}^{-1}\), while the values of the reduced scattering coefficient were: \({\mu ^{\prime}}_{s}=\left(0.5, 1, 1.5\right) {\text{ mm}}^{-1}\). The reduced albedo (\(\frac{{\mu }_{s}^{\prime}}{{\mu }_{s}^{\prime}+{\mu }_{a}}\)) covered the range (0.961, 0.997), where the first and last endpoints correspond to \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.02, 0.5\right){\text{ mm}}^{-1}\) and \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.005, 1.5\right){\text{ mm}}^{-1}\), respectively.

Diffusion theory calculations

The partial pathlength in a region of interest was calculated for exiting photons (here defined as reflectance pathlengths, \({\langle {l}_{i}\rangle }_{R}\)) or for photons crossing an observation point inside the medium or at its boundary (here defined as fluence rate pathlengths \({\langle {l}_{i}\rangle }_{\phi }\)). The formulas used for the two cases are:

$${\langle {l}_{i}\rangle }_{R}=\frac{{\int }_{V}{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\boldsymbol{{\prime}}}\right){R}_{0}\left({\mathbf{r}}^{\boldsymbol{{\prime}}}\to {\mathbf{r}}_{b}\right)d{\mathbf{r}}^{\boldsymbol{{\prime}}}}{{R}_{0}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)}$$
(28)

and

$${\langle {l}_{i}\rangle }_{\phi }=\frac{{\int }_{V}{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}^{\boldsymbol{{\prime}}}\right){G}_{{\phi }_{0}}\left({\mathbf{r}}^{\boldsymbol{{\prime}}}\to {\mathbf{r}}_{b}\right)d{\mathbf{r}}^{\boldsymbol{{\prime}}}}{{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)}$$
(29)

where \({\mathbf{r}}_{0}=\left({x}_{0},{y}_{0},{z}_{0}\right)=\left(\text{0,0},1/{\mu }_{s}^{\prime}\right)\), is the location of the point source, \({\mathbf{r}}^{\boldsymbol{{\prime}}}=\left(x^{\prime},y^{\prime},z^{\prime}\right)\) is a point inside the region of interest, \({\mathbf{r}}_{b}=\left(\rho ,\text{0,0}\right)\) is the position of the detector, and \(V\) is the region of interest. \({G}_{{\phi }_{0}}\left({\mathbf{r}}_{i}\to {\mathbf{r}}_{j}\right)\) is the Green’s function of the fluence rate for a point source at \({\mathbf{r}}_{i}\) and calculated at \({\mathbf{r}}_{j}\). \({R}_{0}\left({\mathbf{r}}_{i}\to {\mathbf{r}}_{b}\right)\) is obtained by Fick’s law: \({R}_{0}\left({\mathbf{r}}_{i}\to {\mathbf{r}}_{b}\right){=-D\nabla {G}_{{\phi }_{0}}\left({\mathbf{r}}_{i}\to \mathbf{r}\right)}_{|\mathbf{r}={\mathbf{r}}_{b}}\) (where, \(D=\frac{1}{\left(3{\mu }_{s}^{\prime}\right)}\) is the diffusion coefficient). Extrapolated boundary conditions (EBC) were used for the calculations of \({G}_{{\phi }_{0}}\) and \({R}_{0}\) in the semi-infinite medium geometry where the surface is orthogonal to \(\widehat{z}\) (Fig. 1). The solutions have been given in previous publications21,36. While the correct definition of fluence rate partial pathlength at the detector’s position implies that \({\mathbf{r}=\mathbf{r}}_{b}=(\rho ,0, 0)\) (see Fig. 1), for comparison with MC data we have also used \(\mathbf{r}=\left(\rho , 0, s/2\right)\) (where \(s/2=0.5 \text{ mm}\) corresponds to half a voxel size in the MC code MCX; see section "Monte Carlo simulations”) and \(\mathbf{r}=(\rho , 0, 1/{\mu ^{\prime}}_{s})\).

Fig. 1
figure 1

Schematic of the diffusive medium used, featuring one pencil beam source (S) and three detectors at the distance \(\rho =15, 25, 35 \text{ mm}\). In all the figures (except Figs. 7,8) the medium was homogeneous. For the Figs. 7,8 the medium contained a high-absorbing 1 mm thick layer. Two regions of interest were used: (1) a cube having a side of 6 mm (for the Figs. 2, 3, 9, 10); (2) a cube having a side of 1 mm (for the Figs. 4,5,6,7,8). For the calculation of Jacobians (Figs. 2, 3, 4,5,6,7, 8) the region of interest had the same optical properties of the bulk medium. For the forward problem calculations (Figs. 9, 10), the region of interest had a different absorption coefficient than the bulk medium. The outer and inner refractive indices are \({n}_{o}=1\) and \({n}_{i}=1\).4, respectively.

For the calculations of reflectance change for a wide range of localized absorption changes (see Eq. 14), higher order moments, here considered up to the fourth order, are needed. We used an approximate formula which we have proven to be effective and faster than calculating multiple volume integrals21:

$${\langle {l}_{i}^{n}\rangle }_{R}\approx {c}_{n-1}{\langle {l}_{i}\rangle }_{R}{\left[{\int }_{V}{G}_{{\phi }_{0,\infty }}\left(\left|{\mathbf{r}}_{c}-\mathbf{r}\mathbf{^{\prime}}\right| \right)d\mathbf{r}\mathbf{^{\prime}}\right]}^{n-1} n\ge 2$$
(30)

where, \({G}_{{\phi }_{0,\infty }}\left(\left|{\mathbf{r}}_{c}-\mathbf{r}\mathbf{^{\prime}}\right|\right)\) is the Green’s function of the DE in the infinite medium geometry, when the point source is at the center (\({\mathbf{r}}_{c}\)) of the region of interest. The factors \({c}_{n-1}\) (\(n=\text{2},\text{3,4}\)) are numeric quasi-constants (they are rather independent of the optical properties of the medium, the location and partly the size of the region of interest). The values of the numeric quasi-constant are: \({c}_{1}\approx 1.5, {c}_{2}\approx 3.5, {c}_{3}\approx 10\) (see Supplementary information S4). A schematic of the medium, the source-detector system and the regions of interest are shown in Fig. 1.

For the calculations of Eqs. (28), (29), the region of interest was divided in smaller voxels (1 mm side) and for comparison with MC results, we considered detectors of areas that matched those used in MC.

Monte Carlo simulations

We used two MC codes, one in FORTRAN37 and MCX (2020 release)12. The FORTRAN MC terminated when 100,000 photons were detected in each of the three detectors (Fig. 1). For MCX we launched 1 billion photons and detected a number of photons in the range \((5\times {10}^{4},4.6\times {10}^{5})\). Both MCs are white MC codes, i.e., the photons are injected in a non-absorbing medium and the distribution of the absorption coefficient is considered a posteriori by assigning to each detected photon a specific weight. Both MC codes were used for the calculation of \({\langle {l}_{i}\rangle }_{R}\) according to the formula:

$${\langle {l}_{i}\rangle }_{R\_MC}=\frac{{\sum }_{j=1}^{{n}_{total}}{l}_{ij} {e}^{-{\mu }_{ai}{l}_{ij}}{e}^{-{\mu }_{ao}{l}_{oj}}}{{\sum }_{j=1}^{{n}_{total}} {e}^{-{\mu }_{ai}{l}_{ij}}{e}^{-{\mu }_{ao}{l}_{oj}}}$$
(31)

where \({n}_{total}\) is the number of detected photons, \({\mu }_{ai}\) and \({\mu }_{ao}\) are the absorption coefficients inside and outside the region of interest, respectively, \({l}_{ij}\) and \({l}_{oj}\) are the pathlengths spent inside and outside the region of interest by the \({j}^{th}\) detected photon, respectively. Note that for the comparison of \({\langle {l}_{i}\rangle }_{R\_MC}\) obtained with the two MC codes, we have considered only the case \({\mu }_{ai}={\mu }_{ao}\). On the contrary, for the calculation of reflectance change, we have used both \({\mu }_{ai}\ge {\mu }_{ao}\) and \({\mu }_{ai}\le {\mu }_{ao}\). The comparison between the two MC codes was run for the case of isotropic scattering (asymmetry parameter \(g=0\)), and an outer and inner refractive index of \({n}_{o}=1\) and \({n}_{i}=1\).4, respectively. For both MCs we considered a pencil beam with the first scattering event occurring along the \(z\) axis. Also, we considered detectors of area \(\left(1, 4, 16\right) {\text{ mm}}^{2}\) at the source-detector separations \(\rho =15, 25, 35 \text{ mm}\), respectively.

Based on Eq. (29), MCX was also used for the calculation of \({\langle {l}_{i}\rangle }_{\phi }\) with the following caveats: (1) a pencil beam is used as a source at (0,0,0); the fluence rate calculated by this simulation provides an estimation of \({G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\mathbf{^{\prime}}\right)\) in Eq. (29); (2) an isotropic point source is placed in the middle of the first voxel \(\mathbf{r}=(\rho ,0,\text{s}/2),\) where \(\text{s}\) is the voxel size; this allows for the calculation of \({G}_{{\phi }_{0}}\left(\mathbf{r}\mathbf{^{\prime}}\to {\mathbf{r}}_{b}\right)\) by use of the reciprocity theorem with the approximation that \({\mathbf{r}}_{b}\approx (\rho ,0,\text{s}/2)\); (3) the normalization factor is obtained by the calculation of the fluence rate at \(\mathbf{r}=(\rho ,0,\text{s}/2)\) when the source is a pencil beam at (0,0,0). Steps 2) and 3) are also replaced by the following: a pencil beam is placed at the detector position \({\mathbf{r}}_{b}=(\rho ,0,0)\) and is used for evaluating \({G}_{{\phi }_{0}}\left(\mathbf{r}\to {\mathbf{r}}_{b}\right)\) of Eq. (29) by the reciprocity theorem. The normalization factor is the fluence rate from the pencil beam at (0,0,0) calculated at a point \(\mathbf{r}=(\rho ,\text{0,1}/{\mu ^{\prime}}_{s})\) (denominator of Eq. (29)). The reason for these two methods is because in one case we wanted to apply the correct Rytov approximation which for the adjoint simulation implies an isotropic source placed at the boundary. The best one can do is to place the source at the center of the first voxel. In the other case we were looking for a strategy to obtain the best match with the reflectance mean pathlengths. The MC simulations were carried out for \(g=0.9\). In all the fluence rate calculations, the fluence rates are calculated at the center of a \(1 {\text{ mm}}^{3}\) voxel and represent the average value inside the voxel. In the figures where MC calculations of \({\langle {l}_{i}\rangle }_{\phi }\) are shown, we also report \({\langle {l}_{i}\rangle }_{R\_MC}\) by the same MCX code. For the calculation of \({\langle {l}_{i}\rangle }_{R\_MC}\) the detector’s radius was \(0.45, 1.8 \text{ mm}\) at the source-detector distance of \(15\) and \(25 \text{ mm}\), respectively; the third receiver, at the distance of \(35 \text{ mm}\), had a radius of \(4.3 \text{ mm}\) for the highest reduced albedo and \(6.2 \text{ mm}\) for the other combinations. This was done to speed up the convergence of the calculation at the farthest source-detector separation. However, this choice affected the accuracy of the comparisons at this receiver (see discussion in section "Comparison of MCX and diffusion theory results for the calculation of fluence rate and reflectance pathlengths" and the supplementary information S5).

We also used MCX for comparison of \({\langle {l}_{i}\rangle }_{\phi }\) and \({\langle {l}_{i}\rangle }_{R\_MC}\) for a medium with a 1 mm thick highly absorbing layer (mimicking melanin) embedded in it (see Fig. 1). The optical properties of the bulk of the medium and the thin layer were \(\left({\mu }_{ab},{\mu }_{sb}^{\prime}\right)=\left(0.01, 1\right){\text{ mm}}^{-1}\) and \(\left({\mu }_{a,mel},{\mu }_{s,mel}^{\prime}\right)=\left(2, 1\right){ \text{ mm}}^{-1}\), respectively. The absorption of the thin layer can be representative of melanin in epidermis. The source-detector separation was \(\rho =35 \text{ mm}\) and the area of the detector \({1 \text{ mm}}^{2}\). The region of interest (for which the Jacobian is calculated) also in this case was a cube of side 1 mm (Fig. 1). To obtain good statistics we injected 130 billion photons.

Last, the FORTRAN MC was used for perturbation calculations with the geometry of Fig. 1 and with the features described at the beginning of this section. For each source-detector separation we calculated the change of reflectance due to a change of the absorption coefficient in a single cubic region (of side 6 mm) half-way between source and detector and at a depth of \(z=12 \text{ mm}\). The change of absorption coefficient inside the cube spanned the range \(\Delta {\mu }_{a}\in \left(-0.02, 0.04\right){\text{ mm}}^{-1}\).

Results

In section "Comparison of MC and diffusion theory results for the calculation of reflectance pathlengths" the comparison of MCX12 and the FORTRAN MC37 is carried out for the estimation of \({\langle {l}_{i}\rangle }_{R\_MC}\). Two cases are shown corresponding to the lowest and the highest reduced albedos. In the same plots \({\langle {l}_{i}\rangle }_{R}\) (Eq. 28) and \({\langle {l}_{i}\rangle }_{\phi }\)(Eq. 29) are also shown. In section "Comparison of MCX and diffusion theory results for the calculation of fluence rate and reflectance pathlengths" are shown the comparisons of \({\langle {l}_{i}\rangle }_{R\_MC}\) and \({\langle {l}_{i}\rangle }_{\phi }\), the latter obtained with both MCX and diffusion theory by using Eq. (29). In this section we also considered the optical properties relative to the lowest and highest reduced albedos, together with a case referred to typical optical properties, i.e., \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.01, 1\right){\text{ mm}}^{-1}\). In section "Comparison of fluence rate pathlengths in presence of a highly absorbing thin layer" we present the results for the medium with a thin highly absorbing layer. Finally, in section "Calculations of reflectance perturbation due to a strong localized absorption change" we present the results of perturbation of detected intensity caused by a large range of the absorption contrast between region of interest and background medium.

Comparison of MC and diffusion theory results for the calculation of reflectance pathlengths

In Fig. 2 we show the comparison of \({\langle {l}_{i}\rangle }_{R\_MC}\) obtained with the FORTRAN MC and MCX for the case of the cube of side 6 mm (Fig. 1). The calculations were run for \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.02, 0.5\right){ \text{ mm}}^{-1}\) (lowest reduced albedo). The results obtained with MCX are shown together with their errors obtained by running three independent simulations. In the same plots \({\langle {l}_{i}\rangle }_{R}\) and \({\langle {l}_{i}\rangle }_{\phi }\) obtained with DT (Eqs. 28, 29, respectively) are also shown. For the case of \({\langle {l}_{i}\rangle }_{\phi }\) the fluence rate was calculated at \(z=0\) (i.e., the detector was at \(\mathbf{r}={\mathbf{r}}_{b}=(\rho ,\text{0,0})\)). The results obtained with the two MC codes match within statistical error, which is shown only for the MCX code. Even with this choice of the optical properties, we can see a good agreement between the two MC results and with DT, especially with \({\langle {l}_{i}\rangle }_{R}\), as is expected. The maximum discrepancy between the two types of mean pathlengths according to DT is about 70% at the source-detector separation \(\rho =15 \text{ mm}\) and depth of the region of interest \(z=12 \text{ mm}\), and becomes smaller at \(\rho =25 \text{ mm}\) and \(\rho =35 \text{ mm}\). In Fig. 3 we show the same comparison for the highest reduced albedo: \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.005, 1.5\right){\text{ mm}}^{-1}\).

Fig. 2
figure 2

Comparison of \({\langle {l}_{i}\rangle }_{R\_MC}\) (Eq. 31) obtained with MCX (\({\langle {l}_{i}\rangle }_{R\_MCX}\)) and FORTRAN MC (\({\langle {l}_{i}\rangle }_{R\_MCF}\)) and with those obtained with DT (Eqs. 28, 29), \({\langle {l}_{i}\rangle }_{R}\) and \({\langle {l}_{i}\rangle }_{\phi (z=0)}\). The cube of side 6 mm (Fig. 1) was scanned along the \(x\) axis at three different depths: \(z=6, 12, 20 \text{ mm}\) (top, middle and bottom row, respectively; \(z\) is the center of the cube) and for the three detectors of Fig. 1 at the distance \(\rho =15, 25, 35 \text{ mm}\) (1st, 2nd, and 3rd column, respectively). The optical properties of the medium are: \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.02, 0.5\right){\text{ mm}}^{-1}\) (lowest reduced albedo). Double arrows indicating the discrepancy between \({\langle {l}_{i}\rangle }_{R}\) and \({\langle {l}_{i}\rangle }_{\phi (z=0)}\) (i.e., \(\left({\langle {l}_{i}\rangle }_{R}-{\langle {l}_{i}\rangle }_{\phi (z=0)}\right)/{\langle {l}_{i}\rangle }_{\phi (z=0)}\)) are shown at the source-detector distance \(\rho =15\) (panels (a), (d) and (g)).

Fig. 3
figure 3

As in Fig. 2 but for the optical properties \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.005, 1.5\right){\text{ mm}}^{-1}\) (highest reduced albedo).

There is an excellent agreement between the two MC codes and between them and DT, especially for \({\langle {l}_{i}\rangle }_{R}\). The discrepancy between the two types of mean pathlengths are less than about 10% for all cases considered. For other values of the optical properties, we found an excellent agreement between MC codes, while the agreement between the two types of pathlength depended mostly on \({\mu }_{s}^{\prime}\) and little on the values of \({\mu }_{a}\). These tests show the consistency of the independent methods of calculation.

Comparison of MCX and diffusion theory results for the calculation of fluence rate and reflectance pathlengths

In Fig. 4 we show the comparison of fluence rate partial pathlengths (Eq. 29) calculated with both MC and DT for the optical properties: \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.02, 0.5\right){\text{ mm}}^{-1}\). In this case the region of interest was a cube of 1 mm side (Fig. 1) and was scanned at the depths \(z=6.5, 12.5, 20.5 \text{ mm}\) (top, middle and bottom row, respectively).

Fig. 4
figure 4

Comparison of \({\langle {l}_{i}\rangle }_{\phi }\) obtained with MCX and DT according to Eq. (29), and \({\langle {l}_{i}\rangle }_{R\_MC}\). A cube of side 1 mm was scanned along the \(x\) axis at three different depths: \(z=6.5, 12.5, 20.5 \text{ mm}\) (top, middle and bottom row, respectively) and for the three detectors of Fig. 1 at the distance \(\rho =15, 25, 35\text{ mm}\) (1st, 2nd, and 3rd column, respectively). The symbols refer to MCX results for: \({\langle {l}_{i}\rangle }_{R\_MC}\) (\({\langle {l}_{i}\rangle }_{R\_MCX}\)), \({\langle {l}_{i}\rangle }_{\phi }\) obtained with an isotropic point source at \(\mathbf{r}=(\rho ,0,s/2)\) (\({\langle {l}_{i}\rangle }_{\phi \_MCX(z=\frac{s}{2})}\)), and for \({\langle {l}_{i}\rangle }_{\phi }\) obtained with a pencil beam at \(=(\rho ,\text{0,0})\) (\({\langle {l}_{i}\rangle }_{\phi \_MCX(z=1/{\mu }_{s}^{\prime})}\)) for the adjoint calculations. The lines refer to DT calculations of \({\langle {l}_{i}\rangle }_{\phi }\) with \(\mathbf{r}=(\rho ,0,s/2)\) (\({\langle {l}_{i}\rangle }_{\phi (z=\frac{s}{2})}\)), and with \(\mathbf{r}=(\rho ,\text{0,1}/{\mu }_{s}^{\prime})\) (\({\langle {l}_{i}\rangle }_{\phi (z=1/{\mu }_{s}^{\prime})}\)). The optical properties are: \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.02, 0.5\right){\text{ mm}}^{-1}\). A double arrow indicating the maximum discrepancy between \({\langle {l}_{i}\rangle }_{R\_MCX}\) and \({\langle {l}_{i}\rangle }_{\phi \_MCX(z=\frac{s}{2})}\) (i.e.,\(\left({\langle {l}_{i}\rangle }_{R\_MCX}-{\langle {l}_{i}\rangle }_{\phi \_MCX(z=\frac{s}{2})}\right)/{\langle {l}_{i}\rangle }_{\phi \_MCX(z=\frac{s}{2})}\)) is shown in panel (a).

For the calculation of \({\langle {l}_{i}\rangle }_{\phi }\) with MCX, the direct simulations were carried out for a pencil beam at (0, 0, 0). The adjoint simulations were carried out in two different ways: (1) for the case of an isotropic point source at \((\rho ,0,s/2)\) (\({\langle {l}_{i}\rangle }_{\phi \_MCX(z=\frac{s}{2})}\)); (2) for a pencil beam at \((\rho ,\text{0,0})\) (\({\langle {l}_{i}\rangle }_{\phi \_MCX(z=1/{\mu }_{s}^{\prime})}\)), respectively. For the denominator of Eq. (29), \(\mathbf{r}\) was chosen either as \(\mathbf{r}=(\rho ,0,s/2)\) (\(s=1 \text{ mm}\) is the voxel size) for case 1), or \(\mathbf{r}=(\rho ,\text{0,1}/{\mu }_{s}^{\prime})\) for case 2). The reflectance partial pathlength calculated with MCX (\({\langle {l}_{i}\rangle }_{R\_MCX}\)), using detectors’ areas specified in section "Monte Carlo simulations”, are also reported. MCX results are plotted with their standard deviations obtained with three independent simulations.

We note that for \({\langle {l}_{i}\rangle }_{\phi }\) the results are more sensitive on the details of the adjoint calculation and the normalization factor, especially for this case (lowest reduced albedo). All the different methods have different biases with respect to \({\langle {l}_{i}\rangle }_{R\_MC}\), which is the real target for reflectance calculations. We note a discrepancy among the methods also at the depth of \(z=20.5\text{ mm}\). One clear advantage of using the adjoint method with the MC method is the improvement of the statistical error on \({\langle {l}_{i}\rangle }_{\phi }\) with respect to the conventional method for the calculation of \({\langle {l}_{i}\rangle }_{R\_MC}\). This is visible more clearly at \(z=20.5\text{ mm}\). The comparisons of partial pathlengths for the optical properties \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.01, 1\right){\text{ mm}}^{-1}\) and \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.005, 1.5\right){\text{ mm}}^{-1}\) are shown in Figs. 5 and 6, respectively. From these figures we can see that the difference between DT and MC calculations becomes almost negligible. This is also true for the two types of mean pathlengths regardless of the details of the calculations. In Figs. 5 and 6 the discrepancies between \({\langle {l}_{i}\rangle }_{\phi }\) and \({\langle {l}_{i}\rangle }_{R\_MC}\) at the source-detector separation of 35 mm for locations close to the detector are an effect of the large detectors used for the calculation of \({\langle {l}_{i}\rangle }_{R\_MC}\): the detector’s radius is \(6.2\text{ mm}\) in Fig. 5, and \(4.3\text{ mm}\) in Fig. 6. The discrepancies disappear once we choose a detector of smaller size (see Supplementary Information S5). We finally note that the calculation of \({\langle {l}_{i}\rangle }_{\phi }\) with DT and \(\mathbf{r}=(\rho ,0,s/2)\) represents a slight overestimation of the same type of calculation for \(\mathbf{r}={\mathbf{r}}_{b}=(\rho ,0, 0)\) which amounts to about 5–6% at the shortest source-detector separation for \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.02, 0.5\right){\text{ mm}}^{-1}\). The overestimation becomes 1–2% in all the other cases. One may infer that a similar consideration would apply to MC calculations.

Fig. 5
figure 5

As in Fig. 4 for typical values of the optical properties: \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.01, 1\right){\text{ mm}}^{-1}\)

Fig. 6
figure 6

As in Fig. 4 for values of the optical properties: \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.005, 1.5\right){\text{ mm}}^{-1}\).

Comparison of fluence rate pathlengths in presence of a highly absorbing thin layer

Last, in Fig. 7, we want to explore how Rytov approximation behaves for a medium containing a highly absorbing layer (Fig. 1). By studying this case we can get a sense about the correctness of this approximation for the calculation of Jacobians, in the presence of discrete strong absorbers like blood vessels. We show the comparison between \({\langle {l}_{i}\rangle }_{R\_MC}\) and \({\langle {l}_{i}\rangle }_{\phi }\) for a medium with \(1 \text{ mm}\) thick layer at the depth of \(z=6.5\text{ mm}\). The optical properties of the bulk of the medium are \(\left({\mu }_{ab},{\mu }_{sb}^{\prime}\right)=\left(0.01, 1\right){ \text{ mm}}^{-1}\), while for the absorbing layer \(\left({\mu }_{a,mel},{\mu }_{s,mel}^{\prime}\right)=\left(2, 1\right){\text{ mm}}^{-1}\).

Fig. 7
figure 7

Comparison of \({\langle {l}_{i}\rangle }_{R\_MC}\)(\({\langle {l}_{i}\rangle }_{R\_MCX}\)) and \({\langle {l}_{i}\rangle }_{\phi }\)(\({\langle {l}_{i}\rangle }_{\phi \_MCX(z=\frac{s}{2})}\), \({\langle {l}_{i}\rangle }_{\phi \_MCX(z=1/{\mu }_{s}^{\prime})}\)) obtained with MCX for a homogeneous medium containing a \(1 \text{ mm}\) thick highly absorbing layer with its center at the depth \(z=6.5\text{ mm}\) (Fig. 1). The optical properties are: \(\left({\mu }_{ab},{\mu }_{sb}^{\prime}\right)=\left(0.01, 1\right){\text{ mm}}^{-1}\) and \(\left({\mu }_{a,mel},{\mu }_{s,mel}^{\prime}\right)=\left(2, 1\right){ \text{ mm}}^{-1}\), for the bulk of the medium and for the thin layer, respectively. The source-detector separation is \(\rho =35 \text{ mm}\) and the area of the detector is \(1 {\text{ mm}}^{2}\) (for the calculation of \({\langle {l}_{i}\rangle }_{R\_MC}\)). The region of interest (cube of \(1 \text{ mm}\) side) was scanned along the \(x\) axis at a depth of \(z=12.5 \text{ mm}\). The blue and orange symbols correspond to the adjoint simulations with isotropic source and the pencil beam, respectively.

The absorption of the thin layer can be representative of melanin in the epidermis, but the location can be representative of a blood vessel. The source-detector separation is \(\rho =35\text{ mm}\) and the area of the detector \({1\text{ mm}}^{2}\). The region of interest (cube of \(1\text{ mm}\) side) was scanned along the \(x\) axis at a depth of \(\text{z}=12.5\text{ mm}\). We considered \({\langle {\text{l}}_{\text{i}}\rangle }_{\upphi }\) calculated in the same two ways as in Figs. 4, 5 and 6. As we can see, the three calculations show a good agreement to within statistical errors. These results give us confidence that the Rytov approximation can be applied for the calculation of absorption Jacobians also in presence of strong absorbers. However, we want to caution about possible problems when the thin absorbing layer is at the boundary (as a true melanin layer). To avoid these problems, we have proposed a calibration method38.

These results are shown in Fig. 8, where we can see that the discrepancies between the two pathlengths are large, especially when we consider an adjoint simulation with a pencil beam at the detector’s location and the normalization is carried out at \(\mathbf{r}=(\rho ,\text{0,1}/{\mu }_{s}^{\prime})\) (orange symbols). We notice that the main difference between the correct Jacobian (black symbols) and the blue and orange symbols is an amplitude factor. This is mostly due to the rapid decrease of the fluence rate inside the melanin layer which causes a large overestimation of the denominator in Eq. (29) when we approximate the true normalization factor (i.e., the fluence rate at \(z=0\)) with those used in the simulations. One solution could be to use a much smaller voxel size; however, the simulation will have to run for a longer time to have the same statistical significance. Besides, there might also be a small effect due to the highly absorbing layer, which close to the detector implies that photons travelling along certain directions are more likely to be detected. Another solution is to apply a calibration method. The calibration method we have proposed38, imposes that the total mean pathlength calculated in the whole medium by using the Rytov approximation at each voxel coincides with the total mean pathlength \({\langle L\rangle }_{R\_MC}\). In other words, the mean pathlength in a voxel “\(i\)” according to the Rytov approximation \({\langle {l}_{i}\rangle }_{\phi }=\frac{{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{i}^{\prime}\right){G}_{{\phi }_{0}}\left({\mathbf{r}}_{i}^{\prime}\to {\mathbf{r}}_{b}\right)}{{G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{b}\right)}dV\) is multiplied by a factor \(\beta \) such that \(\beta {\sum }_{i=1}^{N}{\langle {l}_{i}\rangle }_{\phi }={\langle L\rangle }_{R\_MC}\). This calibration is done for the two adjoint simulations discussed above, i.e., for the adjoint simulation with the isotropic source (for which \(\beta =1.76\)) and the pencil beam (for which \(\beta =9.41\)). The calibration method shows excellent comparison with \({\langle {l}_{i}\rangle }_{R\_MC}\).

Fig. 8
figure 8

Comparison of \({\langle {l}_{i}\rangle }_{R\_MC}\) (\({\langle {l}_{i}\rangle }_{R\_MCX}\)) and \({\langle {l}_{i}\rangle }_{\phi }\)(other symbols) obtained with MCX for a medium with a \(1 \text{ mm}\) thick layer at the boundary with the outer medium (i.e., depth of the center of the layer is \(z=0.5 \text{ mm}\)). The optical properties, source-detector separation, area of the detector, and voxel size are as in Fig. 7. The region of interest (cube of 1 mm side) was scanned along the \(x\) axis at a depth of \(z=6.5 \text{ mm}.\) The blue and orange symbols correspond to the adjoint simulations with isotropic source (\({\langle {l}_{i}\rangle }_{\phi \_MCX(z=\frac{s}{2})}\)) and the pencil beam (\({\langle {l}_{i}\rangle }_{\phi \_MCX(z=1/{\mu }_{s}^{\prime})}\)), respectively. The cyan (\({\langle {l}_{i}\rangle }_{\phi \_MCX\left(z=\frac{s}{2}\right)}\text{with} \beta \))and green (\({\langle {l}_{i}\rangle }_{\phi \_MCX(z=1/{\mu }_{s}^{\prime})}\text{with} \beta \)) symbols correspond to the adjoint simulations after applying the proposed calibration method (which identifies a factor \(\beta \)) to the blue and orange plots, respectively.

Calculations of reflectance perturbation due to a strong localized absorption change

In this section we show two comparisons between calculations of reflectance change with respect to a baseline value (i.e., \(\frac{\Delta R}{{R}_{0}}\)) for a wide range of absorption change (i.e., \({\Delta \mu }_{a}\)) between the region of interest and the background medium. The purpose is to show the performance of the Rytov approximation for reflectance (Eq. 22) with respect to the Born approximation and higher order Born series up to the fourth order. For higher order Born series we used a heuristic method proposed previously21 (section "Diffusion theory calculations”).

For each source-detector pair of Fig. 1 we chose a single cubic defect of side \(6 \text{ mm}\) placed at the depth of \(z=12\text{ mm}\) and half-way between the source and the detector. The exact coordinates of each defect are provided in the title of Fig. 9. The baseline optical properties of the diffusive medium were: \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.005, 1.5\right){ \text{ mm}}^{-1}\). MC simulations were run for \(g=0\). The change of the absorption coefficient between the defect and the background medium spanned the range \(\Delta {\mu }_{a}\in \left(0, 0.04\right){ \text{ mm}}^{-1}\) with increments of \(0.001 {\text{ mm}}^{-1}\). The reason to target large absorption change is to describe perturbations due to the presence of blood vessels. Figure 9 shows \(\frac{\Delta R}{{R}_{0}}\) calculated with the first four order of the Born series (the 1st one is the Born approximation discussed in section "Born and Rytov approximations and their physical meaning"), with the reflectance under the Rytov approximation (Eq. 22), and with the FORTRAN MC code. As expected, the results obtained with the Rytov approximation have a smaller discrepancy with respect to MC results than 1st order Born series. However, the higher order Born series outperforms the Rytov approximation (as expected). In Fig. 10 we show another example for different optical properties of the background medium \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.02, 1.5\right){ \text{ mm}}^{-1}\) and for values of \(\Delta {\mu }_{a}\in \left(-0.019, 0.04\right){\text{ mm}}^{-1}\).

Fig. 9
figure 9

Comparison of reflectance change \(\frac{\Delta R}{{R}_{0}}\) calculated with the first four orders of Born series, with Rytov approximation and with the FORTRAN MC code at the source-detector separation of \(\rho =15 \text{ mm}\) (left panel), \(\rho =25 \text{ mm}\) (center panel) and \(\rho =35 \text{ mm}\) (right panel). For each source-detector pair the location of the defect (a cube of side \(6 \text{ mm}\)) is indicated in the title. The baseline properties of the medium are \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.005, 1.5\right){ \text{ mm}}^{-1}\). The change of absorption coefficient between defect and background medium spans the range \(\Delta {\mu }_{a}\in \left(0, 0.04\right){ \text{ mm}}^{-1}\) with increments of \(0.001 {\text{ mm}}^{-1}\).

Fig. 10
figure 10

As in Fig. 9 but for different values of the background medium optical properties: \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.02, 1.5\right){ \text{ mm}}^{-1}\). The change of absorption coefficient was: \(\Delta {\mu }_{a}\in \left(-0.019, 0.04\right){ \text{ mm}}^{-1}\)

These two results are not surprising since the Rytov approximation (unlike the Born approximation) depends on higher powers of \(\Delta {\mu }_{a}\), but the coefficient of each power is \({\langle {l}_{i}\rangle }_{\phi }^{k}\) and not the correct higher order moment \({\langle {l}_{i}^{k}\rangle }_{\phi }\). One way to improve the Rytov approximation will be proposed in section "Conclusions and future work”.

Discussions

In this work we have revisited the current use of the Rytov approximation in diffuse optics. One of the main applications we focused on is the well-known use of the Rytov approximation for a fast computation of Jacobians with MC simulations. This method can be very time-efficient and effective to reduce statistical errors when compared to the traditional method of mean pathlength estimation, whenever diffusion conditions are verified. We have shown that the Rytov approximation has a natural physical meaning, once we reframe the photon transport process by using a statistical approach derived from the microscopic Beer–Lambert law. Given a region of interest where the Jacobian for absorption changes is calculated, and an observation point where the fluence rate is measured, the exponent in the Rytov approximation represents the mean partial pathlength travelled by photons inside the region of interest times the change of the absorption coefficient. We have called this partial pathlength, the fluence rate partial pathlength. We have argued that this partial pathlength and the partial pathlength defined for detected photons (named the reflectance partial pathlength) are distinct even when the observation point coincides with a detector point at the boundary. However, by physical arguments we can infer that these two pathlengths are similar in many situations of interest, even beyond DT. We can think of two correct methods to use MC simulations to calculate these two partial pathlengths. One way is to use a coupled forward-adjoint method17 inspired by perturbation theory of RTE (Eqs. 2527). This way can be efficient for variance reduction, but it may not be suitable for typical problems of DOT and NIRS, due to the high memory requirements. The other method, the conventional one, is to calculate the pathlength travelled by each detected photon inside a region of interest (see Supplementary Information S3). This method is suitable for NIRS and DOT applications and can be made more efficient by using the pseudorandom features of random number generators19,37. However, due to large memory requirements, even this method may not be suitable to calculate sensitivity maps for a fine voxelization of the medium. We stress that these are the only two methods for the calculation of the absorption Jacobian in a general situation under the validity of RTE. Whenever diffusion conditions are met, one can use the Rytov approximation together with the reciprocity theorem to speed up the computation of Jacobians19. However, we note that the notion of “meeting diffusion condition” is not a binary one, but it is true at different levels of approximation depending on the parameters of each case. This consideration prompted us to carry out more in-depth tests between the Jacobians obtained with the Rytov approximation (i.e., \({\langle {l}_{i}\rangle }_{\phi }\)) and those obtained with reflectance measurements using both DT and MC (i.e., \({\langle {l}_{i}\rangle }_{R}\) and \({\langle {l}_{i}\rangle }_{R\_MC}\), respectively). We have carried out comparisons for nine different combinations of the optical properties that span typical values found in NIRS and DOT. In a first series of tests (Figs. 2, 3), we have compared \({\langle {l}_{i}\rangle }_{R\_MC}\) obtained with two independent MC codes and with DT (Eq. 28). In the same figures we also reported \({\langle {l}_{i}\rangle }_{\phi }\) based on DT (Eq. 29). The comparison of the two MC codes shows excellent agreement in all the cases studied. DT calculations have shown different levels of agreement with MC data, depending on the optical properties of the medium and the source-detector separation. For the case of \(\left({\mu }_{a},{\mu }_{s}^{\prime}\right)=\left(0.02, 0.5\right){ \text{ mm}}^{-1}\) both \({\langle {l}_{i}\rangle }_{R}\) and (especially) \({\langle {l}_{i}\rangle }_{\phi }\) show some discrepancy with the partial pathlength calculated by MC simulations (Fig. 2). For the case of \({\langle {l}_{i}\rangle }_{\phi }\), these discrepancies reach to about 70% at \(\rho =15 \text{ mm}\), and \(z=12 \text{ mm}\). We note that the discrepancies between \({\langle {l}_{i}\rangle }_{\phi }\) and \({\langle {l}_{i}\rangle }_{R}\) would not exist if one adopted PCBC. However, it is unlikely that the results obtained with PCBC would be a better match with MC results. In fact, the literature has shown different results about the comparison of DE solutions based on EBC (like in this work) and PCBC, with MC results39,40,41. Therefore, we think that the most plausible explanation for these discrepancies is a combination of factors, including the relatively short source-detector separation, the choice of the optical properties (i.e., higher absorption and lower reduced scattering coefficients) and the approximate nature of any boundary condition for the DE. One should be aware of these factors if DT is used for the calculation of Jacobians. The more the diffusion conditions are fulfilled (i.e., larger source-detector separation and higher reduced scattering coefficient) the less discrepancies are found between MC and DT calculations, and between the two types of partial pathlengths (Fig. 3). These results are confirmed also in Figs. 4, 5, and 6, where we have used MCX for the calculation of \({\langle {l}_{i}\rangle }_{\phi }.\) In this case, the shortcomings due to the approximate boundary conditions of DE are taken out of the picture. As a result, we can see a better agreement between \({\langle {l}_{i}\rangle }_{R\_MC}\) and \({\langle {l}_{i}\rangle }_{\phi }\). The adjoint MC simulations for the calculation of \({\langle {l}_{i}\rangle }_{\phi }\) were run in two different ways: (a) by considering an isotropic point source at \(\mathbf{r}=(\rho ,0,\text{s}/2);\) (b) by considering a pencil beam at \({\mathbf{r}}_{b}=(\rho ,\text{0,0})\). The normalization factor was chosen as \({G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to \mathbf{r}\right)\) for case a) and \({G}_{{\phi }_{0}}\left({\mathbf{r}}_{0}\to {\mathbf{r}}_{1}\right)\) for case b) where \({\mathbf{r}}_{1}=(\rho ,\text{0,1}/{\mu ^{\prime}}_{s})\). In the same figures we have also reported the corresponding DT calculations of \({\langle {l}_{i}\rangle }_{\phi }\) (Eq. 29) and \({\langle {l}_{i}\rangle }_{R\_MC}\) calculated with MCX. For the lowest value of the reduced albedo (Fig. 4), method (a) shows discrepancy up to ~ 25% with respect to \({\langle {l}_{i}\rangle }_{R\_MC}\) at a depth of 6.5 mm and for \(\rho =15 \text{ mm}\); method (b) yields the best comparison with \({\langle {l}_{i}\rangle }_{R\_MC}\). As diffusion conditions are more and more satisfied (Figs. 5, 6), both methods are substantially equivalent and \({\langle {l}_{i}\rangle }_{\phi }\approx {\langle {l}_{i}\rangle }_{R}\). Therefore, these results suggest that method b) is a better choice for a wider range of optical properties.

In this work we bring to the attention of the scientific community the fact that the correct adjoint simulation for the calculation of the Jacobians for reflectance measurements should use an appropriate adjoint source at the detector’s site (section "Relationship between the correct forward-adjoint method and the present work"). This fact has been previously stated in some literature of DT6 and MC17. In other words, the calculation of the detected reflectance due to an isotropic source at the voxel \(\mathbf{r}\mathbf{^{\prime}}\) is equivalent to the calculation of the fluence rate at the same voxel due to an appropriate adjoint source located at the detector. We have described in detail the properties of this adjoint source in section "Relationship between the correct forward-adjoint method and the present work". This would allow the calculation of \({\langle {l}_{i}\rangle }_{R}\) by using MC-calculated fluence rate and reflectance. To the best of our knowledge this feature is not currently implemented in MCX. Most probably this method could yield better results than the calculation of \({\langle {l}_{i}\rangle }_{\phi }\) with method b) above.

By using only MC simulations, we have also reported two comparisons between \({\langle {l}_{i}\rangle }_{R\_MC}\) and \({\langle {l}_{i}\rangle }_{\phi }\) for a medium with a highly absorbing thin layer (having a thickness of 1 mm) embedded in it. The purpose of these examples was twofold: (a) We wanted to show that the Rytov approximation (when applied correctly) is a valid method for the calculation of Jacobians even in presence of a strong absorber. The optical properties of the thin layer were typical of melanin even when it was embedded at a depth of 6.5 mm (Fig. 7); (b) we wanted also to point out some “dangers” of the Rytov approximation when it is not applied correctly (Fig. 8). The example of Fig. 8 is particularly useful, given the widespread use of MCX for pulse oximetry simulations, where one of the current topics is the study of the effect of the melanin layer. In this case, the presence of the melanin layer causes a strong gradient of the fluence rate close to the boundary, and one would have to resort to a finer voxelization of the medium. To bypass this problem, in Fig. 8 we have also shown some results obtained with a novel calibration method that we have recently proposed38. This method yields an excellent agreement with \({\langle {l}_{i}\rangle }_{R\_MC}\).

Finally, we have tested a novel formula for reflectance calculations under the Rytov approximation Eq. (22). We have calculated the change of reflectance caused by three localized absorption perturbations (a different one for each source-detector distance) for a wide range of the absorption contrast. As expected, the Rytov approximation performs better than the Born approximation but worse than the higher order Born series. Despite the Rytov approximation being nonlinearly dependent on \(\Delta {\mu }_{a}\), it is based on the wrong assumption that \({\langle {l}_{i}\rangle }_{\phi }^{k}\) can substitute \({\langle {l}_{i}^{k}\rangle }_{\phi }\). We expect that higher order Rytov approximation3 can show faster convergence than higher order Born series, as it is discussed in the next section.

Conclusions and future work

In this work we have reviewed the use of the Rytov approximation in diffuse optics, especially the burgeoning application in MC simulations for the calculations of Jacobians. Under the validity of diffusion conditions, we have shown the natural physical meaning of the Rytov approximation, in terms of the fluence rate partial pathlength, which is directly related to the photon migration process. Based on this physical meaning, we have argued that for typical values of the optical properties in NIRS and DOT, the fluence rate pathlength is a close representation of the reflectance pathlength. In other words, in these situations the Rytov approximation can be used as a good surrogate of \({\langle {l}_{i}\rangle }_{R\_MC}\), especially if one uses a particular adjoint simulation with a pencil beam at the detector’s site and normalization of the fluence rate calculated at the diffusion length (\(1/{\mu ^{\prime}}_{s}\)). One alternative way, which is more logical for reflectance measurement is to use an appropriate adjoint simulation (based on general properties of the reciprocity theorem) that directly targets this type of measurement. This method is known in the scientific literature6,17 but not currently implemented in MCX.

In principle the method of mean pathlength estimation trough propagation of the photons and intersection with a region of interest is valid in general conditions without any approximation. Future work will be directed for finding strategies where this method can be used to provide detailed sensitivity maps of the absorption coefficient avoiding memory issues. We also plan to do more testing and to refine the calibration method proposed in Blaney et al.,38. One method can be to define several calibration factors, for example as many as some macroscopic regions of interest. For example, in a layered model of the finger one could define layer-specific calibration factors. Last, we plan to use the Rytov series together with the heuristic calculation of higher order moments for a faster convergence of perturbation calculations for large absorption contrasts. This method can be useful to speed up the convergence of forward problem calculations. In fact, the relationships between the different orders of the Born and Rytov series are well known3. These relationships can be reframed with the pathlength moments, which for the case of higher order Rytov series, leads to combinations of higher and lower moments and their powers. For example, while the second order normalized Born series leads to \(\frac{1}{2}\langle {l}_{i}^{2}\rangle {\Delta \mu }_{ai}^{2}\), the second order Rytov series is: \({\psi }_{2}=\frac{1}{2}\left[\langle {l}_{i}^{2}\rangle -{\langle {l}_{i}\rangle }^{2}\right]{\Delta \mu }_{ai}^{2}\). It is well possible that the higher order Rytov series will improve the convergence of perturbation calculations.