Abstract
Biophysical model fitting plays a key role in obtaining quantitative parameters from physiological signals and images. However, the model complexity for molecular magnetic resonance imaging (MRI) often translates into excessive computation time, which makes clinical use impractical. Here, we present a generic computational approach for solving the parameter extraction inverse problem posed by ordinary differential equation (ODE) modeling coupled with experimental measurement of the system dynamics. This is achieved by formulating a numerical ODE solver to function as a step-wise analytical one, thereby making it compatible with automatic differentiation-based optimization. This enables efficient gradient-based model fitting, and provides a new approach to parameter quantification based on self-supervised learning from a single data observation. The neural-network-based train-by-fit pipeline was used to quantify semisolid magnetization transfer (MT) and chemical exchange saturation transfer (CEST) amide proton exchange parameters in the human brain, in an in-vivo molecular MRI study (n = 4). The entire pipeline of the first whole brain quantification was completed in 18.3 ± 8.3 minutes. Reusing the single-subject-trained network for inference in new subjects took 1.0 ± 0.2 s, to provide results in agreement with literature values and scan-specific fit results.
Similar content being viewed by others
Introduction
Magnetic resonance imaging (MRI) plays a central role in clinical diagnosis and neuroscience. This modality is highly versatile and can be selectively programmed to generate a large number of image contrasts1, each sensitive to certain biophysical parameters of the tissue. In recent years, there has been extensive research into developing quantitative MRI (qMRI) methods that can provide reproducible measurements of magnetic tissue properties (such as: T1, T2, and \({{{{\rm{T}}}}}_{2}^{* }\)), while being agnostic to the scan site and the exact acquisition protocol used2. Classical qMRI quantifies each biophysical property separately3, using repeated acquisition and gradual variation of a single acquisition parameter under steady state conditions. This is followed by fitting the model to an analytical solution of magnetization vector dynamics4.
The exceedingly long acquisition times associated with the classical quantification pipeline have motivated the development of magnetic resonance fingerprinting (MRF)5, an alternative paradigm for the joint extraction of multiple tissue parameter maps from a single pseudorandom pulse sequence. Since MRF data are acquired under non-steady state conditions6, the corresponding magnetization vector can only be resolved numerically. This comes at the expense of the complexity of the inverse problem, namely finding tissue parameters that best reconstruct the signal according to the forward model of spin dynamics. Since model fitting under these conditions takes an impractically long time7, MRF is commonly solved by dictionary matching, where a large number of simulated signal trajectories are compared to experimentally measured data8. Unfortunately, the size of the dictionary scales exponentially with the number of parameters (the “curse of dimensionality”9), which rapidly escalates the compute and memory demands of both generation and subsequent use of the dictionary for pattern matching-based inference.
Recently, various deep learning (DL)-based methods have been developed for replacing the lengthy dictionary matching with neural-network (NN)-based inference10,11,12,13. While this approach greatly reduces the parameter quantification time, networks still need to be trained using a comprehensive dictionary of synthetic signals. Since dictionary generation may take days12, it constitutes an obvious bottleneck for routine use of MRF, and reduces the possibilities for addressing a wide variety of clinical scenarios. Even with a faster generation, the transfer of synthetic data-trained NN to experimental data raises concerns about biased estimates.
The complexity and time constraints associated with the MRF pipeline are drastically exacerbated for molecular imaging applications that involve a plurality of proton pools, such as chemical exchange saturation transfer (CEST) MRI14. While CEST has demonstrated great potential for dozens of biomedical applications15,16,17,18,19,20,21, some on the verge of entering clinical practice22, the inherently large number of tissue properties greatly complicate analysis23. This has prompted considerable efforts to transition from CEST-weighted imaging to fully quantitative mapping of proton exchange parameters24,25,26,27,28. Early CEST quantification used the fitting of the classical numerical model (based on the underlying Bloch-McConnell equations) after irradiation at various saturation pulse powers (B1)27. However, applying this approach in a pixelwise manner in-vivo is unrealistic because both the acquisition and reconstruction steps may require several hours. Later, faster approaches, such as quantification of the exchange by varying saturation power/time and Omega-plots25,29,30,31 still rely on steady-state (or close to steady state) conditions, and approximate analytical expressions of the signal as a function of the tissue parameters32,33. Unfortunately, a closed-form analytical solution does not exist for most practical clinical CEST protocols, which utilize a train of off-resonant radiofrequency (RF) pulses saturating multiple interacting proton pools. Similarly to the quantification of water T1 and T2, incorporating the concepts of MRF into CEST studies provided new quantification capabilities28,34,35,36,37 and subsequent biological insights, for example, in the detection of apoptosis after oncolytic virotherapy12. However, in order to further push the boundaries of CEST MRF research and expedite its progress, the long dictionary generation associated with each new application needs to be replaced by a rapid and flexible approach that adequately models multiple proton pools under saturation pulse trains.
Here, we describe a physics-based DL framework for rapid model fitting of the human brain tissue proton spin properties. While this approach is applicable for quantifying a variety of MRI parameters, we focus on a challenging CEST imaging scenario, involving multiple proton pools, a saturation pulse train, and non-steady-state MRF acquisition. The computational pipeline (Fig. 1) combines a spin physics simulator and a NN-based quantitative parameter reconstructor in a fully auto-differentiable manner38. Our system effectively solves and inverts the Bloch-McConnell ordinary differential equations (ODEs), which govern the multi-pool exchange, saturation, and relaxation dynamics of molecular MRI. Hence, we refer to this approach as “neural Bloch McConnell fitting” (NBMF). Importantly, the network can be be trained in a self-supervised manner, directly on the single-subject data of interest (inspired by related work on test-time-39, internal-40, and zero-shot-40,41,42 learning). This circumvents the need for prior curation of a large training dataset, which is often inaccessible, especially for molecular MRI.
A quantitative parameter reconstructor parameterized as a multi-layer perceptron (MLP) and a differentiable Bloch-McConnell simulator are serially connected into a single computational graph. Single-subject Magnetic Resonance Fingerprinting (MRF) data serves both as the input and as the regression target for the reconstructor-simulator circuit. The network convergence (a) provides the fitted exchange parameter maps for the examined subject as well as a trained NN reconstructor; the latter can be used to extract parameter maps for new subjects within seconds (b). The simulator can be realized using the exact numerical Bloch McConnell ODE solver or using analytical approximations when available (e.g., for 2-pool semisolid Magnetization Transfer (MT) quantification33). While not shown in the diagram, auxiliary per-voxel data such as T1, T2, B0, and B1 maps can be added as input to the neural reconstructor and the simulator. Furthermore, the pipeline main block can be serially repeated so that estimated semisolid MT volume fraction (fss) and proton exchange rate (kssw) maps inferred at the first stage are joined to the raw data used in a second reconstructor aimed to quantify the amide proton exchange parameters (fs, ksw).
Results and discussion
In-vitro CEST quantification
A phantom composed of six vials with different combinations of L-arginine concentration and pH was assembled and scanned in a 3T clinical scanner (Prisma, Siemens Healthineers, Germany) using a previously published non-steady-state rapid CEST protocol12,43. Good agreement was obtained between the NBMF-estimated and known L-arginine concentrations (Fig. 2a, c, e): Pearson’s r = 0.986, p = 3.0 × 10−4, root mean square error (RMSE) = 8.4 mM, mean absolute percentage error (MAPE) = 10.8%). The NBMF-reconstructed proton exchange rates were in good agreement with the corresponding values estimated by traditional MRF dictionary-matching (Fig. 2b, d, f): Pearson’s r = 0.999, p = 1.6 × 10−6, RMSE = 41.0 s−1, MAPE = 13.2%. The pH dependence of the NBMF reconstructed exchange rate (Fig. 2b) was a good fit for a base-catalyzed proton exchange model (R2 = 0.94, p = 1.4 × 10−3), as predicted by theory44. An additional in-depth comparison between traditional dictionary matching and NBMF is available in Supplementary Table 1, and Supplementary Figs. 1, 2.
L-arginine samples were imaged using a pulsed Chemical Exchange Saturation Transfer Magnetic Resonance Fingerprinting (CEST-MRF) protocol in a 3T clinical scanner. The neural Bloch McConnell fitting (NBMF)–based L-arginine concentrations (a) and proton exchange rates (b) were in good agreement with those obtained by dictionary-based pattern matching (c and d, respectively). The ground truth L-arginine concentrations and pH values are mentioned in the white text next to each vial. The pixelwise distributions are further compared in (e, f). Each point in the swarm plot reflects a single 1.8 mm × 1.8 mm × 5.4 mm voxel.
Quantifying the semisolid-MT and CEST proton exchange parameters in the human brain
The NBMF pipeline used in-vitro was modified for semisolid-MT and amide proton exchange parameter mapping in the human brain. Two 3D and rapid acquisition pulse sequences were applied, as described previously12,43. The first sequence varied the saturation pulse frequency offset between 6 and 14 ppm (designed to encode semisolid-MT information), whereas the second sequence fixed it at 3.5 ppm (for amide proton parameter encoding). In both cases, a total of 31 raw information encoding images were generated, with the saturation pulse power randomly varied between 0 and 4 μT12,44. Water relaxation (T1 and T2) and field maps (B1 and B0) were acquired separately and used as an extra input to the neural reconstructor described in Fig. 1 in order to avoid water-pool- and inhomogeneity-associated biases, respectively12. A two-step NBMF (see “Methods” section) was used to fit the semisolid-MT and amide proton exchange parameters to the raw data.
Quantitative semisolid-MT and amide proton exchange parameter maps derived from a representative healthy volunteer are presented in Fig. 3 and Fig. 4, respectively. The resulting proton volume fractions and exchange rates were in agreement with the literature (although the large variability in previous reports is noted; see Fig. 3d, e and Fig. 4d, e). The mean values obtained for white/gray matter (WM/GM) were: fss = 13.09 ± 3.44(%), kssw = 34.7 ± 7.8(s−1), fs = 0.33 ± 0.08(%), ksw = 305.1 ± 34.0(s−1) for white matter and fss = 6.28 ± 1.88(%), kssw = 44.2 ± 7.5(s−1), fs = 0.21 ± 0.06(%), ksw = 235.9 ± 46.0(s−1) for gray matter.
Results of neural Bloch-McConnell fitting (NBMF)-based quantification of the MT-related tissue parameters in the human brain scanned with a pulsed MT Magnetic Resonance Fingerprinting (MT-MRF) protocol are presented. Representative reconstructed parameter maps of the semisolid-MT proton volume fraction (a) and proton exchange rate (b), alongside a fidelity estimation (c) of the data-model agreement, computed as R2 = 1-NMSE (normalized mean square error). d, e Statistical analysis of the resulting proton exchange parameter values across the brain white matter and gray matter (WM/GM) regions of interest (box-plots, n=47,442/64,611 voxels, respectively), compared to literature (colored markers)12,43,45,90,91. In the boxplots, the central horizontal lines represent median values, box size represents the two central (2nd, 3rd) quartiles, the whiskers represent the 90 central percentiles, and outliers are omitted.
Results of neural Bloch-McConnell fitting (NBMF)-based quantification of the APT-related tissue parameters in a human brain scanned with a pulsed Chemical Exchange Saturation Transfer Magnetic Resonance Fingerprinting (CEST-MRF) protocol. Representative reconstructed parameter maps of the amide proton volume fraction (a) and proton exchange rate (b), alongside a fidelity estimation (c) of the data-model agreement, computed as R2 = 1-NMSE (normalized mean square error). d, e Statistical analysis of the resulting proton exchange parameter values across the brain white matter and gray matter (WM/GM) regions of interest (box-plots, n = 47442/64611 voxels, respectively), compared to literature (colored markers)12,45,91,92. In the boxplots, the central horizontal lines represent median values, box size represents the two central (2nd, 3rd) quartiles, the whiskers represent the 90 central percentiles, and outliers are omitted.
The joint fit and training of the NBMF produced a neural reconstructor, optimized on a single subject. We then re-applied the trained reconstructors to additional subjects in a fast inference mode. A representative example comparing the parameter maps obtained from single-subject NBMF with those obtained by a rapid reconstructor reuse is shown in Fig. 5. The resulting agreement metrics (Fig. 5e) were as follows; NRMSE: 7 ± 1%, 12 ± 3%, 7 ± 1%, and 18 ± 1%; Intraclass correlation coefficient ICC(2,1): 0.87 ± 0.03, 0.82 ± 0.04, 0.86 ± 0.03, 0.86 ± 0.03; SSIM: 0.93 ± 0.02, 0.87 ± 0.07, 0.94 ± 0.01, 0.90 ± 0.03, for the fss, kssw, fs, and ksw, respectively. Additional analysis is provided in Supplementary Fig. 4.
A comparison between the results of single-subject NBMF (a, b) and real-time quantification of the same subject by inferring the neural reconstructor trained while fitting another subject (c, d). A perceptually and quantitatively similar outputs were obtained for both semisolid (a, c) and amide (b, d) exchange parameters mapping. e Similarity analysis using normalized root-mean-square (NRMSE), intraclass correlation coefficient (ICC(2,1), absolute agreement-assessing variant), and structural similarity index measure (SSIM) across all (n = 50) processed brain slices. In box plots, the central horizontal lines represent median values, box size represents the two central (2nd, 3rd) quartiles, whiskers represent 1.5× the interquartile range above and below the upper and lower quartiles, and circles represent outliers.
Computational complexity, timing, and comparison with alternative approaches
The combined NBMF training and fitting procedure for all four semisolid-MT and amide proton volume fraction and exchange rate parameter maps from the whole brain of a single subject (169K–194K voxels) took 18.3 ± 8.3 min on a standard GPU-equipped (GeForce RTX 3060) desktop workstation, of which, the two-pool quantification of the semisolid MT pool parameters took 3.0 ± 0.4 min. Re-applying the trained reconstructors for whole-brain parameter mapping on a new subject took 1.0 ± 0.2 s. Overall, the complete quantification pipeline takes less than 30 min for NBMF, compared to at least several hours using previously reported implementations of traditional Bloch-Fitting45, or dictionary-based preparation and supervised neural network training.12,37 (Table 1).
Next, we performed unified benchmarking of dictionary generation, matching, and supervised learning, using the accelerated approach developed as part of this work (GPU-based JAX formulation of the Bloch McConnell numerical solution); see additional implementation details in Supplementary Note 2. Notably, this yielded comparable timing to self-supervision (Table 2), given that a non-cartesian sampling grid is used for dictionary generation. The benefit of nonetheless using the self-supervised approach compared to supervised training is highlighted in Supplementary Notes 1–3 in the context of consistency with the raw acquired data and per-subject discrepancy minimization. In general, by unlocking rapid direct fitting (via automatic differentiation) and coupling it with self-supervised learning, NBMF constitutes an alternative way to address the ill-posed in-vivo quantification challenge. It contributes to an improved consistency of the quantitative parameter estimates with the raw data given the model, compared to dictionary-based supervised learning (Supplementary Fig. 3). By combining the explicit objective of minimal data-model discrepancy with implicit neural regularization, NBMF created smoother maps with visible contrast, while keeping the data-model agreement close to that achieved by dictionary matching (Supplementary Fig. 5).
Automatic differentiation of the Bloch-McConnell (BM) equation solutions
Quantification of semisolid MT/CEST proton exchange parameters under non-steady-state conditions is a notable example of a biophysical estimation where the forward model is perceived as too complex for direct inverse solution via fitting (requiring several days for a single whole brain reconstruction45). While the solution can be approximated via MRF, the large simulated signal dictionaries12 associated with multi-pool imaging also demand significant computational resources46,47, limiting the development of new pulse sequences. A recently reported dictionary-free alternative48 proposed unsupervised learning for semisolid-MT parameter quantification. However, this method assumes continuous pulse irradiation, which is not available in many clinical scanners, and also relies on analytical solutions, which are not compatible with multi-pool pulsed CEST imaging.
The dictionary-free method presented in this work overcomes all such limitations. Our approach is based on a fundamental insight: by proper formulation, ODE models considered only numerically solvable can become step-wise analytical, and thereby compatible with automatic differentiation-based optimization. Specifically, the suggested formulation enables GPU-based matrix inversion and exponentiation, which translates into efficient gradient descent via back-propagation. Combining this concept with a recently reported high-performing automatic differentiator38 provides a new option for solving complex biophysical estimation tasks such as pulsed CEST quantification, demonstrated here. Compared to standard model fitting, this approach avoids (i) computationally expensive and inaccurate purely-numerical derivatives computed via multiple evaluations, and (ii) explicit analytical approximations, which can only be applied to a limited subset of cases and lack generalization (e.g., unsuitable for a multi-proton-pool pulsed-RF saturation). Unlike MRF dictionary-trained networks10,28, the suggested approach can provide parameter estimates that allow the model to best describe the raw data (Supplementary Fig. 3).
Supervised NN training using a synthetic signal dictionary requires the estimation of the application-specific parameter distribution, which is often unknown in advance. The self-supervised (NBMF) approach circumvents this challenge by training on the in-vivo data itself, offering improved parameter distribution matching. When it comes to re-using the trained network on new unseen subjects, one drawback of this approach lies in its reliance on previously represented proton exchange parameters. Dictionary-based approaches, on the other hand, have the flexibility for representing the expected abnormality values (if they are known) or simply using a very broad parameter distribution that covers both the healthy and diseased states. A future patient cohort investigation is needed to examine the clinical utility of transferring the self-supervised quantification approach, when trained on healthy volunteers, for quantification in unseen pathology (e.g., small lesions).
As shown in Table 1, the most time-consuming steps for supervised dictionary-based learning are the dictionary generation step followed by neural network training. However, if the imaging scenario is a priori known (e.g., brain cancer treatment monitoring) and the acquisition protocol parameters are fixed and optimized, these steps can be done once without affecting the rapid inference time for each new subject. Self-supervised data-based learning (NBMF), on the other hand, offers the flexibility of accommodating various imaging scenarios and is more easily adapted for new acquisition protocols and research directions. That being said, the biophysical modeling developed as part of this work (GPU-based JAX implementation of the Bloch McConnell numerical solution) can also accelerate both dictionary generation and supervised learning (Table 2), allowing the user to utilize and compare all different approaches.
The gradient of the forward model can be directly used for simple fitting of the unknown ODE coefficients corresponding to the physical parameters of interest (see voxelwise BMF in the “Methods” section, Supplementary Note 3, and Supplementary Fig. 6). However, when the core forward model automatic differentiator is also integrated into a self-supervised learning pipeline (NBMF, Fig. 1), a joint neural representation of the signal-to-parameter transformation can be trained and stored with little extra computational cost. This enables: (i) Faster convergence, which scales well with the number of voxels up to the full brain size, leveraging redundancy towards a spatially smoother solution (Supplementary Fig. 6). (ii) Later reuse for real-time inference on new subjects within a similar imaging scenario.
The human brain imaging results (Figs. 3–5, Supplementary Figs. 3–6, 8) reveal the potential for using an autodiff-compatible Bloch-McConnell solver for parameter quantification while training a simple multilayer perceptron (MLP) voxelwise. Combined with a 3D whole brain acquisition routine (which rapidly generates hundreds of thousands of voxels), the suggested system provides a rapid and efficient single-subject learning method. Notably, while this work presented a proof of concept for rapid inference by a network trained on a single subject, robustness and consistency of the transfer leaves a clear room for improvement (Fig. 5 and Supplementary Fig. 4). Future work could study different NN architectures with spatial awareness (via convolutional or attention layers), as well as larger datasets composed of multiple subjects and a combination of both dictionary-based synthetic data and real-world scans. Subsequent efforts could also improve the modeling accuracy by taking under consideration the contributions of additional proton pools (such as amine and guanidinium) to the 3.5 ppm signal.
The proposed approach could be further exploited for other tasks across the CEST-MRF pipeline, such as accelerated dictionary synthesis49,50(as demonstrated in Supplementary Note 2 and Table 2) and pulsed-wave irradiation compatible CEST protocol discovery and optimization46,47,51,52,53. Furthermore, NBMF is directly applicable to anatomical-MRF (proton density, T1, T2) dictionary-free parameter quantification and conversely, to classical non-MRF molecular MRI, such as pulsed multi-B1 Z-spectra fitting25,54 (see Supplementary Notes 4, 5). While auto-differentiation of the Bloch equations has previously been leveraged for several MRI-related applications47,52,55,56,57, to the best of our knowledge this is the first report of utilizing this concept for a generalized Bloch-McConnell-fitting task. Beyond molecular MRI and MRF, this approach can also be applied to any other diagnostic and biophysical domains that involve dictionary-matching58.
Learning to estimate ordinary differential equation (ODE) coefficients from observed data
The general strategy underlying NBMF can be applied to any inverse problem that involves fitting ODEs to observations of a dynamical system. In the biomedical realm alone, this includes cardiovascular59,60, pharmacokinetic61,62, and epidemiological63,64 modeling, among many other tasks. In parallel to the exponential growth and improvement in AI performance, the last decade has witnessed a surge of interest in harnessing DL for physics-based problem solving. These efforts have most often been directed into two routes: (i) seeking a solution to a partial differential equation (PDE) as an output of a physics-informed neural network (PINN) that operates on spatial and temporal coordinates65,66,67,68,69; widely applied for spatially-resolved dynamics in solid70- and fluid71 mechanics, heat transfer72, power systems73, weather/climate74, and diffusion MRI75. (ii) Modeling parts of the equation with a NN, yielding a neural ODE/PDE76, often employed as a semi-parametric approach for model discovery77,78,79. Interestingly, the relatively simple direct inverse solution approach described here (Fig. 1), whereby a NN is trained to infer the coefficients of an ODE model from a few samples of the dynamics, has not received similar attention. This could open opportunities for the current work to inform new approaches to ODE-driven inverse problems across a multitude of tasks.
Conclusions
The NBMF framework enables rapid, dictionary-free, pulsed-saturation and multiple proton-pool-compatible semisolid MT/CEST-MRF quantification. By combining a GPU-accelerated auto-differentiable numerical ODE solver and self-supervised DL, the NBMF pipeline is able to match alternative AI-reconstruction based inference, while removing the need for dictionary synthesis. NBMF is three orders of magnitude faster than traditional Bloch fitting, and provides a one-stop-shop for reconstructing quantitative molecular MRI data. The underlying approach has potential applications across a wide variety of ODE-driven inverse problem tasks.
Methods
CEST phantoms
A set of six 10 ml L-arginine (L-arg, chemical shift = 3 ppm, Sigma-Aldrich) phantoms was prepared at a concentration of 25, 50, 75, or 100 mM. The phantoms were titrated to different pH levels between 4.0 and 5.0 and placed in a 120 mm diameter cylindrical holder (MultiSample 120E, Gold Standard Phantoms, UK), filled with saline.
Human subjects
Four healthy volunteers (three females/one male, with average age 23.75 ± 0.83) were scanned at Tel Aviv University (TAU), using a 3T MRI equipped with a 64-channel coil (Prisma, Siemens Healthineers). The research protocol was approved by the TAU Institutional Ethics Board (study no. 2640007572-2) and the Chaim Sheba Medical Center Ethics Committee (0621-23-SMC). All subjects gave written, informed consent before the study.
MRI acquisition
All acquisition schedules were implemented using the Pulseq prototyping framework80 and the open-source Pulseq-CEST sequence standard81. The MRF acquisition protocols were implemented as described previously12,43, with an unsaturated M0 image added at the beginning of each sequence. A spin lock saturation train (13 × 100 ms, 50% duty-cycle) was used for each one of the 30 additional iterations of the sequence, which varied the saturation pulse power between 0 and 4 μT (average pulse amplitude). The saturation pulse frequency offset was fixed at 3 ppm for L-arginine phantom imaging44, 3.5 ppm for amide brain imaging12, or varied between 6 and 14 ppm for semisolid MT imaging43. The saturation block was followed by a 3D centric reordered EPI readout module82,83, providing a 1.8 mm isotropic resolution. The in-plane axial matrix size was 116 × 88, with 50 slices (169K–194K brain voxels) used per subject. The full sequences can be accurately reproduced using previously published Pulseq (.seq) files49. Each 3D MRF acquisition took 2:36 (min:s). The same readout module was used for acquiring additional B0, B1, T1, and T2 maps, via WASABI84, saturation recovery, and multi-echo sequences, respectively. The total scan time per subject was 9 min. The WASABI sequence used a preparation scheme realized by a rectangular pulse of 5 ms and nominal B1 = 3.7 μT. Twenty-four frequency offsets were equally spaced between −1.8 ppm and 1.8 ppm with a recovery time of 4.5 s. An M0 image was taken at -300 ppm with a recovery time of 12 s. The saturation recovery T1 mapping protocol used the following TR (s) values: 10, 6, 4, 3, 2, 1, 0.8, 0.5, 0.4, 0.3, 0.2, 0.1. The T2 mapping multi-echo sequence used the following echo times (s): 0, 0.01, 0.025, 0.03, 0.04 0.05, 0.1, 0.2, 0.3, 0.5, 1.0 with a TR = 10 s.
MRI data pre-processing
In vitro images with no L-arginine vials, partial vials, or severe image artifacts were removed. Regions of interest (ROIs) were defined using circular masks. In-vivo brain images were motion-corrected and registered using elastix85. WM/GM ROI segmentation of the T1 map was performed using statistical parameter mapping86. Quantitative reference CEST-MRF values (Fig. 2) were obtained using dot-product matching, as extensively described previously44,49.
NBMF architecture for semisolid-MT and CEST quantification
The self-supervised learning framework comprises two main components (Fig. 1, Top):
(A) Reconstructor \({{{\mathcal{R}}}}\) - a fully-connected multi-layer perceptron (MLP) NN, applied voxel-wise on the raw input data10,12,13,87. The NN is composed of three layers, with 256 neurons and ReLU activation in each hidden layer. The output layer consists of 5 neurons, encoding the estimates for the proton volume fraction and exchange rate of the compound of interest and their joint uncertainty expressed as noise covariance. It utilizes a sigmoid activation, with the output scaled to a predefined range of the parameter values, which effectively defines the prediction boundaries as follows: semisolid proton volume fraction fss ∈ [0, 30] (%), semisolid proton exchange rate kssw ∈ [0, 150] (s−1), amide proton volume fraction fs ∈ [0, 1.2] (%), amide proton exchange rate ksw ∈ [0, 400] (s−1), L-arginine concentration [L-arg] ∈ [10, 120] (mM) and Nα-amine (of L-arginine) proton exchange rate ksw ∈ [100, 1400] (s−1)44. Several auxiliary maps X, including water relaxation T1, T2, and B0/B1 inhomogeneities, are appended to the MRF raw data D to be used as inputs for the tissue parameter estimation: \(\tilde{{{{\bf{P}}}}}={{{\mathcal{R}}}}\left(({{{\bf{D}}}},{{{\bf{X}}}}),w\right)\) where w are the weights to be trained.
(B) Simulator \({{{\mathcal{F}}}}\)—a differentiable multi-pool spin physics solver. A numerical simulation of the piecewise-constant coefficient Bloch-McConnell (BM) differential equations was implemented in the open-source JAX38 computing framework, leveraging its strong auto-differentiation and GPU-acceleration capabilities for matrix operations. The simulator concatenated and chained the calculations of the BM closed-form solution across all pulses and delays of the protocol. This was carried out by inversion and exponentiation of the 9 × 9 BM-matrix A, which expresses all precession, saturation, relaxation and exchange terms of the multi-pool magnetization vector (M) dynamics, as previously defined33:
This solver is compatible with the rectangular pulse-train shape employed in this study and others12,43,81, while arbitrary pulse shapes can be supported using a simple matched-RMS approximation, or to any order through a Magnus expansion88. For two-pool imaging cases (such as semisolid MT data acquired using saturation pulses with a frequency offset higher than 6 ppm), additional acceleration was obtained by implementing the interleaved saturation-relaxation (ISAR2) approximate analytical solution33 of the saturation stage. The RF pulses of spin-lock and readout flips were approximated as hard pulses generating precise flip angle rotations.
The model is designed to represent the whole sequence by simulating the Z-magnetization dynamics during the recovery, saturation, and readout stages, provided that spoilers are applied. For each of the two (semisolid MT/amide) sequences, the Nx31 non-steady-state MRF measurements from 169K-194K brain voxels were normalized using an unsaturated M0 reference image. Thus, the resulting acquired data \({{{\bf{D}}}}={\{{D}_{n}\}}_{n = 1}^{N}\in [0,1]\) is directly related to the magnetization vector governed by Eq. (1), at the end of the saturation pulse block. Therefore, given \(\tilde{{{{\bf{P}}}}}\), an estimate of the sought parameters, the simulator provides a re-synthesis of the data as: \(\tilde{{{{\bf{D}}}}}={{{\mathcal{F}}}}(\tilde{{{{\bf{P}}}}},{{{\bf{X}}}},{\omega }_{rf},{B}_{1})\), where ωrf and B1 are the saturation pulse frequency offsets and powers implemented in the MRF protocol, and X are any known tissue parameters.
The NBMF reconstruction of the semisolid-MT proton exchange parameters from the first (1) subject, was obtained by using the MT-MRF data \({{{{\bf{D}}}}}_{ss}^{(1)}\) alongside independently quantified auxiliary parameter maps \({{{{\bf{X}}}}}_{w,B}^{(1)}=\{{T}_{1w},{T}_{2w},{B}_{1},{B}_{0}\}\), for training the weights \({w}_{ss}^{(1)}\) of a neural reconstructor \({{{{\mathcal{R}}}}}_{MT}^{(1)}\), designed to quantify the associated proton exchange parameters (\({\tilde{{{{\bf{P}}}}}}_{ss}^{(1)}={f}_{ss},{k}_{ssw}\)). To that end, the NBMF optimizes the following self-supervised objective of consistency with the biophysical model \({{{\mathcal{F}}}}\):
The L2 norm was used as the regression loss. A cosine-decay learning rate schedule and simple early-stopping upon convergence (loss trend reaching plateau) were applied. Augmentation by noise was applied, twice: (a) Adding a ±0.1% Gaussian noise to the raw samples (b) Adding a Gaussian noise to the \(\tilde{f},\tilde{k}\) tissue parameters estimate, using covariance derived from extra outputs of the NN, inspired by a recent work89.
This process was repeated using the non-steady-state amide raw MRF data \({{{{\bf{D}}}}}_{s}^{(1)}\) for NBMF quantification of the amide proton exchange parameters (Ps = fs, ksw). For human brain experiments, we also appended the semisolid MT pool parameter estimates \({\widetilde{f}}_{ss}\) and \({\tilde{k}}_{ssw}\) (obtained from the semisolid MT NBMF procedure) to the auxiliary vector X. This vector served as input for the amide reconstructor \({{{{\mathcal{R}}}}}_{s}\) and the 3-pool biophysical model \({{{{\mathcal{F}}}}}_{s}\), so that: \({X}_{B,w,ss}=\{{X}_{B,w},{P}_{ss}\}=\{{T}_{1w},{T}_{2w},{B}_{1},{B}_{0},{\tilde{f}}_{ss},{\tilde{k}}_{ssw}\}\)12. For the two-pool L-arginine phantom experiments, the auxiliary parameters were assigned constant values based on previous reports (T1w = 2800 ms, T2w = 1200 ms)43.
Importantly, we obtain both the subject-specific proton exchange parameters \({\tilde{{{{\bf{P}}}}}}^{(1)}={{{{\mathcal{R}}}}}^{(1)}({{{{\bf{D}}}}}^{(1)},{{{{\bf{X}}}}}^{(1)})\) and the trained reconstructor \({{{\mathcal{R}}}}\) at the convergence of the NBMF. This enables ultra-fast quantification of the proton exchange parameters \({\tilde{{{{\bf{P}}}}}}^{(2)}={{{{\mathcal{R}}}}}^{(1)}({{{{\bf{D}}}}}^{(2)},{{{{\bf{X}}}}}^{(2)})\) from a new subject (2) (Fig. 1 bottom). Notably, this rapid inference is only applicable for new data drawn from the same distribution and cannot be applied to entirely new systems (such as muscle creatine quantification using brain-data trained NBMF).
As a natural ablation of the system by removing the neural component, the auto-diff simulator can be used for direct voxelwise parameter fitting: \({\tilde{{{{\bf{P}}}}}}^{(1)}=argmin| | {{{\mathcal{F}}}}\left({{{{\bf{P}}}}}^{(1)}\right)-{{{{\bf{D}}}}}^{(1)}| |\), referred to here as voxelwise Bloch-McConnell fitting (VBMF). This simpler process can be described in the context of Fig. 1 as stopping the gradients at the tissue parameters, which now assume the role of independent per-voxel variables. Apart from the obvious drawback of not yielding a neural reconstructor, VBMF’s performance is inferior to NBMF for brain imaging (Supplementary Fig. 6), which we ascribe to the implicit smoothing regularization by the neural network. However, it is a viable direct method for in vitro analysis that is equally able to converge to the minimum of the modeling-error landscape (Supplementary Figs. 1, 2).
Finally, additional acceleration was achieved by parallelization of the computational graph across consecutive readout pairs {Dn−1, Dn}, decoupling the single-iteration simulators \({{{{\mathcal{F}}}}}_{n}\). Assuming that the Dn−1 snapshot captures the preceding spin history evolution, the re-synthesis stage is now formulated as \(\tilde{{{{\bf{D}}}}}={\{{{{{\mathcal{F}}}}}_{n}(\widetilde{{{{\bf{P}}}}},{D}_{n-1})\}}_{n = 1}^{N}\) and embedded in Eq. (2) as such. See Supplementary Note 4 for further elaboration.
Statistical analysis
The SSIM and ICC(2,1) were calculated using the open-source SciPy and Pingouin scientific computing libraries for Python. In slice-statistic box plots (Fig. 5e), the central horizontal lines represent median values, box size represents the two central (2nd, 3rd) quartiles, whiskers represent 1.5× the interquartile range above and below the upper and lower quartiles, and circles represent outliers. In the voxel-statistic box plots (Figs. 3, 4) the central horizontal lines represent median values, box size represents the two central (2nd, 3rd) quartiles, the whiskers represent the 90 central percentiles and outliers are omitted. Numerical results in the text are presented as mean ±SD.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The data for the phantom experiment and sample human brain datasets (a single slice for each of the n = 4 subjects) are available at https://github.com/momentum-laboratory/neural-fitting and https://doi.org/10.5281/zenodo.15021550. The complete 3D human data cannot be shared due to subject confidentiality and privacy.
Code availability
The code used in this work is available at https://github.com/momentum-laboratory/neural-fitting and https://zenodo.org/records/15021550.
References
Bernstein, M. A., King, K. F. & Zhou, X. J. Handbook of MRI Pulse Sequences (Elsevier, 2004).
Weiskopf, N., Edwards, L. J., Helms, G., Mohammadi, S. & Kirilina, E. Quantitative magnetic resonance imaging of brain anatomy and in vivo histology https://doi.org/10.1038/s42254-021-00326-1 (2021).
Radunsky, D. et al. A comprehensive protocol for quantitative magnetic resonance imaging of the brain at 3 tesla. PloS ONE 19, e0297244 (2024).
Taylor, A. J., Salerno, M., Dharmakumar, R. & Jerosch-Herold, M. T1 mapping: basic techniques and clinical applications. JACC Cardiovasc. Imaging 9, 67–81 (2016).
Ma, D. et al. Magnetic resonance fingerprinting. Nature 495, 187–192 (2013).
Panda, A. et al. Magnetic resonance fingerprinting—an overview. Curr. Opin. Biomed. Eng. 3, 56–66 (2017).
Heo, H.-Y. et al. Quantifying amide proton exchange rate and concentration in chemical exchange saturation transfer imaging of the human brain. Neuroimage 189, 202–213 (2019).
Poorman, M. E. et al. Magnetic resonance fingerprinting part 1: potential uses, current challenges, and recommendations. J. Magn. Reson. Imaging 51, 675–692 (2020).
Hsieh, J. J. & Svalbe, I. Magnetic resonance fingerprinting: from evolution to clinical applications. J. Med. Radiat. Sci. 67, 333–344 (2020).
Cohen, O., Zhu, B. & Rosen, M. S. Mr fingerprinting deep reconstruction network (drone). Magn. Reson. Med. 80, https://doi.org/10.1002/mrm.27198 (2018).
Kim, B., Schär, M., Park, H. W. & Heo, H. Y. A deep learning approach for magnetization transfer contrast MR fingerprinting and chemical exchange saturation transfer imaging. NeuroImage 221, https://doi.org/10.1016/j.neuroimage.2020.117165 (2020).
Perlman, O. et al. Quantitative imaging of apoptosis following oncolytic virotherapy by magnetic resonance fingerprinting aided by deep learning. Nat. Biomed. Eng. 6, https://doi.org/10.1038/s41551-021-00809-7 (2022).
Cohen, O. et al. Cest MR fingerprinting (CEST-MRF) for brain tumor quantification using EPI readout and deep learning reconstruction. Magn. Reson. Med. 89, https://doi.org/10.1002/mrm.29448 (2023).
Perlman, O. et al. Cest mr-fingerprinting: practical considerations and insights for acquisition schedule design and improved reconstruction. Magn. Reson. Med. 83, 462–478 (2020).
Jones, K. M., Pollard, A. C. & Pagel, M. D. Clinical applications of chemical exchange saturation transfer (CEST) MRI. J. Magn. Reson. Imaging 47, https://doi.org/10.1002/jmri.25838 (2018).
Zhou, J., Heo, H. Y., Knutsson, L., van Zijl, P. C. & Jiang, S. Apt-weighted MRI: techniques, current neuro applications, and challenging issues. https://doi.org/10.1002/jmri.26645 (2019).
Vinogradov, E., Keupp, J., Dimitrov, I. E., Seiler, S. & Pedrosa, I. CEST-MRI for body oncologic imaging: are we there yet? NMR Biomed. 36, e4906 (2023).
Bricco, A. R. et al. A genetic programming approach to engineering MRI reporter genes. ACS Synth. Biol. 12, 1154–1163 (2023).
Vladimirov, N. & Perlman, O. Molecular MRI-based monitoring of cancer immunotherapy treatment response. Int. J. Mol. Sci. 24, 3151 (2023).
Wang, K. et al. Creatine mapping of the brain at 3T by CEST MRI. Magn. Reson. Med. 91, 51–60 (2024).
Rivlin, M., Perlman, O. & Navon, G. Metabolic brain imaging with glucosamine CEST MRI: in vivo characterization and first insights. Sci. Rep. 13, 22030 (2023).
Zhou, J. et al. Review and consensus recommendations on clinical apt-weighted imaging approaches at 3T: application to brain tumors. Magn. Reson. Med. 88, https://doi.org/10.1002/mrm.29241 (2022).
Wu, B. et al. An overview of CEST MRI for non-mr physicists. https://doi.org/10.1186/s40658-016-0155-2 (2016).
Ji, Y., Zhou, I. Y., Qiu, B. & Sun, P. Z. Progress toward quantitative in vivo chemical exchange saturation transfer (CEST) MRI. Isr. J. Chem. 57, 809–824 (2017).
Zaiss, M. et al. Quesp and quest revisited—fast and accurate quantitative CEST experiments. Magn. Reson. Med. 79, https://doi.org/10.1002/mrm.26813 (2018).
Meissner, J.-E. et al. Quantitative pulsed CEST-MRI using ω-plots. NMR Biomed. 28, 1196–1208 (2015).
Woessner, D. E., Zhang, S., Merritt, M. E. & Sherry, A. D. Numerical solution of the Bloch equations provides insights into the optimum design of PARACEST agents for MRI. Magn. Reson. Med. 53, 790–799 (2005).
Perlman, O., Farrar, C. T. & Heo, H. Y. MR fingerprinting for semisolid magnetization transfer and chemical exchange saturation transfer quantification. NMR Biomed. 36, https://doi.org/10.1002/nbm.4710 (2023).
McMahon, M. T. et al. Quantifying exchange rates in chemical exchange saturation transfer agents using the saturation time and saturation power dependencies of the magnetization transfer effect on the magnetic resonance imaging signal (QUEST and QUESP): ph calibration for poly-l-lysine and a starburst dendrimer. Magn. Reson. Med. 55, 836–847 (2006).
Meissner, J. E. et al. Quantitative pulsed CEST-MRI using omega-plots. NMR Biomed. 28, https://doi.org/10.1002/nbm.3362 (2015).
Wu, R., Xiao, G., Zhou, I. Y., Ran, C. & Sun, P. Z. Quantitative chemical exchange saturation transfer (qCEST) MRI—omega plot analysis of RF-spillover-corrected inverse CEST ratio asymmetry for simultaneous determination of labile proton ratio and exchange rate. NMR Biomed. 28, https://doi.org/10.1002/nbm.3257 (2015).
Zaiss, M. et al. A combined analytical solution for chemical exchange saturation transfer and semi-solid magnetization transfer. NMR Biomed. 28, https://doi.org/10.1002/nbm.3237 (2015).
Roeloffs, V., Meyer, C., Bachert, P. & Zaiss, M. Towards quantification of pulsed spinlock and CEST at clinical MR scanners: an analytical interleaved saturation-relaxation (ISAR) approach. https://doi.org/10.1002/mrm.24560. NMR Biomed. 28, https://doi.org/10.1002/nbm.3192 (2015).
Cohen, O., Huang, S., McMahon, M. T., Rosen, M. S. & Farrar, C. T. Rapid and quantitative chemical exchange saturation transfer (CEST) imaging with magnetic resonance fingerprinting (MRF). Magn. Reson. Med. 80, https://doi.org/10.1002/mrm.27221 (2018).
Zhou, Z. et al. Chemical exchange saturation transfer fingerprinting for exchange rate quantification. Magn. Reson. Med. 80, https://doi.org/10.1002/mrm.27363 (2018).
Heo, H.-Y. et al. Unraveling contributions to the z-spectrum signal at 3.5 ppm of human brain tumors. Magn. Reson. Med. 28, 376–383 (2024).
Cohen, O. et al. Cest MR fingerprinting (CEST-MRF) for brain tumor quantification using EPI readout and deep learning reconstruction. Magn. Reson. Med. 89, 233–249 (2023).
Bradbury, J. et al. JAX: composable transformations of Python+NumPy programs. GitHub. http://github.com/google/jax (2018).
Sun, Y. et al. Test-time training with self-supervision for generalization under distribution shifts. In Proc. 37th International Conference on Machine Learning, ICML 2020, vol. PartF168147-12 (PMLR, 2020).
Shocher, A., Cohen, N. & Irani, M. Zero-shot super-resolution using deep internal learning. In Proc. IEEE Computer Society Conference on Computer Vision and Pattern Recognition. https://doi.org/10.1109/CVPR.2018.00329 (2018).
Ulyanov, D., Vedaldi, A. & Lempitsky, V. Deep image prior. Int. J. Comput. Vis. 128, https://doi.org/10.1007/s11263-020-01303-4 (2020).
Yaman, B., Hosseini, S. A. H. & Akçakaya, M. Zero-shot self-supervised learning for MRI reconstruction. In ICLR 2022—10th International Conference on Learning Representations (ICLR, 2022).
Weigand-Whittier, J. et al. Accelerated and quantitative three-dimensional molecular MRI using a generative adversarial network. Magn. Reson. Med. 89, https://doi.org/10.1002/mrm.29574 (2023).
Cohen, O., Huang, S., McMahon, M. T., Rosen, M. S. & Farrar, C. T. Rapid and quantitative chemical exchange saturation transfer (CEST) imaging with magnetic resonance fingerprinting (MRF). Magn. Reson. Med. 80, 2449–2463 (2018).
Heo, H. Y. et al. Quantifying amide proton exchange rate and concentration in chemical exchange saturation transfer imaging of the human brain. NeuroImage 189, https://doi.org/10.1016/j.neuroimage.2019.01.034 (2019).
Zhu, B., Liu, J., Koonjoo, N., Rosen, B. R. & Rosen, M. S. AUTOmated pulse SEQuence generation (AUTOSEQ) for MR spatial encoding in unknown inhomogeneous B0 fields. In Proc. International Society for Magnetic Resonance in Medicine Vol. 5489 (2019).
Perlman, O., Zhu, B., Zaiss, M., Rosen, M. S. & Farrar, C. T. An end-to-end ai-based framework for automated discovery of rapid CEST/MT MRI acquisition protocols and molecular parameter quantification (AUTOCEST). Magn. Reson. Med. 87, https://doi.org/10.1002/mrm.29173 (2022).
Kang, B., Kim, B., Schär, M., Park, H. W. & Heo, H. Y. Unsupervised learning for magnetization transfer contrast MR fingerprinting: application to CEST and nuclear overhauser enhancement imaging. Magn. Reson. Med. 85, https://doi.org/10.1002/mrm.28573 (2021).
Vladimirov, N. et al. Quantitative molecular imaging using deep magnetic resonance fingerprinting. Nat. Protoc. https://doi.org/10.1038/s41596-025-01152-w (2025).
Nagar, D., Vladimirov, N., Farrar, C. T. & Perlman, O. Dynamic and rapid deep synthesis of chemical exchange saturation transfer and semisolid magnetization transfer mri signals. Sci. Rep. 13, 18291 (2023).
Loktyushin, A. et al. MRzero—automated discovery of MRI sequences using supervised learning. Magn. Reson. Med. 86, https://doi.org/10.1002/mrm.28727 (2021).
Kang, B., Kim, B., Park, H. & Heo, H.-Y. Learning-based optimization of acquisition schedule for magnetization transfer contrast mr fingerprinting. NMR Biomed. 35, e4662 (2022).
Glang, F. et al. MR-double-zero—proof-of-concept for a framework to autonomously discover MRI contrasts. J. Magn. Reson. 341, https://doi.org/10.1016/j.jmr.2022.107237 (2022).
Zaiss, M., Schmitt, B. & Bachert, P. Quantitative separation of CEST effect from magnetization transfer and spillover effects by lorentzian-line-fit analysis of z-spectra. J. Magn. Reson. 211, https://doi.org/10.1016/j.jmr.2011.05.001 (2011).
Zhu, B., Liu, J., Koonjoo, N., Rosen, B. R. & Rosen, M. S. AUTOmated pulse SEQuence generation (AUTOSEQ) using Bayesian reinforcement learning in an MRI physics simulation environment. In Proc. Joint Annual Meeting ISMRM-ESMRMB (2018).
Lee, P. K., Watkins, L. E., Anderson, T. I., Buonincontri, G. & Hargreaves, B. A. Flexible and efficient optimization of quantitative sequences using automatic differentiation of Bloch simulations. Magn. Reson. Med. 82, 1438–1451 (2019).
Luo, T., Noll, D. C., Fessler, J. A. & Nielsen, J. F. Joint design of rf and gradient waveforms via auto-differentiation for 3D tailored excitation in MRI. IEEE Trans. Med. Imaging 40, https://doi.org/10.1109/TMI.2021.3083104 (2021).
Ghodasara, S. et al. Quantifying perfusion properties with DCE-MRI using a dictionary matching approach. Sci. Rep. 10. https://doi.org/10.1038/s41598-020-66985-9 (2020).
Zenker, S., Rubin, J. & Clermont, G. From inverse problems in mathematical physiology to quantitative differential diagnoses. PLoS Comput. Biol. 3, e204 (2007).
Linial, O., Ravid, N., Eytan, D. & Shalit, U. Generative ode modeling with known unknowns. In Proc. Conference on Health, Inference, and Learning 79–94 (2021).
Donnet, S. & Samson, A. A review on estimation of stochastic differential equations for pharmacokinetic/pharmacodynamic models. Adv. drug Deliv. Rev. 65, 929–939 (2013).
Chou, W.-C. & Lin, Z. Machine learning and artificial intelligence in physiologically based pharmacokinetic modeling. Toxicol. Sci. 191, 1–14 (2023).
Gumel, A. B., Iboi, E. A., Ngonghala, C. N. & Elbasha, E. H. A primer on using mathematics to understand COVID-19 dynamics: modeling, analysis and simulations. Infect. Dis. Model. 6, 148–168 (2021).
Keeling, M. J. et al. Fitting to the UK COVID-19 outbreak, short-term forecasts and estimating the reproductive number. Stat. Methods Med. Res. 31, 1716–1737 (2022).
Raissi, M., Perdikaris, P. & Karniadakis, G. E. Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 378, https://doi.org/10.1016/j.jcp.2018.10.045 (2019).
Karniadakis, G. E. et al. Physics-informed machine learning. Nat. Rev. Phys. 3, 422–440 (2021).
Cuomo, S. et al. Scientific machine learning through physics-informed neural networks: where we are and what’s next. J. Sci. Comput. 92, https://doi.org/10.1007/s10915-022-01939-z (2022).
Rajput, J. R. et al. Physics-informed conditional autoencoder approach for robust metabolic CEST MRI at 7T. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Vol. 14227 LNCS https://doi.org/10.1007/978-3-031-43993-3-44 (2023).
Kissas, G. et al. Machine learning in cardiovascular flows modeling: predicting arterial blood pressure from non-invasive 4D flow MRI data using physics-informed neural networks. Comput. Methods Appl. Mech. Eng. 358, https://doi.org/10.1016/j.cma.2019.112623 (2020).
Haghighat, E., Raissi, M., Moure, A., Gomez, H. & Juanes, R. A physics-informed deep learning framework for inversion and surrogate modeling in solid mechanics. Comput. Methods Appl. Mech. Eng. 379, 113741 (2021).
Cai, S., Mao, Z., Wang, Z., Yin, M. & Karniadakis, G. E. Physics-informed neural networks (PINNS) for fluid mechanics: a review. Acta Mech. Sin. 37, 1727–1738 (2021).
Cai, S., Wang, Z., Wang, S., Perdikaris, P. & Karniadakis, G. E. Physics-informed neural networks for heat transfer problems. J. Heat Transf. 143, https://doi.org/10.1115/1.4050542 (2021).
Huang, B. & Wang, J. Applications of physics-informed neural networks in power systems—a review. IEEE Trans. Power Syst. 38, https://doi.org/10.1109/TPWRS.2022.3162473 (2023).
Kashinath, K. et al. Physics-informed machine learning: case studies for weather and climate modelling. Philos. Trans. R. Soc. A 379, 20200093 (2021).
Zapf, B. et al. Investigating molecular transport in the human brain from MRI with physics-informed neural networks. Sci. Rep. 12, https://doi.org/10.1038/s41598-022-19157-w (2022).
Chen, R. T., Rubanova, Y., Bettencourt, J. & Duvenaud, D. K. Neural ordinary differential equations. In Advances in Neural Information Processing Systems 31, (2018).
Bradley, W. & Boukouvala, F. Two-stage approach to parameter estimation of differential equations using neural odes. Ind. Eng. Chem. Res. 60, 16330–16344 (2021).
Lai, Z., Mylonas, C., Nagarajaiah, S. & Chatzi, E. Structural identification with physics-informed neural ordinary differential equations. J. Sound Vib. 508, https://doi.org/10.1016/j.jsv.2021.116196 (2021).
Rudy, S. H., Brunton, S. L., Proctor, J. L. & Kutz, J. N. Data-driven discovery of partial differential equations. Sci. Adv. 3, https://doi.org/10.1126/sciadv.1602614 (2017).
Layton, K. J. et al. Pulseq: a rapid and hardware-independent pulse sequence prototyping framework. Magn. Reson. Med. 77, 1544–1552 (2017).
Herz, K. et al. Pulseq-CEST: towards multi-site multi-vendor compatibility and reproducibility of CEST experiments using an open-source sequence standard. Magn. Reson. Med. 86, https://doi.org/10.1002/mrm.28825 (2021).
Mueller, S. et al. Whole brain snapshot CEST at 3T using 3D-EPI: aiming for speed, volume, and homogeneity. Magn. Reson. Med. 84, 2469–2483 (2020).
Akbey, S., Ehses, P., Stirnberg, R., Zaiss, M. & Stöcker, T. Whole-brain snapshot CEST imaging at 7T using 3D-EPI. Magn. Reson. Med. 82, 1741–1752 (2019).
Schuenke, P. et al. Simultaneous mapping of water shift and B1 (WASABI)-application to field-inhomogeneity correction of CEST MRI data. Magn. Reson. Med. 77, 571–580 (2017).
Klein, S., Staring, M., Murphy, K., Viergever, M. A. & Pluim, J. P. Elastix: a toolbox for intensity-based medical image registration. IEEE Trans. Med. Imaging 29, https://doi.org/10.1109/TMI.2009.2035616 (2010).
Ashburner, J. & Friston, K. J. Unified segmentation. NeuroImage 26, https://doi.org/10.1016/j.neuroimage.2005.02.018 (2005).
Power, I. et al. In vivo mapping of the chemical exchange relayed nuclear overhauser effect using deep magnetic resonance fingerprinting. iScience 27, 111209 (2024).
Blanes, S., Casas, F., Oteo, J. A. & Ros, J. The Magnus expansion and some of its applications https://doi.org/10.1016/j.physrep.2008.11.001 (2009).
Glang, F. et al. Deepcest 3T: Robust MRI parameter determination and uncertainty quantification with neural networks-application to cest imaging of the human brain at 3T. Magn. Reson. Med. 84, https://doi.org/10.1002/mrm.28117 (2020).
Stanisz, G. J. et al. T1, T2 relaxation and magnetization transfer in tissue at 3T. Magn. Reson. Med. 54, https://doi.org/10.1002/mrm.20605 (2005).
Liu, D. et al. Quantitative characterization of nuclear overhauser enhancement and amide proton transfer effects in the human brain at 7 tesla. Magn. Reson. Med. 70, https://doi.org/10.1002/mrm.24560 (2013).
Carradus. A. J., Bradley, J. M. P., Gowland, P. A. & Mougin, O. E. Measuring chemical exchange saturation transfer exchange rates in the human brain using a particle swarm optimisation algorithm. NMR in Biomedicine. 36, e5001 (2023).
Acknowledgements
The authors thank Tony Stöcker and Rüdiger Stirnberg for their help with the 3D EPI readout. The study was supported in part by MOONSHOT-MED, a joint grant program from Tel Aviv University and Clalit Innovation, the innovation arm of Clalit Health Services, and the Edmond J. Safra Center for Bioinformatics at Tel Aviv University. This work was supported by the Ministry of Innovation, Science and Technology, Israel, and a grant from the Blavatnik Artificial Intelligence and Data Science Fund, Tel Aviv University Center for AI and Data Science (TAD). This project was funded by the European Union (ERC, BabyMagnet, project no. 101115639). Views and opinions expressed are, however, those of the authors only and do not necessarily reflect those of the European Union or the European Research Council. Neither the European Union nor the granting authority can be held responsible for them.
Author information
Authors and Affiliations
Contributions
A.F. and O.P. conceived the computational framework with N.V. providing valuable input. M.Z., N.V., and O.P. developed and/or conducted the imaging protocols and data acquisition. A.F. was responsible for the AI design, optimization, and data analysis. A.F. wrote the manuscript. O.P. supervised the project. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Physics thanks Jianpan Huang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Finkelstein, A., Vladimirov, N., Zaiss, M. et al. Multi-parameter molecular MRI quantification using physics-informed self-supervised learning. Commun Phys 8, 164 (2025). https://doi.org/10.1038/s42005-025-02063-8
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s42005-025-02063-8







