Realtime optimization of multidimensional NMR spectroscopy on embedded sensing devices

Tang, Yiqiao; Song, Yi-Qiao

doi:10.1038/s41598-019-53929-1

Article
Open access
Published: 25 November 2019

Scientific Reports volume 9, Article number: 17486 (2019)

Abstract

The increasingly ubiquitous use of embedded devices calls for autonomous optimizations of sensor performance with meager computing resources. Due to the heavy computing needs, such optimization is rarely performed, and almost never carried out on-the-fly, resulting in a vast underutilization of deployed assets. Aiming at improving the measurement efficiency, we show an OED (Optimal Experimental Design) routine where quantities of interest of probable samples are partitioned into distinctive classes, with the corresponding sensor signals learned by supervised learning models. The trained models, digesting the compressed live data, are subsequently executed at the constrained device for continuous classification and optimization of measurements. We demonstrate the closed-loop method with multidimensional NMR (Nuclear Magnetic Resonance) relaxometry, an analytical technique seeing a substantial growth of field applications in recent years, on a wide range of complex fluids. The realtime portion of the procedure demands minimal computing load, and is ideally suited for instruments that are widely used in remote sensing and IoT networks.

Introduction

NMR, regarded as one of the most powerful analytical methods, has traditionally required dedicated personnel and delicate equipment because of its reliance on superconducting magnets, sizable electronics, and intricate probe and antenna placements¹. Only recently, owing to advances in permanent-magnet design², integrated electronics³,⁴, and antenna miniaturization⁵, have portable NMR systems⁶ emerged as a viable alternative. With their reduced footprint, maintenance needs and price tag, miniaturized sensor assemblies have extended beyond conventional NMR laboratories to point-of-care medical diagnostics⁷,⁸, subterranean exploration⁹, flow metering¹⁰, fluid authentication¹¹, and artefact preservation¹². In those “field” applications, it is highly desirable that the machinery operates autonomously and self-optimizes based on the properties of the samples under investigation.

The need to optimize NMR spectroscopy becomes more pressing when considering that the quantities of interest, such as relaxation times (T₁ and T₂), diffusion coefficient, J-coupling, and chemical shift, often span several orders of magnitude¹³. Consequently, a one-size-fits-all pulse sequence (PS) often does not exist. Applying an ill-suited pulse sequence can result in either a prolonged experiment time or a loss of measurement accuracy. Previous efforts on measurement optimization have focused on samples of simple composition (containing one or two fluid species) and/or 1D functional forms of forward models¹⁴,¹⁵,¹⁶.

Another challenge for autonomous optimization on embedded sensing devices stems from the limited computing infrastructure, where only microprocessors with CPU clock rates of tens of MHz and fast memories of tens of KB to a few MB are available¹⁷. These so-called “constrained devices” may connect to a gateway or cloud platform of much greater computing throughput, but the connection is often slow and intermittent¹⁸,¹⁹. In those scenarios, realtime optimizations need to be executed in their entirety at the resource-limited sensor nodes.

We wondered whether it would be possible to optimize multidimensional NMR relaxometry, which measures the NMR relaxation times²⁰ of complex fluids, in realtime on a mobile sensor generally regarded as too “dumb” to perform such tasks. Instead of optimizing one sequence for all probable samples¹⁵,¹⁶, we utilized a suite of sequences that were individually optimized for samples with distinctive ranges of relaxation properties.

More specifically, we used the inversion-recovery-CPMG (IRCPMG) pulse sequence for T₁ − T₂ correlation spectroscopy. The forward model that describes the signal evolution as a function of experimental parameters is²¹:

$$S(\tau_1,\tau_2)=\iint \left(1-\theta e^{-\tau_1/T_1}\right)e^{-\tau_2/T_2}\,f(T_1,T_2)\,dT_1\,dT_2+\epsilon, \tag{1}$$

where $\epsilon $ is the experimental noise, θ is a calibration coefficient of the instrument, τ₁ and τ₂ are prescribed parameters that the spectrometer traverses through, with S(τ₁, τ₂) the corresponding recorded signals. Inversion methods²² are applied to obtain the sample {T₁, T₂} correlation spectrum, f(T₁, T₂). As real-life samples often have broadly distributed f(T₁, T₂), we made no assumptions other than T₁ ≥ T₂²³ on the mathematical constructs of the spectra.
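As a rough illustration of Eq. 1, the sketch below evaluates the noiseless signal for a spectrum made of a few discrete components, so that the double integral reduces to a sum. The function name `ircpmg_signal` and the example component list are hypothetical; θ = 1.85 is the calibration value reported later in the text.

```python
import math

def ircpmg_signal(tau1, tau2, components, theta=1.85):
    """Noiseless IRCPMG signal of Eq. 1 for a discrete spectrum.

    components: iterable of (weight, T1, T2) tuples standing in for f(T1, T2).
    """
    return sum(
        w * (1.0 - theta * math.exp(-tau1 / t1)) * math.exp(-tau2 / t2)
        for w, t1, t2 in components
    )

# Hypothetical single-component sample with T1 = T2 = 2 s (water-like).
water = [(1.0, 2.0, 2.0)]
s_start = ircpmg_signal(0.0, 0.0, water)  # equals 1 - theta right after inversion
s_full = ircpmg_signal(1e3, 0.0, water)   # magnetization fully recovered
```

With τ₂ = 0 the signal rises from 1 − θ at τ₁ = 0 towards 1 as τ₁ grows, which is the inversion-recovery behaviour the T₁ dimension encodes.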

Three classes of fluids were considered in the sequence design. As shown in Fig. 1, class A contained components that were longer than 0.1 s in both T₁ and T₂ dimensions; class B embodied high T₁/T₂ ratios, where T₁ had components longer than 0.1 s while T₂ spanned [1 ms, 0.1 s]; and class C had relatively short T₁ and T₂ that each spanned [1 ms, 0.1 s]. Accordingly, Table 1 shows the optimal sequences for the respective fluid classes (i.e. sequence α for fluid A, β for B, and γ for C). In particular, both sequences α and β had τ₁ up to 10 s, capable of measuring T₁ up to 2 s, while sequence γ had the maximal τ₁ of 1 s that sufficed to measure T₁ up to 0.2 s. Meanwhile, sequence α had the maximal τ₂ = N_e × t_e of 10 s, capable of measuring T₂ up to 2 s, in contrast to sequences β and γ with the maximal τ₂ of 0.6 s. The shorter echo spacing, t_e, used in sequences β and γ than in sequence α could further help resolve fast T₂ components (Fig. S1).

Table 1 The three optimal pulse sequences.

Ideally, each sequence should always be applied to its intended samples; but in continuous measurements on samples of changing properties, occasions do arise in which a sequence is suboptimally applied. As sensors generally cannot foresee the temporal progression of sample properties, any combination of fluid class and pulse sequence can occur in practice. Here we show applications of each sequence to three exemplary fluids, namely dodecane (fluid A), emulsified fluid (fluid B), and glycerol (fluid C), in Fig. 2. In the 3 by 3 panels, the diagonal time-domain images were acquired by the respective optimal sequences, while the measurements corresponding to the off-diagonal ones were either inefficient or erroneous (Fig. S1). The key challenge was to discern the fluid class from live time-domain images, regardless of the sequence in use, and apply the intended one in subsequent runs. In practice, we used three trained ECOC (error-correcting output codes) learners²⁴, a class of supervised learning models, for the realtime multiclass classification and inference task.

Supervised learning requires large quantities of labeled data for model training. The training sets can consist of either prior measurements on samples of known properties or forward-modeled simulations. We opted for the latter approach thanks to the well-defined functional form of Eq. 1. Specifically, we approximated probable T₁ − T₂ distributions by a large ensemble of synthetic distributions, each consisting of three components of randomly generated $\{{\tilde{T}}_{1},{\tilde{T}}_{2}\}$ pairs. Since the measurement volume was constant and filled with fluids of similar proton density, we assigned each component a randomly generated weighting coefficient, $\tilde{\mu }$, with the three coefficients summing to unity. The time-domain data, S_T, for model training were generated as:

$$S_T(\tau_1,N_e,t_e)=\sum_{n=1}^{3}\tilde{\mu}_n\left(1-\theta e^{-\tau_1/\tilde{T}_{1,n}}\right)e^{-N_e\cdot t_e/\tilde{T}_{2,n}}+\epsilon_T, \tag{2}$$

where ${\sum }_{n=1}^{n=3}{\tilde{\mu }}_{n}=1$. After calibrating the experimental setup, we set θ to 1.85 and ${\epsilon }_{T}$ to a Gaussian noise with zero mean and 0.012 standard deviation.
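A minimal sketch of how one training example might be generated from Eq. 2 is given below. The values of θ and the noise standard deviation come from the text; the weight-normalization scheme, the function names, the relaxation-time pairs, and the small (τ₁, N_e, t_e) grid are assumptions chosen only for illustration and are not the sequences of Table 1.

```python
import math
import random

THETA = 1.85   # calibration coefficient reported in the text
SIGMA = 0.012  # noise standard deviation reported in the text

def random_weights(rng, n=3):
    """Draw n non-negative weights that sum to one (assumed scheme)."""
    raw = [rng.random() for _ in range(n)]
    total = sum(raw)
    return [r / total for r in raw]

def synthetic_signal(tau1_list, echo_counts, t_e, components, rng):
    """Return S_T on the (tau1, N_e) grid for one three-component sample."""
    rows = []
    for tau1 in tau1_list:
        row = []
        for n_e in echo_counts:
            clean = sum(
                mu * (1.0 - THETA * math.exp(-tau1 / t1)) * math.exp(-n_e * t_e / t2)
                for mu, t1, t2 in components
            )
            row.append(clean + rng.gauss(0.0, SIGMA))  # additive Gaussian noise
        rows.append(row)
    return rows

rng = random.Random(0)
weights = random_weights(rng)
# Hypothetical relaxation times in seconds; each pair satisfies T1 >= T2.
pairs = [(1.0, 0.5), (0.2, 0.05), (0.01, 0.005)]
components = [(mu, t1, t2) for mu, (t1, t2) in zip(weights, pairs)]
signal = synthetic_signal([0.001, 0.1, 10.0], [1, 10, 100], 0.001, components, rng)
```

Repeating this for many random component sets, once per pulse sequence, would yield one labeled training set per classifier.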

To generate $\{{\tilde{T}}_{1},{\tilde{T}}_{2}\}$ pairs, we stochastically sampled in a two-dimensional space, where each dimension consisted of 100 logarithmically distributed numbers from 1 ms to 2 s and all pairs satisfied the relation ${\tilde{T}}_{1}\ge {\tilde{T}}_{2}$. In total, 30,000 ${\tilde{T}}_{1}-{\tilde{T}}_{2}$ distributions were created. Subsequently, we sifted the sampled distributions, one by one, through a set of classification criteria, and labeled them accordingly (Fig. S3). A given distribution was labeled class A if the longest ${\tilde{T}}_{2}$ > 0.1 s and its associated weighting coefficient ≥0.05, labeled class B if the longest ${\tilde{T}}_{2}$ < 0.1 s, its associated ${\tilde{T}}_{1}$ > 0.15 s, and its associated weighting coefficient ≥0.05, and labeled C if the longest ${\tilde{T}}_{1}$ < 0.1 s. As a result, 11,663 were assigned to class A, 11,835 to B, and 1449 to C.
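The sampling grid and labeling rules above can be sketched as follows. The thresholds are those quoted in the text; the function names, the rejection-sampling step used to enforce T₁ ≥ T₂, and the choice to return `None` when no rule applies are assumptions, since the text does not state how such distributions were handled.

```python
import math
import random

def log_grid(lo=1e-3, hi=2.0, n=100):
    """n logarithmically spaced values from lo to hi (seconds)."""
    step = (math.log10(hi) - math.log10(lo)) / (n - 1)
    return [10 ** (math.log10(lo) + i * step) for i in range(n)]

def sample_pair(grid, rng):
    """Draw one (T1, T2) pair from the grid, rejecting draws with T1 < T2."""
    while True:
        t1, t2 = rng.choice(grid), rng.choice(grid)
        if t1 >= t2:
            return t1, t2

def label_distribution(components):
    """Label a list of (weight, T1, T2) components as 'A', 'B', 'C' or None."""
    mu, t1, t2 = max(components, key=lambda c: c[2])  # component with longest T2
    if t2 > 0.1 and mu >= 0.05:
        return "A"
    if t2 < 0.1 and t1 > 0.15 and mu >= 0.05:
        return "B"
    if max(c[1] for c in components) < 0.1:
        return "C"
    return None  # assumption: no criterion met, distribution left unlabeled

grid = log_grid()
rng = random.Random(1)
example_pair = sample_pair(grid, rng)
```

For instance, `label_distribution([(0.6, 1.0, 0.05), (0.4, 0.02, 0.01)])` returns `"B"`: the longest T₂ is below 0.1 s while its associated T₁ exceeds 0.15 s.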

To further reduce the size of training datasets, we exploited the separable structures of the functional form, and applied singular value decomposition (SVD) on T₁ and T₂ kernels independently²². Consequently, for any given sample, the size of compressed datasets was 1.57 KB when acquired by sequence α, 1.34 KB when acquired by β, and 1.25 KB when acquired by γ. As elaborated in the supplementary information, a near 1000-fold reduction in memory usage was achieved with the SVD compression.
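The compression can be summarized as follows, in the spirit of the tensor-product formulation of ref. 22. The matrix symbols and the truncation ranks $r_1$ and $r_2$ are notation assumed here for illustration; the text does not specify them. On the discrete acquisition grid, with $\tau_{2,i}$ denoting the sampled values of $N_e \cdot t_e$, Eq. 1 reads

$$S=K_1 F K_2^{\top}+E,\qquad (K_1)_{ij}=1-\theta e^{-\tau_{1,i}/T_{1,j}},\qquad (K_2)_{ij}=e^{-\tau_{2,i}/T_{2,j}},$$

and truncated SVDs of the two kernels, $K_i\approx U_i\Sigma_i V_i^{\top}$ with $r_i$ retained singular values, give the compressed data

$$\tilde{S}=U_1^{\top} S\, U_2 \in \mathbb{R}^{r_1\times r_2}.$$

Because $r_1 r_2$ is typically far smaller than the number of acquired points, only $\tilde{S}$ needs to be stored and passed to the classifier.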

Subsequently, we trained three ECOC classifiers, an ensemble method for multiclass classification problems²⁴; each classifier is used while the corresponding pulse sequence is running. The ECOC models encoded the binary classification results from three linear support vector machines (SVM) into a coding design matrix²⁵, using the “one-versus-one” strategy, in which each SVM distinguished a pair of labeled time-domain patterns in the training set while ignoring the third fluid class. For sequences α, β, and γ, the ECOC classifiers had respective sizes of 4.7, 4.1 and 3.8 KB. In total, less than 13 KB of fast memory was required to store the models.

In the inference stage of classifying a new 2D NMR dataset, we utilized the loss-weighted decoding scheme²⁶ to aggregate predictions of the binary learners, in which the weighted “hinge” error functions²⁵ over all binary losses were minimized. After running a pulse sequence, the number of floating-point calculations for classifying the generated data, after normalization and SVD compression, is fewer than 700. More details of the model training, validation and inference can be found in the supplementary information.
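The inference step can be illustrated with the sketch below, which pairs a one-versus-one coding matrix for three classes with a simplified loss-based decoder. It is an assumption-laden illustration rather than the authors' implementation: each binary loss is weighted by the magnitude of the corresponding coding-matrix entry (so an ignored class contributes nothing), the hinge loss carries a 1/2 scaling that is one common convention, and the scheme of ref. 26 may weight learners differently. All names and the example scores are hypothetical.

```python
# One-versus-one coding matrix for classes A, B, C (keys) and three binary
# learners (tuple positions): +1 / -1 mark the two classes a learner
# separates, 0 marks the class it ignores.
CODING = {
    "A": (+1, +1, 0),
    "B": (-1, 0, +1),
    "C": (0, -1, -1),
}

def linear_score(weights, bias, features):
    """Decision value of one linear SVM: w . x + b."""
    return sum(w * x for w, x in zip(weights, features)) + bias

def hinge(code, score):
    """Hinge-type binary loss; the 1/2 scaling is an assumed convention."""
    return max(0.0, 1.0 - code * score) / 2.0

def decode(scores):
    """Return the class whose code word has the smallest weighted mean loss."""
    def loss(code_word):
        used = [(m, s) for m, s in zip(code_word, scores) if m != 0]
        return sum(hinge(m, s) for m, s in used) / len(used)
    return min(CODING, key=lambda k: loss(CODING[k]))

# Hypothetical decision values from the A-vs-B, A-vs-C and B-vs-C learners.
predicted = decode((1.2, 0.8, -0.3))
```

Each linear SVM costs one dot product over the compressed features, which is consistent with the small operation count quoted above.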

We performed realtime optimizations of continuous NMR experiments with the trained ECOC classifiers, as illustrated in Fig. 3. The mobile NMR sensor²⁷, shown in Fig. 3B, was miniaturized largely due to the use of an NMR ASIC (Application Specific Integrated Circuit)³. The NMR probe, embodied in a Halbach-array magnet, was made of a solenoid coil wound around a polymer capillary, interrogating fluid samples of 17 μL in volume. During operation, the spectrometer executed a selected pulse sequence that was downloaded from the laptop, on which the acquired data were input to ECOC classifiers for realtime inference.

Figure 3C shows a series of experiments on sequentially displaced samples. The samples were six water/glycerol mixtures of varying volume fractions and one emulsified fluid. As the volume fraction of glycerol increases, the relaxation times of the mixtures shorten from T₂ = T₁ = 2 s for pure water to T₂ = T₁/2 = 15 ms for pure glycerol; meanwhile, the T₁/T₂ ratio also increases owing to the elevated fluid viscosity. We started with sample 1 of pure water while applying sequence γ; the classifier correctly identified the fluid as class A and instructed the spectrometer to use sequence α for the subsequent run. Thereafter, sequence α was optimally applied to samples 1, 2 and 3. Sample 4 had T₁/T₂ slightly above 1 with T₂ = 0.1 s. Consequently, the optimization routine identified the sample as class B. Sequence β was properly applied until sample 5 was loaded, which was classified as class C. Subsequently, sequence γ was applied to samples 5 and 6, which had rather short relaxation times. Finally, we displaced glycerol with a well-gelled emulsion sample, which the ECOC classifier correctly identified as a class B fluid; the spectrometer subsequently applied sequence β for the rest of the experiments.
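The closed-loop logic of this experiment can be sketched as a small replay. The class-to-sequence mapping is the one stated in the text; the function name `run_loop` is hypothetical, and the replay is simplified to one measurement per sample, with the list of predicted classes standing in for the output of the ECOC classifier attached to the sequence in use.

```python
# Optimal sequence for each fluid class, as stated in the text.
OPTIMAL = {"A": "alpha", "B": "beta", "C": "gamma"}

def run_loop(predicted_classes, start_sequence):
    """Replay a measurement series.

    predicted_classes: class inferred after each measurement.
    Returns the sequence used for every measurement plus the next one queued.
    """
    sequence = start_sequence
    used = []
    for fluid_class in predicted_classes:
        used.append(sequence)            # measure with the current sequence
        sequence = OPTIMAL[fluid_class]  # switch for the following run
    return used, sequence

# Simplified replay of samples 1-6 followed by the emulsion.
used, queued = run_loop(["A", "A", "A", "B", "C", "C", "B"], "gamma")
```

In this replay the first measurement on a newly loaded sample may still use the previous sample's sequence; the switch takes effect on the following run, as in the experiment described above.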

In addition to physical displacement, a given sample could also evolve over time. For example, the emulsion fluids, which are multiphasic mixes of oil, brine, organoclays, and naturally-mined barite particles, could experience phase separation under static conditions. As the emulsion collapsed and solid particles sedimented, the emancipated oil exhibited a characteristic T₂ time much longer than the original fluid, calling for a different optimal sequence.

Experimentally, we applied the optimization routine to continuously monitor an emulsion sample under static conditions. As shown in Fig. 3D, the initial well-gelled emulsion fell into fluid class B, to which sequence β was optimally applied. At about 50,000 s after the experiment commenced, signals of bulk oil started to appear with a T₂ of ca. 0.5 s. As the volume fraction of free oil increased, the fluid gradually morphed into class A, with the corresponding optimal sequence α. Notably, a transition window appeared at ca. 60,000 s, where the free fluid content was still marginal and the inference results hinged partially on the noise realization of each measurement.

In practice, it is important to ensure that each sequence is sensitive over the entire numerical domain of the T₁ − T₂ spectra under consideration. Failure to meet this requirement could cause misclassification and thereby erroneous results. For example, the maximum τ₁ in sequence γ, which is optimized for samples of fast relaxation times, should be long enough that the signal still changes appreciably for the maximal T₁. Mathematically, it should satisfy $1-\exp (-{\tau }_{1,\max }/{T}_{1,\max })\gg \sigma $, where σ² is the variance of the Gaussian noise. The relation indeed holds in this work, as $\tau_{1,\max}$ = 1 s for sequence γ, $T_{1,\max}$ = 2 s, and σ² = 0.012².
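The arithmetic behind this check is short enough to verify directly. The sketch below uses only the numbers quoted in the text; the function and variable names are hypothetical.

```python
import math

def recovery_contrast(tau1_max, t1_max):
    """Left-hand side of the criterion: 1 - exp(-tau1_max / T1_max)."""
    return 1.0 - math.exp(-tau1_max / t1_max)

SIGMA = 0.012                        # noise standard deviation from the text
gamma = recovery_contrast(1.0, 2.0)  # sequence gamma: tau1_max = 1 s, T1_max = 2 s
ratio = gamma / SIGMA                # how far the contrast sits above the noise
```

For sequence γ the contrast evaluates to about 0.39, roughly 33 times σ, so the criterion is comfortably met.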

Although we focus on relaxometry, the method can be extended to other, more complex types of NMR measurements, such as multidimensional spectroscopy and MRI, at the core of which are forward models of similar mathematical constructs (exponential, sine and cosine functions). In conjunction with its minimal requirements on computing resources, the demonstrated approach may extend NMR methods to a substantially broader range of field applications.

References

1. Günther, H. NMR spectroscopy: basic principles, concepts and applications in chemistry (John Wiley & Sons, 2013).
2. Danieli, E., Perlo, J., Blümich, B. & Casanova, F. Small magnets for portable NMR spectrometers. Angew. Chem. Int. Ed. 49, 4133–4135 (2010).
3. Ha, D., Paulsen, J. L., Sun, N., Song, Y.-Q. & Ham, D. Scalable NMR spectroscopy with semiconductor chips. Proceedings of the National Academy of Sciences 111, 11955–11960 (2014).
4. Huber, S. et al. Multichannel digital heteronuclear magnetic resonance biosensor. Biosensors and Bioelectronics 126, 240–248 (2019).
5. Wensink, H. et al. Measuring reaction kinetics in a lab-on-a-chip by microcoil NMR. Lab on a Chip 5, 280–284 (2005).
6. Zalesskiy, S. S., Danieli, E., Blümich, B. & Ananikov, V. P. Miniaturization of NMR systems: Desktop spectrometers, microcoil spectroscopy, and “NMR on a chip” for chemistry, biochemistry, and industry. Chemical Reviews 114, 5641–5694 (2014).
7. Colucci, L. A. et al. Fluid assessment in dialysis patients by point-of-care magnetic relaxometry. Science Translational Medicine 11, eaau1749 (2019).
8. Jeong, S. et al. Real-time quantitative analysis of metabolic flux in live cells using a hyperpolarized micromagnetic resonance spectrometer. Science Advances 3, e1700341 (2017).
9. Kleinberg, R. et al. Deep sea NMR: Methane hydrate growth habit in porous media and its relationship to hydraulic permeability, deposit accumulation, and submarine slope stability. Journal of Geophysical Research: Solid Earth 108 (2003).
10. Fridjonsson, E. O., Stanwix, P. L. & Johns, M. L. Earth’s field NMR flow meter: Preliminary quantitative measurements. Journal of Magnetic Resonance 245, 110–115 (2014).
11. Pinter, M., Harter, T., McCarthy, M. & Augustine, M. Towards Using NMR to Screen for Spoiled Tomatoes Stored in 1,000L, Aseptically Sealed, Metal-Lined Totes. Sensors 14, 4167–4176 (2014).
12. Blümich, B. et al. Noninvasive testing of art and cultural heritage by mobile NMR. Accounts of Chemical Research 43, 761–770 (2010).
13. Levitt, M. H. Spin dynamics: basics of nuclear magnetic resonance (John Wiley & Sons, 2001).
14. Song, Y.-Q., Tang, Y., Hürlimann, M. & Cory, D. Real-time optimization of nuclear magnetic resonance experiments. Journal of Magnetic Resonance 289, 72–78 (2018).
15. Jones, J., Hodgkinson, P., Barker, A. & Hore, P. Optimal sampling strategies for the measurement of spin–spin relaxation times. Journal of Magnetic Resonance, Series B 113, 25–34 (1996).
16. Reci, A., Ainte, M., Sederman, A. J., Mantle, M. D. & Gladden, L. F. Optimising sampling patterns for bi-exponentially decaying signals. Magnetic Resonance Imaging 56, 14–18 (2019).
17. Chiang, M. & Zhang, T. Fog and IoT: An overview of research opportunities. IEEE Internet of Things Journal 3, 854–864 (2016).
18. Tubel, P., Bergeron, C. & Bell, S. Mud pulser telemetry system for down hole measurement-while-drilling. In Instrumentation and Measurement Technology Conference, 1992. IMTC’92., 9th IEEE, 219–223 (IEEE, 1992).
19. Jarrot, A., Gelman, A. & Kusuma, J. Wireless digital communication technologies for drilling: Communication in the bits/s regime. IEEE Signal Processing Magazine 35, 112–120 (2018).
20. Bloembergen, N., Purcell, E. M. & Pound, R. V. Relaxation effects in nuclear magnetic resonance absorption. Physical Review 73, 679 (1948).
21. Song, Y.-Q. et al. T₁ − T₂ correlation spectra obtained using a fast two-dimensional Laplace inversion. Journal of Magnetic Resonance 154, 261–268 (2002).
22. Venkataramanan, L., Song, Y.-Q. & Hürlimann, M. D. Solving Fredholm integrals of the first kind with tensor product structure in 2 and 2.5 dimensions. IEEE Transactions on Signal Processing 50, 1017–1026 (2002).
23. Traficante, D. D. Relaxation. Can T₂ be longer than T₁? Concepts in Magnetic Resonance 3, 171–177 (1991).
24. Dietterich, T. G. & Bakiri, G. Solving multiclass learning problems via error-correcting output codes. Journal of Artificial Intelligence Research 2, 263–286 (1994).
25. Bishop, C. M. Pattern recognition and machine learning (Springer, 2006).
26. Escalera, S., Pujol, O. & Radeva, P. On the decoding process in ternary error-correcting output codes. IEEE Transactions on Pattern Analysis and Machine Intelligence 32, 120–134 (2010).
27. Tang, Y., McCowan, D. & Song, Y.-Q. A miniaturized spectrometer for NMR relaxometry under extreme conditions. Scientific Reports 9, 11174 (2019).

Acknowledgements

The authors would like to thank Drs. Martin Hürlimann, Nick Heaton, and Reda Karoum at Schlumberger for helpful discussions. We would also like to thank Dr. Reda Karoum for providing the emulsion sample.

Author information

Authors and Affiliations

Schlumberger-Doll Research, Cambridge, MA, 02139, USA
Yiqiao Tang & Yi-Qiao Song

Authors

Yiqiao Tang
Yi-Qiao Song

Contributions

Y.T. and Y.S. conceived the project; Y.T. performed the experiments; Y.T. and Y.S. wrote the paper.

Corresponding author

Correspondence to Yiqiao Tang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Tang, Y., Song, YQ. Realtime optimization of multidimensional NMR spectroscopy on embedded sensing devices. Sci Rep 9, 17486 (2019). https://doi.org/10.1038/s41598-019-53929-1

Received: 24 September 2019
Accepted: 07 November 2019
Published: 25 November 2019
Version of record: 25 November 2019
DOI: https://doi.org/10.1038/s41598-019-53929-1
