Background & Summary

Diffusion magnetic resonance imaging (dMRI) is a noninvasive technique widely used to probe the microstructure of biological tissues1. Conventional dMRI acquisitions employ a single echo time (TE)2,3,4,5,6, which imposes fixed T2 weighting across all measurements and have been shown to cause variations of dMRI metrics when the TE changes. For instance, changes with TE have been reported in the apparent diffusion coefficient (ADC)7,8, the diffusion tensor imaging (DTI) metrics including the fractional anisotropy (FA), axial diffusivity (AD), and radial diffusivity (RD) in the white matter regions9,10, as well as the compartmental signal fractions from the neurite orientation dispersion and density imaging (NODDI). This TE-dependence of dMRI metrics limits the interpretability of conventional studies.

More recent multi-echo-time (MTE) dMRI studies address this limitation by collecting multi-dimensional data that enhances sensitivity to both microstructural and compositional differences within tissue compartments11,12,13,14,15,16. With such data, it is possible to model both diffusion and T2 effects in tissue compartments. This combined diffusion-T2 modeling not only reduces biases in microstructural parameters but also provide additional sub-cellular T2 measures, offering potential for more specific biomarkers.

Despite these valuable insights, methods that incorporate multiple TEs are not widely accessible and therefore remain underutilized. Several factors attribute to this limitation: (1) acquiring dMRI data at multiple TEs necessitates additional scan time, which can be impractical in clinical settings; (2) longer TEs often result in reduced signal-to-noise ratio (SNR), complicating data acquisition in brain regions with short T2 relaxation time; and (3) echo-planar imaging (EPI), the predominant sequence used for dMRI, is prone to distortions that intensify at longer TEs because of increased susceptibility to B0 field inhomogeneities.

To bridge this gap, we present a 3 T dMRI dataset collected from three subjects across ten sessions, incorporating a diverse range of TEs. This dataset includes eight different TE sessions, ranging from 62 ms to 132 ms, with two b-values (700 and 2000 s/mm2) across 30 gradient directions and diffusion time fixed at each b-value and TE session. The multi-shell acquisition makes the data applicable to more complex models17,18. Additionally, we repeated the measurements at TE = 62 and 132 ms, considering that the shortest and longest TE sessions have been shown to critically impact the fitting of MTE model11.

By openly sharing this dataset, we aim to facilitate research in combined diffusion-relaxometry modeling, reproducibility analysis, and optimization of minimal acquisition protocols. This dataset further enables exploration of tissue-specific variations in MTE-derived microstructural properties across the brain, advancing the development of robust biomarkers for clinical research.

Methods

Participants

Data were collected from three healthy volunteers: subject 1 (male, 24 years), subject 2 (female, 27 years), and subject 3 (male, 28 years). The participants were recruited through printed advertisements. The written informed consent for data collection and data sharing was obtained from all participants and approved by the local research Ethics Committee of Zhejiang University (ref. 2018376).

Image acquisitions

All MRI scans were conducted using a Siemens 3 T Prisma scanner (Siemens, Erlangen, Germany) equipped with a 64-channel head-neck coil. The dMRI data were acquired with a repetition time (TR) of 5800 ms across ten sessions, each lasting approximately seven minutes. Imaging included eight different TEs: 62, 72, 82, 92, 102, 112, 122, and 132 ms. Two repetitions were collected at the shortest and longest TEs (62R2 and 132R2). Two b-values were used in the acquisition: b = 700 s/mm2 and b = 2000 s/mm2, applied in 30 gradient directions with uniform angular coverage, and the same directions were used for both b-values. A total of four b = 0 images were acquired: two in the anterior-to-posterior (A-P) direction and two in the posterior-to-anterior (P-A) direction. Diffusion times were fixed with gradient width and separation set at δ/Δ = 15.2/25.2 ms for all b-values and TEs. Other imaging parameters included: Field of View (FOV): 210 × 210 mm2; Number of slices: 48; Resolution: 2.5 × 2.5 × 2.5 mm³; Slice acceleration factor: 2; Phase acceleration factor: 2; Bandwidth: 2380 Hz/pixel.

Image preprocessing

The data were organized according to the Brain Imaging Data Structure (BIDS) format. The preprocessing pipeline for diffusion-weighted images (DWIs) was performed separately for each TE session (Fig. 1). This included: (1) denoising using MRtrix3 with dwidenoise function (Marchenko-Pastur Principal Component Analysis denoising)19,20; (2) correcting for B0-inhomogeneity using TOPUP function21; (3) correcting for eddy current-induced distortion using EDDY22 functions in FSL. The effect of additional preprocessing steps, specifically Gibbs ringing, B1 field inhomogeneity and Rician bias correction, was assessed. However, considering the marginal improvement that these steps had, only the results with the first three steps are reported to keep preprocessing minimal.

Fig. 1
figure 1

Preprocessing pipeline for multiple TEs data. The left panel (Raw data) shows unprocessed b = 0 images for anterior-posterior (AP) and posterior-anterior (PA) phase encoding directions, with DWIs at b = 700 s/mm2 and b = 2000 s/mm2 across 30 directions. The right panel (Preprocessed data) displays the same data after applying preprocessing steps, including denoising, B0 inhomogeneity correction, eddy and motion correction, and aligning the DWIs and b-vectors to the first TE session.

To align the DWIs across multiple TE sessions, linear registration was performed using FLIRT in FSL to register mean b = 0 images from later TEs to the first TE. The affine matrix obtained from FLIRT was then applied to the eddy-corrected DWIs. Finally, the eddy corrected b-vectors of the later TEs were rotated again using the obtained affine matrix to align with the first TE.

Data Records

The dataset is available at Science Data Bank23 (https://doi.org/10.57760/sciencedb.o00133.00038). All personal identifiers were not included and the facial features were also not included as the FOV only covered the brain. The neuroimaging dataset is available in two formats: raw and preprocessed MRI data, both of which are available for all release subjects. For each subject, unprocessed and preprocessed images for dMRI are provided within each TE session (Fig. 2).

Fig. 2
figure 2

Data organization structure for dMRI datasets. Subjects (sub-01, sub-02, sub-03) are acquired with multiple echo times (TEs), including TE62, TE72, TE82, TE92, TE102, TE112, TE122, and TE132. Two repetitions are acquired at the shortest and longest TEs (TE62R2, and TE132R2). The data is stored in different folders: raw (containing A-P and P-A b = 0 images and DWIs with b-values and vectors); and dwi (unmasked average b = 0 image and DWIs with b-values and vectors).

Technical Validation

Visual inspection

The mean DWIs from each b-value and TE session from all subjects (Fig. 3) were shown with no significant ghosting or reconstruction artifacts by visual inspection. The increase in TE resulted in reduced signal intensities for every b-value, indicating the reduction of the signal to noise ratio at longer TEs.

Fig. 3
figure 3

Multi-echo diffusion-weighted brain MRI images from all subjects (AC). In each panel, each row represents the mean diffusion-weighted images at b = 0 s/mm2, b = 700 s/mm2, and b = 2000s/mm2, and each column represents different TE sessions (TE62, TE62R2, TE72, TE82, TE92, TE102, TE112, TE122, TE132, and TE132R2).

Temporal signal to noise ratio

The temporal signal-to-noise ratio (tSNR) was calculated for each voxel as the ratio of the mean to the standard deviation derived from the sigma noise map obtained in the denoising preprocessing step. White matter (WM) and gray matter (GM) masks were generated using FAST24 function in FSL based on the average b = 0 image for each subject. The average tSNR within both masks was computed for each subject by averaging tSNR values across all voxels in the masks. The increase in TE led to a substantial reduction in average tSNR (Fig. 4), which was consistent with greater T2-related signal decay at longer echo times.

Fig. 4
figure 4

The average temporal signal-to-noise ratio (tSNR) across different echo times (TE) for white matter (WM) and gray matter (GM) computed from preprocessed four b = 0 images and the sigma noise map across all subjects.

Head motion

Head motion was estimated using the EDDY tool during DWI preprocessing. Translation and rotation parameters relative to the first DWI volume were extracted from the *.eddy_parameters file (Fig. 5A and B). Additionally, two summary measures of volumetric head motion were calculated as the root mean square (RMS) displacement: one representing displacement relative to the first volume and the other representing displacement between adjacent volumes, as listed in the *.eddy_restricted_movement_rms file (Fig. 5C). The mean and standard deviation of these head motion measurements across all subjects were reported in Table 1 (Translation), Table 2 (Rotation), and Table 3 (RMS).

Fig. 5
figure 5

Head motion measurement estimated from the rigid-body relative to the first DWI volume of a representative subject, quantified in the translation (A) along the X, Y, and Z-axis, as well as rotation (B) (X-axis: pitch, Y-axis: roll, Z-axis: yaw), and (C) the RMS displacement to the first and previous volumes.

Table 1 Mean and Standard Deviation of Translation Across All Subjects.
Table 2 Mean and Standard Deviation of Rotation Across All Subjects.
Table 3 Mean and Standard Deviation of Root Mean Square Displacement (RMS) Across All Subjects.

Impact of TE on Diffusion Model Parameters

To demonstrate the effects of TE on diffusion model parameters, we fitted the data from a representative subject to the DTI, NODDI, and the multi-shell multi-tissue constrained spherical deconvolution model (MSMT-CSD)18.

This assessment demonstrates the usage of the data for microstructural and fiber orientation mapping.

DTI

The DTI model25, a widely utilized model in clinical studies that requires data from only a single b-shell, was separately fitted to the dataset (b = 700 s/mm2) for each TE session using DTIFIT in FSL. Whole-brain analysis (Fig. 6A) demonstrated that progressive increases in TE were associated with the decrease in FA and AD, and increase in MD and RD.

To further illustrate TE-dependent effects, regions of interest (ROIs) analysis was performed on WM structures extracted from the Johns Hopkins University (JHU) WM atlas26, including the body (BCC), genu (GCC), splenium (SCC) of the corpus callosum, and the superior longitudinal fasciculus (SLF). ROI analysis (Fig. 6B) revealed that when increasing TE, FA decreased in the BCC, whereas other regions exhibited minimal percentage change. Similarly, the decrease of AD also displayed in the BCC and SCC with increasing TE but remained relatively stable in other regions. However, RD demonstrated a decline in the GCC and SCC with the increase in TE. Lastly, the mean diffusivity (MD) reduced in the SCC.

NODDI

The NODDI model, a more complex representation of tissue microstructure that requires data from both low and high b-values with multiple gradient directions, which provides three metrics such as neurite density index (NDI), orientation dispersion index (ODI), and free water (FW), was fitted to the dataset using the NODDI toolbox executed in MATLAB 2018a (MathWorks, Natick, MA, USA) with default settings for each TE session separately. Our results (Fig. 6C and D) indicated that an increase in TE led to an increase in NDI and FW, while ODI remained relatively stable in the whole brain and across all ROIs.

Fig. 6
figure 6

Maps of dMRI metrics for every echo time session for DTI model (A), and NODDI model (B), and percentage change of dMRI metrics across brain regions at different TEs compared with the first TE (TE = 62 ms) of a representative subject. Line plots illustrate TE-dependent effects on: (C) DTI metrics, FA, MD AD, and RD; (D) NODDI metrics, NDI, ODI, and FW. Negative values denote reductions, while positive values indicate increases relative to baseline (TE = 62 ms).

Comparison of effect size

The effect size analysis (Fig. 7) revealed substantial advantages for NODDI model, with NDI, ODI, and FW metrics demonstrating effect sizes 2-3 times larger than traditional DTI measures across multiple brain regions. These findings support the adoption of advanced dMRI models as complementary tools to standard DTI analysis, and may provide enhancement for detecting microstructural differences in both research and clinical applications.

Fig. 7
figure 7

Line plots illustrate TE-dependent effects on the effect size (Cohen’s d) relative to the first TE (TE = 62 ms): (A) DTI metrics, FA, MD AD, and RD; (B) NODDI metrics, NDI, ODI, and FW.

MSMT-CSD

The fiber orientation distributions (FODs) of a representative subject from dMRI data at different TE sessions was reconstructed using multi-shell multi-tissue (MSMT-CSD) with dhollander algorithm in MRtrix3 (Fig. 8). FODs resolved crossing fibers, with variations observed across TE sessions (TE62–TE132R2), underscoring influence of TE on microstructural sensitivity.

Fig. 8
figure 8

Fiber orientation distributions (FODs) from a representative subject, reconstructed using multi-shell multi-tissue constrained spherical deconvolution (MSMT-CSD) with the dhollander algorithm in MRtrix3. The left panel highlights a region of interest (red box) on a T2-weighted slice overlaid with the FODs, and the panels on the right display the zoom-in of the FODs within the red box at the crossing fiber region across multiple echo time (TE) sessions.

Usage Notes

The current dataset, which includes healthy controls, complementing prior patient-focused studies. Unlike the recently published dataset27—acquired with 4–16 gradient directions per b-value (b = 400, 800, 1600, 3200 s/mm2)—the current dataset employs 30 gradient directions per b-value, enabling robust validation of healthy control populations. This enhanced parameterization also facilitates detailed exploration of the interplay between diffusion and T2 relaxation effects, particularly for advanced modeling in tissue microstructure characterization and compartment analysis. However, the current dataset is limited by small sample size (N = 3) as the acquisition time is very long. With this dataset as a testbed for evaluating accelerated protocols, future studies can acquire larger dataset with an accelerated protocol. The current dataset doesn’t include higher b-shells considering that including them would have made the minimal TE much longer on the clinical system; a shorter minimal TE could potentially improve compartmental T2 estimation11,16. In addition, this dataset may benefit free-water elimination methods for DTI and DKI fitting.