Abstract
The hippocampus plays a critical role in memory and is prone to neural degenerative diseases. Its complex structure and distinct subfields pose challenges for automatic segmentation in 3 T MRI because of its limited resolution and contrast. While 7 T MRI offers superior anatomical details and better gray-white matter contrast, aiding in clearer differentiation of hippocampal structures, its use is restricted by high costs. To bridge this gap, algorithms synthesizing 7T-like images from 3 T scans are being developed, requiring paired datasets for training. However, the scarcity of such high-quality paired datasets, particularly those with manual hippocampal subfield segmentations as ground truth, hinders progress. Herein, we introduce a dataset comprising paired 3 T and 7 T MRI scans from 20 healthy volunteers, with manual hippocampal subfield annotations on 7 T T2-weighted images. This dataset is designed to support the development and evaluation of both 3T-to-7T MR image synthesis models and automated hippocampal segmentation algorithms on 3 T images. We assessed the image quality using MRIQC. The dataset is freely accessible on the Figshare+.
Similar content being viewed by others
Background & Summary
The hippocampus plays an important role in clinical neuroscience due to its critical functions in long-term episodic memory consolidation, spatial navigation, and learning1,2,3. Previous studies have demonstrated that the hippocampus is not a uniform structure; rather, it is comprised of distinct subfields, each exhibiting unique functional profiles, connectivity patterns, and differential vulnerabilities to specific disease processes4,5. Precise delineation of hippocampus subfields is crucial for the diagnosis and management of various neurological and psychiatric disorders. Moreover, the picture is made more complex by the fact that the hippocampus is not a homogeneous brain structure. It is composed of two convoluted structures: the dentate gyrus (DG) and the Cornu Ammonis (CA), with the latter divided into four subdivisions (CA1–4). The subiculum (SUB) is also typically recognized as a component of the hippocampal complex. In conjunction with the entorhinal cortex (ERC), these elements constitute the hippocampal formation6. Despite the availability of automated methods for hippocampal subfield segmentation, such as FreeSurfer v6.07, ASHS8, MAGeT-Brain9, HIPS10, CAST11, HippUnfold12, HSF13, the performance of these supervised methods largely depend on the accuracy of manual segmentation, which serves as the gold standard. Such manual segmentations require substantial expertise and a considerable time investment—approximately eight hours per subject when executed by an experienced delineator14. Therefore, researchers face a notable scarcity of databases providing comprehensive manuals for hippocampal subfield segmentation to guide the development of advanced and automated segmentation methods.
Further compounding the challenge is the inherent limitation of standard 3 T MRI in resolving the fine anatomical details of hippocampal subfields due to limited signal contrast and resolution. In contrast, 7 T MRI, with its superior spatial resolution and signal-to-noise ratio (SNR), provides significantly enhanced anatomical detail15,16, improving the accuracy of various neuroimaging post-processing tasks, including tissue segmentation and cortical surface reconstruction. The ability of 7 T MRI to reveal subtle pathological changes invisible on lower-field-strength scanners further enhances its value in both research and clinical diagnostics17,18,19. Additionally, imaging techniques that utilize susceptibility-induced contrast, including functional MRI (Blood Oxygenation Level Dependent, BOLD), which benefit greatly from the ultra-high field strength20, enabling the construction of mechanistic models of cognitive function21. Even in the hippocampus, where primary structures within the hippocampal formation are visible at clinical 3 T MR scanner, encounters resolution limitations that impede the delineation of finer subfield anatomy and distinct signal profiles achievable at higher field strengths22. This is especially critical considering the hippocampus’s small volume (typically measuring 1–3 ml in elderly subjects, including those with Alzheimer’s disease) which is represented by only 1000–3000 voxels at the standard clinical resolution of 1 × 1 × 1 mm3. Moreover, with 80–90% of these voxels located on the surface, partial volume effects can substantially impact volume estimation accuracy23. 7 T MRI substantially mitigates these partial volume effects, revealing more anatomical details24 and improving the interpretation of imaging data acquired at standard clinical field strengths25. The superior SNR and ultra-high resolution of 7 T MRI enable the consistent visualization of internal features while maintaining a thin slice thickness (as fine as 1 mm), essential for revealing the intricate anatomy of the hippocampal head. Notably, high-resolution T2-weighted images at 7 T have demonstrated their efficacy in demarcating medial temporal lobe (MTL) subregions via the vivid delineation of the stratum radiatum lacunosum moleculare (SRLM), which manifests as a distinct dark band and acts as a critical marker for demarcating subregion boundaries26.
Despite their significant benefits in improving tissue contrast and anatomical detail, the integration of 7 T MRI scanners in research and clinical settings is primarily restricted by high cost and complex maintenance demands27. To circumvent these challenges and enhance the utility of the more prevalent 3 T MRI datasets, various deep learning-based 3T-to-7T image synthesis methods have been proposed, demonstrating promise in generating realistic high-resolution 7T-like images from 3 T scans27,28,29. However, for super-resolution task, obtaining a paired dataset that includes both low and high-resolution MRIs for algorithm training constitutes a practical hurdle considering most existing approaches are implemented on private datasets30. While several Generative Adversarial Networks (GANs) based synthesis models can be trained in the absence of paired images, paired data remains crucial for quantitative model validation31.
Here, we disseminate a dataset of whole-brain paired T1-weighted, T2-weighted and resting-state fMRI (rfMRI) scans acquired at 3 T and 7 T scanners, featuring manual hippocampal subfield annotations on the 7 T T2-weighted images. We provide a comprehensive description of the design, acquisition, and preparation of the dataset. Image quality is assessed using quality metrics implemented in MRIQC32. This dataset has already been utilized to develop Syn_SegNet33, an end-to-end, multitask joint deep neural network that leverages 7 T MRI synthesis to improve hippocampal subfield segmentation in 3 T MRI. We anticipate that this resource will significantly advance the field by facilitating research and development in 3T-to-7T image synthesis, T1-to-T2 image translation, and functional MRI analysis, improving the accuracy and accessibility of hippocampal subfield segmentation.
Methods
Participant recruitment
Twenty healthy volunteers (10 males and 10 females) aged 18–25 years were recruited from university. Inclusion criteria were: 1) no history of brain injury, neurological, or psychiatric disorders; 2) a Mini-Mental State Examination (MMSE)34 score ≥ 24; 3) absence of contraindications to MRI scanners, such as claustrophobia or the presence of metallic implants. All participants provided written informed consent, including permission for open sharing of anonymized data. Ethical approval was granted by the Science and Ethics Committee of the School of Biological Science and Medical Engineering in Beihang University (Application No. 20140304), in accordance with the Declaration of Helsinki.
Additional demographic information collected included handedness (all participants were right hand dominant, confirmed using the Edinburgh Handedness Inventory35) and years of education (mean = 17.25 years, SD = 0.69).
Ten questionnaires were administered in the following order: Basic Information Questionnaire, Edinburgh Handedness Inventory (EHI), MMSE, Self-Rating Depression Scale (SDS)36, Self-Rating Anxiety Scale (SAS)37, Pittsburgh sleep quality index (PSQI)38, Wechsler Adults Intelligence Scale (WAIS-RC)39, Raven’s Progression Matrices (RPM), The Schutte Self Report Emotional Intelligence Test (SSEIT)40, and The Trail Making Test (TMT)41. Detailed demographic and questionnaire data are presented in Table 1.
Image acquisition
MRI data were acquired from each participant using both a 3 T Prisma and a 7 T MAGNETOM MR scanner (Siemens Healthineers, Erlangen, Germany) on the same day at the Beijing MRI Center for Brain Research (BMCBR). A 32-channel receive and single-channel transmit head coil (Nova Medical, Wilmington, MA, USA) was used for 7 T scans, and a 64-channel phased-array head coil (Siemens Healthineers, Erlangen, Germany) was used for 3 T scans.
3 T sequences
T2-weighted turbo spin echo (TSE) sequence was obtained with a resolution of 0.9 × 0.9 × 1.9 mm3. The sequence parameters were: repetition time (TR) = 11050 ms, echo time (TE) = 94 ms, flip angle (FA) = 60°, bandwidth = 100 Hz/Px, and acquisition matrix size = 256 × 256 × 90. One volume acquired perpendicular to the hippocampus long axis.
An isotropic 1mm3 T1-weighted sagittal 3D magnetization-prepared rapid gradient echo (MPRAGE) sequence was acquired along the long axis of the hippocampus, with the following settings: TR = 2300 ms, inversion time (TI) = 1000 ms, TE = 2.26 ms, FA = 8°, and acquisition matrix size = 256 × 224 × 192.
Whole-brain resting-state fMRI was obtained using a 2D echo-planar imaging (EPI) sequence with the following scanning parameters: FA = 50°, TR = 600 ms, TE = 30 ms, resolution = 3.0 × 3.0 × 3.0 mm3, and acquisition matrix size = 80 × 80 × 48. One run with a total of 800 volumes was acquired.
The diffusion-weighted imaging (DWI) scan was acquired using a 2D EPI sequence with a slice thickness of 1.3 mm and the same spacing between slices. The parameters were: TR = 5500 ms, TE = 82 ms, and flip angle = 90°. The acquisition included a multiband acceleration factor of 3 and a parallel reduction factor in-plane of 2. The base resolution and acquisition matrix were both 162, with an echo train length of 61. The EPI factor used was 128.
7 T sequences
T2-weighted Sampling Perfection with Application-optimized Contrasts by using flip angle Evolution (SPACE) sequence: 3D, 0.4 × 0.4 × 1.0 mm³, TR = 3000 ms, TE = 387 ms, FA = 120°, bandwidth = 279 Hz/Px, matrix = 480 × 512 × 224. Two volumes acquired (with and without non-distortion correction, with and without a suffix of ‘ND’ in filename).
T1-weighted MPRAGE: Sagittal, isotropic 0.7 mm³, TR = 2600 ms, TI = 1050 ms, TE = 2.72 ms, FA = 5°, matrix = 320 × 320 × 256. Two volumes acquired (with and without non-distortion correction).
Resting-state fMRI (EPI): 3D, FA = 45°, TR = 1000 ms, TE = 21 ms, resolution = 1.5 × 1.5 × 1.5 mm³, matrix = 128 × 128 × 88. One run of 800 volumes.
Segmentation of hippocampal subfields
We followed the protocol established by Wisse et al.14, which integrates elements from pre-existing methods with new rules to ensure coverage of all subfields throughout the entire length of the hippocampal formation. The hippocampal formation subfields, including the ERC, SUB, CA1, DG-CA4, CA2, CA3, and the hippocampal tail, were manually delineated. Segmentation was performed bilaterally on every coronal section of the subjects’ ultra-high-resolution 7 T T2-weighted MRI scans using ITK-SNAP software42 (version 3.8.0, www.itksnap.org), where 7 T T2-weighted images have been upsampled to a 0.7 mm slice thickness utilizing B-spline interpolation, after Gaussian smoothing denoising and N4 bias field correction with Advanced Normalization Tools (ANTs). Figure 1 displays MRI images from four different modalities of a single subject, whereas Fig. 2 illustrates the manual segmentation results of the left hippocampus alongside its corresponding 3D reconstruction.
Paired T1-weighted and T2-weighted MRI scans at 3 T and 7 T acquired for a subject (sub-10), the left hippocampus of the subject was shown for multiple views and multiple modalities. The final two rows depict zoomed-in coronal views of the hippocampus from head to body at three different slices, comparing 3 T and 7 T T2-weighted images. The 3 T images show lower resolution and significant partial volume effects, while the 7 T images, with enhanced resolution, more clearly demonstrate the hippocampal internal architecture, outer boundaries, and inner boundaries between subfields, with the hypointense line appearing more consistent and visible compared to 3 T.
Manual segmentation of the hippocampal formation in subject 12. (a) A sagittal view illustrating references to the sequential coronal sections. Coronal images depicting the hippocampal formation and accompanying manual segmentation are presented from anterior to posterior, labeled as 2c-2j. (b) A 3D reconstruction viewed from an anterolateral angle highlights the segmented hippocampal subregions. (c–j) For each coronal section, the top row displays the 7 T T2-weighted images, while the bottom row depicts manual delineation of hippocampal subfields. A detailed zoom-in view (seen in image 1 g) demonstrates the delineation of the border between CA2 and CA3 by drawing a virtual square. Abbreviations: ERC, entorhinal cortex; SUB, subiculum; CA, Cornu Ammonis; DG, dentate gyrus.
It should be noted that the implementation of the original protocol required adaptations for use with ITK-SNAP software. Specifically, the delineation of hippocampal subfields was adjusted to accommodate ITK-SNAP’s voxel-by-voxel approach, without modifying the underlying protocol principles. This is in contrast to the freehand spline drawing technique described in the reference protocol by Wisse et al.14. The subfields were manually traced on coronal images from anterior-to-posterior, beginning with Fig. 2c and continuing to 2j on coronal images. The head of the hippocampus is depicted in Fig. 2d–h, the body in Fig. 2i, and the tail in Fig. 2j. These modifications are detailed below, following the sequence of annotation.
-
(i)
The border between the SUB and CA1 in the most anterior slices of the hippocampus was determined by measuring the longest diameter from the medial to the lateral point (Fig. 2d,e). With the emergence of the DG, this border changed, and the SUB became laterally bordered by CA1 at the most medial point of the DG. Given the small size of the DG in early slices, a practical approach involved extrapolating backwards from subsequent levels where the DG was larger and examining the region from multiple perspectives, especially in the sagittal view.
-
(ii)
The delineation of CA2 and CA3 subfields commenced at a position 1.4 mm anterior to the separation point of the uncus from the hippocampus on coronal images, where only the fimbria exhibited attachment to both structures as per the previous protocol. To ensure accuracy and consistency, delineation started on a slice located two slices ahead, assuming a slice thickness of 0.7 mm (Fig. 2g,h).
-
(iii)
As outlined in the protocol14, the CA2-CA3 border was established by defining the medial side of a virtual square. This square was positioned with its lateral side touching the border of CA1 and its superior side aligned along the superior border of CA2. To ensure accuracy and consistency during the segmentation process using ITK-SNAP, a paintbrush size of 4 was employed. The delineation of the virtual square between CA2 and CA3 was accomplished using 4 × 4 voxel squares in most instances, as this size matched the thickness of CA2 at its boundary with CA3 (Fig. 2g).
-
(iv)
The delineation of hippocampal subfields encompassed the entire length of the long axis of the hippocampal formation. However, due to the limited visibility of the total length of the fornix beyond a certain section, reliable delineation of fused subfields was not feasible. To address this limitation and maintain consistency, the last 4–6 slices were labeled as “Tail” (Fig. 2j) in accordance with a previously published protocol14.
Data Records
The dataset is available at Figshare+43. The raw MRI data are stored in.ima format within the ‘rawdata_DICOM_3T/7T’ directories. The rawdata DICOM 7 T data are further divided into four parts, each containing raw DICOM data for 5 subjects. Additionally, the acquired MRI scans have been converted from DICOM to the Neuroimaging Informatics Technology Initiative (NIfTI) format and organized in accordance with the Brain Imaging Data Structure (BIDS) format by employing the BIDScoin Python application (version 4.3.0)44. These files are stored in the ‘rawdata_BIDS_3T/7T’ directories.
Hippocampal subfield segmentation related files can be found in the ‘hippo_subfield\7T_T2-weighted_0.7_for_subfield_delineation’ directory. The ‘hippo_subfield\hippo_label’ directory contains manual segmentation labels for seven hippocampal subregions for each subject. Detailed information on segmentation labels, including label values and colors for each hippocampal subfield is documented in ‘hippo_subfield\subfield label.txt’.
The ‘MRIQC_3T/7T’ directories store the results of quality control assessments. For each participant, the ‘MRIQC_3T/7T\sub*’ directories contain \anat and \func subdirectories, which hold image quality metric reports for T1-weighted, T2-weighted, rfMRI scans. These quality metrics, available in both.html and.json formats, are instrumental in evaluating data quality and provide estimates of motion, SNR, and intensity non-uniformities, supplemented with visual reports in.svg format in the ‘\figure’ subdirectory.
Notably, due to heavy head motion during the first scans, the 3T T2-weighted images for two participants (sub 06 and 12) were rescanned. Only the datasets from these subsequent sessions are retained in the ‘rawdata\BIDS’ directory for further quality evaluation. Figure 3 illustrates the directory structure of our dataset.
The directory structure of the dataset. (a) Overview of all the main folders included in the shared dataset. (b) The folder ‘T1_T2w_0.7_for_subfield_delineation’ contains 7 T T2-weighted images of each participant, which were used for hippocampal subfield delineation. (c) Contents of the ‘hippo_label’ folder, including subfield label files of each subject. (d) The directory structure for the 3T-7T paired dataset, conforming to BIDS standards. (e) Detailed image quality reports for T1-weighted, T2-weighted, and resting-state fMRI data, generated with MRIQC, are included. (f) The ‘rawdata_DICOM_3T’ and ‘rawdata_DICOM_7T’ folders contain anonymized DICOM data for 3 T and 7 T, respectively, without additional processing.
Technical Validation
Hippocampal subfield segmentation on 7 T T2-weighted high resolution structural images
High-resolution 7T T2-weighted structural images were acquired to enable precise manual segmentation of hippocampal subfields. High image quality ensured the visibility of crucial anatomical landmarks, facilitating accurate delineation in all participants. To assess inter- and intra-rater reliability, two experienced medical experts (L.C. and X.D.) independently segmented the hippocampal subfields of ten subjects (50% of the sample) on two separate occasions, one month apart. This procedure established proficiency with the segmentation protocol. Subsequently, these raters collaboratively produced the final manual segmentation for all twenty subjects, reaching consensus on any discrepancies through discussion.
Reliability of the hippocampal subfield segmentations was evaluated using inter- and intra-rater reliability measures: Intraclass Correlation Coefficients (ICC)45 and Dice Similarity Coefficients (DSC)46. These metrics were calculated for each subfield, separately for the left hemisphere, right hippocampus, and bilaterally.
Intra-rater reliability, reflecting the consistency of individual rater performance, was evaluated using both ICCs and DSCs. ICCs were calculated as:
Where MSR represents the mean square between subjects. MSE (mean square error) represents the unexplained variance attributable to random error. MSC represents the mean square between the two independent tracing sessions. n represents the number of subjects. DSCs were calculated as:
where X and Y represent the voxel sets from the two tracing sessions for a given subject, |X ∩ Y| denotes the number of overlapping voxels, and |X| and |Y| represent the total number of voxels in each segmentation. For intra-rater reliability, both ICCs and DSCs ranged from 0 to 1, with higher values indicating greater reliability. We considered ICC values of 0.75–0.90 as good, and values ≥ 0.90 as excellent.
Inter-rater reliability, reflecting the agreement between the two independent raters, was similarly assessed using ICCs and DSCs. The ICC calculation remained consistent, with MSC now representing the mean square between the two raters. The DSC calculation also remained unchanged. For inter-rater reliability, both ICCs and DSCs ranged from 0 to 1, with higher values indicating greater agreement.
Table 2 presents the inter- and intra-rater reliability analyses. The results align with those of the reference protocol14. For intra-rater reliability, our enhanced analysis yielded a DSC of 0.810 for the ERC segment, slightly below the 0.83 reported by Wisse et al.14. Similarly, the CA2 segment’s DSC was 0.80, marginally lower than the 0.83 reported by Wisse et al.14. Conversely, our DSC values for the CA1 and SUB segments(0.875, respectively) closely matched the 0.85 and 0.84 values reported by Wisse et al.14 Regarding intra-rater ICCs, our scores for the ERC, SUB, and CA1 segments are 0.78, 0.96, and 0.97, respectively, marginally below the 0.80, 0.97, and 0.98 reported by Wisse et al.14 The inter-rater reliability findings mirrored the intra-rater results but exhibited slightly lower values.
Image quality metrics (IQMs) for MRI data
All participants underwent an initial quality assessment of their neuroimaging data. We ensured that the datasets conformed to the BIDS standard using the BIDS-validator (version 1.14.3; available at https://github.com/bids-standard/bids-validator). Prior to quality assessment of the T1-weighted, T2-weighted, and rfMRI sequences, we used MRIQC32 to generate subject-specific visual quality assessment reports. These reports facilitated a comprehensive evaluation of processing quality. Detailed IQMs, calculated for all structural and functional imaging series using MRIQC are included in this data release. Six representative IQMs, presented in Fig. 4, are detailed below.
Violin plots illustrating the distribution of the six Image Quality Metrics (IQMs) across different acquisition protocols. Metrics include (a) Signal-to-Noise Ratio (SNR); (b) Entropy Focus Criterion (EFC); (c) Contrast-to-Noise Ratio (CNR); (d) Standardized DVARS (DVARS_std). (e) Total Signal-to-Noise Ratio (tSNR); and (f) Mean Framewise Displacement (FD-mean). Data collected at 3 T is shown in orange, while 7 T data is shown in purple. Asterisks (****) indicate that the p-value of the group difference is less than 0.001, while ‘ns’ denotes non-significant differences. For panels (a–c), statistically significant differences between 3 T and 7 T data are observed, except for T2-weighted images. The FD-mean metric shows no significant group difference between 3T and 7T resting-state functional MRI acquisitions.
Spatial anatomical and functional Metrics:
-
(i)
Signal-to-Noise Ratio (SNR)47: SNR was calculated within each tissue mask including gray matter (GM), white matter (WM), and cerebrospinal fluid (CSF) masks. And the whole-brain averages (means of GM, WM, and CSF values) are reported in Fig. 4. Higher values indicate superior image quality. The estimation may be provided with only one foreground region in which the noise is computed as follows:
$${\rm{SNR}}=\frac{{\mu }_{F}}{{\sigma }_{F}\sqrt{N/(N-1)}}$$where μF is the mean intensity of the foreground and σF is the standard deviation of the same region. N is the number of voxels in foreground mask.
-
(ii)
Entropy Focus Criteria (EFC)48: EFC was derived from the Shannon entropy of voxel intensities relative to the maximum potential entropy for an image of similar size. Lower EFC indicates less ghosting and motion-induced blurring due to head movement during scanning, corresponding to enhanced image quality.
$${\rm{EFC}}=\left(\frac{N}{\sqrt{N}}\log {\sqrt{N}}^{-1}\right){\rm{E}}$$where E is expressed as:
$${\rm{E}}=-\mathop{\sum }\limits_{j=1}^{N}\frac{{x}_{j}}{{x}_{\max }}\mathrm{ln}\left[\frac{{x}_{j}}{{x}_{\max }}\right]$$Where N is the total number of voxels (i.e., independent voxels in the image), xj represents the intensity value of the j-th voxel, typically the grayscale value or signal intensity of that voxel in the image. xmax is the total energy of all voxels, calculated as: \({x}_{\max }=\sqrt{\mathop{\sum }\limits_{j=1}^{N}{x}_{j}^{2}}\)
.
Spatial anatomical Metrics:
-
(iii)
Contrast-to-Noise Ratio (CNR)47: Specific to structural MRI, CNR quantifies the distinguishability between tissue types (e.g., GM and WM). Higher CNR values indicate better tissue contrast.
$${\rm{CNR}}=\frac{\left|{\mu }_{{\rm{GM}}}-{\mu }_{{\rm{WM}}}\right|}{\sqrt{{\sigma }_{B}^{2}+{\sigma }_{{\rm{WM}}}^{2}+{\sigma }_{{\rm{GM}}}^{2}}}$$where σB is the standard deviation of the noise distribution within the air (background) mask. μWM is the mean of signal within the WM mask, μGM is the mean signal within the GM mask. σWM is the standard deviation within the WM mask. σGM is the standard deviation within the GM mask.
Temporal functional Metrics:
-
(iv)
Standardized DVARS49: This metric reflects the average voxel-wise change in signal intensity between consecutive rfMRI volumes, normalized with the standard deviation of the temporal difference time series. Lower standardized DVARS values indicate better image quality. For time point t:
$${{\rm{DVARS}}}_{t}=\sqrt{\frac{1}{N}\sum _{i}{\left[{x}_{i,t}-{x}_{i,t-1}\right]}^{2}}$$Where N is the number of voxels and xi,t is the signal intensity for voxel i.
-
(v)
Mean Framewise Displacement (FD)50: FD estimates head motion by summing the absolute translational and rotational displacements across the x, y, and z axes for each time point. Lower values indicate less head motion.
$${{\rm{FD}}}_{t}=\left|\Delta {d}_{x,t}\right|+\left|\Delta {d}_{y,t}\right|+\left|\Delta {d}_{z,t}\right|+\left|\Delta {\alpha }_{t}\right|+\left|\Delta {\beta }_{t}\right|+\left|\Delta {\gamma }_{t}\right|$$Where \(\Delta {d}_{x,t}\), \(\Delta {d}_{y,t}\), and \(\Delta {d}_{z,t}\) are the translations along the x, y, and z axes, respectively, and \(\Delta {\alpha }_{t}\), \(\Delta {\beta }_{t}\), and \(\Delta {\gamma }_{t}\) are the rotations around the x, y, and z axes, respectively.
-
(vi)
Temporal SNR (tSNR)51: tSNR was computed as the mean BOLD signal intensity across the brain volume divided by its standard deviation within each voxel, across each run and for each subject. Higher values indicate better quality.
$${\rm{tSNR}}=\frac{\langle S{\rangle }_{t}}{{\sigma }_{t}}$$where \(\langle S{\rangle }_{t}\) is the average BOLD signal (across time), and \({\sigma }_{t}\) is the corresponding temporal standard-deviation map.
Figure 4 displays IQMs for T1-weighted, T2-weighted, and rfMRI data acquired at 3 T and 7 T field strengths. Except where indicated as non-significant (ns), all other within-modality comparisons between 3 T and 7 T showed statistically significant differences (Fig. 4a–c). Figures 4d–f demonstrate significant differences (p < 0.001) in DVARS and tSNR for rfMRI. Specifically, 7 T T2-weighted images generally exhibited significantly higher SNR than 3 T images, indicating improved overall signal quality. However, increased variance at 7 T suggests greater susceptibility to magnetic field inhomogeneities, potentially compromising signal quality in specific local regions compared to 3 T. EFC values remained relatively consistent across modalities, with 7 T T1-weighted images showing slightly better results, indicating lower image complexity and higher quality. For rfMRI, 3 T data exhibited a higher standard deviation of DVARS, reflecting greater motion artifacts. In contrast, 7 T data showed more stable inter-volume signal changes. Minimal differences in motion artifacts were observed between 3 T and 7 T rfMRI, as indicated by mean FD; however, 7 T displayed slightly lower values, suggesting less movement. Conversely, 3 T rfMRI showed significantly higher tSNR, reflecting greater signal stability over time.
Usage Notes
The dataset, accessible via Figshare43, provides paired whole-brain T1-weighted, T2-weighted, and rfMRI scans acquired at both 3 T and 7 T field strength, along with manual hippocampal subfield annotations for the 7 T T2-weighted images. This unique combination of multi-contrast and multi-field strength data supports a wide range of machine learning applications.
The paired data enables diverse deep learning workflows, including cross-domain tasks such as 3T-to-7T image synthesis, which translates lower-resolution 3 T scans into high-resolution 7T-like images. This enhances image detail and quality for downstream applications. The high-quality manual annotations of hippocampal subfields on the 7 T T2-weighted images serve as reliable ground truth for training and evaluating segmentation models, particularly valuable given the importance of high-resolution detail in hippocampal substructures. This facilitates the development of advanced segmentation algorithms applicable to high-resolution imaging and studies of hippocampal morphology in health.
The multi-modality nature of the dataset further allows for multi-modal research workflows integrating structural and functional brain imaging. Researchers can explore relationships between brain anatomy and functional connectivity by aligning data across T1-weighted, T2-weighted, and rfMRI modalities, enabling joint representation learning and in-depth investigation of brain structure-function interactions. This makes the dataset particularly well-suited for multi-modal deep learning applications.
While this dataset offers a valuable benchmark for comparing 3 T and 7 T imaging, it’s crucial to acknowledge its limitations. The relatively small sample size, comprising only healthy young adults, restricts the generalizability of models trained solely on this dataset. Data augmentation is therefore recommended, particularly for deep learning applications. Furthermore, researchers studying disease or diverse age groups should supplement this dataset with additional data to enhance model robustness. Despite these limitations, the dataset provides a critical baseline for understanding fundamental hippocampal structure before considering age- or disease-related variations.
Clinical applications of generated images require caution due to potential anatomical inaccuracies. While these images can augment existing imaging modalities, they should not replace established diagnostic procedures but serve as supplementary tools to enhance diagnostic insights, especially when conventional high-resolution images are unavailable. Furthermore, continuous validation and assessment processes are necessary to ensure reliability and accuracy.
Furthermore, several technical limitations inherent to 7 T MRI must be considered. Although 7 T MRI enhances signal and contrast, challenges like field inhomogeneities and signal variability need to be acknowledged, especially in precise structural imaging. In-plane dephasing and signal loss at tissue-air interfaces can introduce artifacts, particularly when imaging structures near the skull base, like the temporal lobe. As shown in Fig. 1, substantial intensity variations and loss, particularly in the temporal lobe of 7 T T2-weighted images, result from susceptibility effects and magnetic field inhomogeneity, especially at gray matter-CSF boundaries. While these artifacts minimally affect hippocampal visualization and segmentation (the dataset’s primary focus), they can impact overall image quality and potentially affect whole-brain image generation. Users are advised to exercise caution when using whole-brain data for segmentation or generative tasks, as signal loss may occur in specific regions, particularly in the temporal lobes on 7 T T2-weighted images for certain subjects. The absence of 7 T DWI sequences reflects common technical constraints at 7 T, including shorter relaxation times, increased magnetic field inhomogeneity, and higher SAR, which prevented the acquisition of high-quality DWI data with our scanner and available sequences/software.
Therefore, users should critically assess the impact of sample size and imaging artifacts when interpreting results to ensure that the conclusions drawn align with the dataset’s scope and limitations. However, we emphasize that the high-quality manual annotations, consistent protocols, and paired 3T-7T scans offer a valuable resource for early-stage model development and evaluation. Future dataset extension could include larger, more diverse samples, encompassing pathological cases, to improve generalizability.
Code availability
The brain MRI data was processed using the following open-source software packages: BIDScoin (version 4.3.0), BIDS-validator, MRIQC (version 22.0.6). No custom code has been used.
References
Hafting, T., Fyhn, M., Molden, S., Moser, M.-B. & Moser, E. I. Microstructure of a spatial map in the entorhinal cortex. Nature 436, 801–806 (2005).
Knierim, J. J. The hippocampus. Current Biology 25, R1116–R1121 (2015).
Bird, C. M. & Burgess, N. The hippocampus and memory: insights from spatial processing. Nat Rev Neurosci 9, 182–194 (2008).
Maruszak, A. & Thuret, S. P. Why looking at the whole hippocampus is not enough—a critical role for anteroposterior axis, subfield and activation analyses to enhance predictive value of hippocampal changes for Alzheimer’s disease diagnosis. Front. Cell. Neurosci. 8 (2014).
Shahid, S. S. et al. Hippocampal-subfield microstructures and their relation to plasma biomarkers in Alzheimer’s disease. Brain 145, 2149–2160 (2022).
Duvernoy, H. M., Cattin, F. & Risold, P.-Y. The Human Hippocampus: Functional Anatomy, Vascularization and Serial Sections with MRI. (Springer Berlin Heidelberg, Berlin, Heidelberg, 2013).
Iglesias, J. E. et al. A computational atlas of the hippocampal formation using ex vivo, ultra-high resolution MRI: Application to adaptive segmentation of in vivo MRI. NeuroImage 115, 117–137 (2015).
Wisse, L. E. M. et al. Automated Hippocampal Subfield Segmentation at 7T MRI. AJNR Am J Neuroradiol 37, 1050–1057 (2016).
Pipitone, J. et al. Multi-atlas segmentation of the whole hippocampus and subfields using multiple automatically generated templates. NeuroImage 101, 494–512 (2014).
Romero, J. E., Coupé, P. & Manjón, J. V. HIPS: A new hippocampus subfield segmentation method. NeuroImage 163, 286–295 (2017).
Yang, Z., Zhuang, X., Mishra, V., Sreenivasan, K. & Cordes, D. CAST: A multi-scale convolutional neural network based automated hippocampal subfield segmentation toolbox. NeuroImage 218, 116947 (2020).
DeKraker, J. et al. Automated hippocampal unfolding for morphometry and subfield segmentation with HippUnfold. eLife 11, e77945 (2022).
Poiret, C. et al. A fast and robust hippocampal subfields segmentation: HSF revealing lifespan volumetric dynamics. Front Neuroinform 17, 1130845 (2023).
Wisse, L. E. M. et al. Subfields of the hippocampal formation at 7T MRI: In vivo volumetric assessment. NeuroImage 61, 1043–1049 (2012).
Trattnig, S. et al. Key clinical benefits of neuroimaging at 7T. NeuroImage 168, 477–489 (2018).
Okada, T. et al. Neuroimaging at 7 Tesla: a pictorial narrative review. Quant Imaging Med Surg 12, 3406–3435 (2022).
De Ciantis, A. et al. 7T MRI in focal epilepsy with unrevealing conventional field strength imaging. Epilepsia 57, 445–454 (2016).
McKiernan, E. F. & O’Brien, J. T. 7T MRI for neurodegenerative dementias in vivo: a systematic review of the literature. J Neurol Neurosurg Psychiatry 88, 564–574 (2017).
De Cocker, L. J. et al. Clinical vascular imaging in the brain at 7T. Neuroimage 168, 452–458 (2018).
van der Kolk, A. G., Hendrikse, J., Zwanenburg, J. J. M., Visser, F. & Luijten, P. R. Clinical applications of 7 T MRI in the brain. European Journal of Radiology 82, 708–718 (2013).
Cai, Y., Hofstetter, S., van der Zwaag, W., Zuiderbaan, W. & Dumoulin, S. O. Individualized cognitive neuroscience needs 7T: Comparing numerosity maps at 3T and 7T MRI. Neuroimage 237, 118184 (2021).
Giuliano, A. et al. Hippocampal subfields at ultra high field MRI: An overview of segmentation and measurement methods. Hippocampus 27, 481–494 (2017).
Lötjönen, J. et al. Fast and robust extraction of hippocampus from MR images for diagnostics of Alzheimer’s disease. NeuroImage 56, 185 (2011).
Kerchner, G. A. et al. Hippocampal CA1 apical neuropil atrophy in mild Alzheimer disease visualized with 7-T MRI. Neurology 75, 1381–1387 (2010).
Thomas, B. P. et al. High-resolution 7T MRI of the human hippocampus in vivo. Journal of Magnetic Resonance Imaging 28, 1266–1272 (2008).
Berron, D. et al. A protocol for manual segmentation of medial temporal lobe subregions in 7 Tesla MRI. NeuroImage: Clinical 15, 466–482 (2017).
Bahrami, K. et al. Reconstruction of 7T-Like Images From 3T MRI. IEEE Trans Med Imaging 35, 2085–2097 (2016).
Nie, D. et al. Medical Image Synthesis with Deep Convolutional Adversarial Networks. IEEE Transactions on Biomedical Engineering 65, 2720–2730 (2018).
Qu, L., Zhang, Y., Wang, S., Yap, P.-T. & Shen, D. Synthesized 7T MRI from 3T MRI via deep learning in spatial and wavelet domains. Medical Image Analysis 62, 101663 (2020).
Do, H., Bourdon, P., Helbert, D., Naudin, M. & Guillevin, R. 7T MRI super-resolution with Generative Adversarial Network. ei 33, 106-1–106–7 (2021).
Chen, X., Qu, L., Xie, Y., Ahmad, S. & Yap, P.-T. A paired dataset of T1- and T2-weighted MRI at 3 Tesla and 7 Tesla. Sci Data 10, 489 (2023).
Esteban, O. et al. MRIQC: Advancing the automatic prediction of image quality in MRI from unseen sites. PLOS ONE 12, e0184661 (2017).
Li, X. et al. Syn_SegNet: A Joint Deep Neural Network for Ultrahigh-Field 7T MRI Synthesis and Hippocampal Subfield Segmentation in Routine 3T MRI. Ieee J Biomed Health 27, 4866–4877 (2023).
Folstein, M. F., Folstein, S. E. & McHugh, P. R. “Mini-mental state”: A practical method for grading the cognitive state of patients for the clinician. Journal of Psychiatric Research 12, 189–198 (1975).
Oldfield, R. C. The assessment and analysis of handedness: The Edinburgh inventory. Neuropsychologia 9, 97–113 (1971).
Zung, W. W. K. A Self-Rating Depression Scale. Archives of General Psychiatry 12, 63–70 (1965).
Zung, W. W. K. A Rating Instrument For Anxiety Disorders. Psychosomatics 12, 371–379 (1971).
Buysse, D. J., Reynolds, C. F., Monk, T. H., Berman, S. R. & Kupfer, D. J. The Pittsburgh sleep quality index: A new instrument for psychiatric practice and research. Psychiatry Research 28, 193–213 (1989).
Yao-xian, G. Revision of wechsler’s adult intelligence scale in china. Acta Psychol Sin 15, 121–129 (1983).
Schutte, N. S. et al. Development and validation of a measure of emotional intelligence. Personality and Individual Differences 25, 167–177 (1998).
Partington, J. E. & Leiter, R. G. Partington’s Pathways Test. Psychological Service Center Journal 1, 11–20 (1949).
Yushkevich, P. A. et al. User-guided 3D active contour segmentation of anatomical structures: Significantly improved efficiency and reliability. NeuroImage 31, 1116–1128 (2006).
Li, S., Chu, L. & Ma, B. A paired dataset of multi-modal MRI at 3 Tesla and 7 Tesla with manual hippocampal subfield segmentations on 7T T2-weighted images. Figshare+ https://doi.org/10.25452/figshare.plus.26075713.v1 (2024).
Zwiers, M. P., Moia, S. & Oostenveld, R. BIDScoin: A User-Friendly Application to Convert Source Data to Brain Imaging Data Structure. Front. Neuroinform. 15, 770608 (2022).
Koo, T. K. & Li, M. Y. A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. J Chiropr Med 15, 155–163 (2016).
Dice, L. R. Measures of the Amount of Ecologic Association Between Species. Ecology 26, 297–302 (1945).
Magnotta, V. A., Friedman, L. & FIRST BIRN. Measurement of Signal-to-Noise and Contrast-to-Noise in the fBIRN Multicenter Imaging Study. Journal of Digital Imaging 19, 140–147 (2006).
Atkinson, D., Hill, D. L., Stoyle, P. N., Summers, P. E. & Keevil, S. F. Automatic correction of motion artifacts in magnetic resonance images using an entropy focus criterion. IEEE Trans Med Imaging 16, 903–910 (1997).
Power, J. D., Barnes, K. A., Snyder, A. Z., Schlaggar, B. L. & Petersen, S. E. Spurious but systematic correlations in functional connectivity MRI networks arise from subject motion. NeuroImage 59, 2142–2154 (2012).
Jenkinson, M., Bannister, P., Brady, M. & Smith, S. Improved Optimization for the Robust and Accurate Linear Registration and Motion Correction of Brain Images. NeuroImage 17, 825–841 (2002).
Krüger, G. & Glover, G. H. Physiological noise in oxygenation-sensitive magnetic resonance imaging. Magnetic Resonance in Medicine 46, 631–637 (2001).
Acknowledgements
We extend our heartfelt thanks to the participants, whose invaluable cooperation was pivotal to the success of our research. This work was supported by National Natural Science Foundation of China (No.81471731, 81972160, 32271146) and the Startup Funds for Top-notch Talents at Beijing Normal University. This work was partially supported by National Science and Technology Innovation 2030 Major Program (2022ZD0211900, 2022ZD0211901), Youth Innovation Promotion Association CAS (2022093), and NSFC (82271985, 81961128030).
Author information
Authors and Affiliations
Contributions
Lei Chu: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Writing - Original Draft, Writing - Review & Editing, Visualization. Baoqiang Ma: Conceptualization, Data Curation, Resources, Writing - Review & Editing, Project administration. Xiaoxi Dong: Validation, Investigation, Data Curation, Writing - Review & Editing. Yirong He: Methodology, Writing - Review & Editing, Visualization. Tongtong Che: Methodology, Writing - Review & Editing, Visualization. Debin Zeng: Methodology, Writing - Review & Editing, Visualization. Zihao Zhang: Technical support for MRI acquisition, Writing - Review & Editing, Visualization. Shuyu Li: Conceptualization, Methodology, Project administration, Writing - Review & Editing Funding acquisition, Supervision.
Corresponding author
Ethics declarations
Competing interests
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Chu, L., Ma, B., Dong, X. et al. A paired dataset of multi-modal MRI at 3 Tesla and 7 Tesla with manual hippocampal subfield segmentations. Sci Data 12, 260 (2025). https://doi.org/10.1038/s41597-025-04586-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-025-04586-9