Background & Summary

The hippocampus plays an important role in clinical neuroscience due to its critical functions in long-term episodic memory consolidation, spatial navigation, and learning1,2,3. Previous studies have demonstrated that the hippocampus is not a uniform structure; rather, it is comprised of distinct subfields, each exhibiting unique functional profiles, connectivity patterns, and differential vulnerabilities to specific disease processes4,5. Precise delineation of hippocampus subfields is crucial for the diagnosis and management of various neurological and psychiatric disorders. Moreover, the picture is made more complex by the fact that the hippocampus is not a homogeneous brain structure. It is composed of two convoluted structures: the dentate gyrus (DG) and the Cornu Ammonis (CA), with the latter divided into four subdivisions (CA1–4). The subiculum (SUB) is also typically recognized as a component of the hippocampal complex. In conjunction with the entorhinal cortex (ERC), these elements constitute the hippocampal formation6. Despite the availability of automated methods for hippocampal subfield segmentation, such as FreeSurfer v6.07, ASHS8, MAGeT-Brain9, HIPS10, CAST11, HippUnfold12, HSF13, the performance of these supervised methods largely depend on the accuracy of manual segmentation, which serves as the gold standard. Such manual segmentations require substantial expertise and a considerable time investment—approximately eight hours per subject when executed by an experienced delineator14. Therefore, researchers face a notable scarcity of databases providing comprehensive manuals for hippocampal subfield segmentation to guide the development of advanced and automated segmentation methods.

Further compounding the challenge is the inherent limitation of standard 3 T MRI in resolving the fine anatomical details of hippocampal subfields due to limited signal contrast and resolution. In contrast, 7 T MRI, with its superior spatial resolution and signal-to-noise ratio (SNR), provides significantly enhanced anatomical detail15,16, improving the accuracy of various neuroimaging post-processing tasks, including tissue segmentation and cortical surface reconstruction. The ability of 7 T MRI to reveal subtle pathological changes invisible on lower-field-strength scanners further enhances its value in both research and clinical diagnostics17,18,19. Additionally, imaging techniques that utilize susceptibility-induced contrast, including functional MRI (Blood Oxygenation Level Dependent, BOLD), which benefit greatly from the ultra-high field strength20, enabling the construction of mechanistic models of cognitive function21. Even in the hippocampus, where primary structures within the hippocampal formation are visible at clinical 3 T MR scanner, encounters resolution limitations that impede the delineation of finer subfield anatomy and distinct signal profiles achievable at higher field strengths22. This is especially critical considering the hippocampus’s small volume (typically measuring 1–3 ml in elderly subjects, including those with Alzheimer’s disease) which is represented by only 1000–3000 voxels at the standard clinical resolution of 1 × 1 × 1 mm3. Moreover, with 80–90% of these voxels located on the surface, partial volume effects can substantially impact volume estimation accuracy23. 7 T MRI substantially mitigates these partial volume effects, revealing more anatomical details24 and improving the interpretation of imaging data acquired at standard clinical field strengths25. The superior SNR and ultra-high resolution of 7 T MRI enable the consistent visualization of internal features while maintaining a thin slice thickness (as fine as 1 mm), essential for revealing the intricate anatomy of the hippocampal head. Notably, high-resolution T2-weighted images at 7 T have demonstrated their efficacy in demarcating medial temporal lobe (MTL) subregions via the vivid delineation of the stratum radiatum lacunosum moleculare (SRLM), which manifests as a distinct dark band and acts as a critical marker for demarcating subregion boundaries26.

Despite their significant benefits in improving tissue contrast and anatomical detail, the integration of 7 T MRI scanners in research and clinical settings is primarily restricted by high cost and complex maintenance demands27. To circumvent these challenges and enhance the utility of the more prevalent 3 T MRI datasets, various deep learning-based 3T-to-7T image synthesis methods have been proposed, demonstrating promise in generating realistic high-resolution 7T-like images from 3 T scans27,28,29. However, for super-resolution task, obtaining a paired dataset that includes both low and high-resolution MRIs for algorithm training constitutes a practical hurdle considering most existing approaches are implemented on private datasets30. While several Generative Adversarial Networks (GANs) based synthesis models can be trained in the absence of paired images, paired data remains crucial for quantitative model validation31.

Here, we disseminate a dataset of whole-brain paired T1-weighted, T2-weighted and resting-state fMRI (rfMRI) scans acquired at 3 T and 7 T scanners, featuring manual hippocampal subfield annotations on the 7 T T2-weighted images. We provide a comprehensive description of the design, acquisition, and preparation of the dataset. Image quality is assessed using quality metrics implemented in MRIQC32. This dataset has already been utilized to develop Syn_SegNet33, an end-to-end, multitask joint deep neural network that leverages 7 T MRI synthesis to improve hippocampal subfield segmentation in 3 T MRI. We anticipate that this resource will significantly advance the field by facilitating research and development in 3T-to-7T image synthesis, T1-to-T2 image translation, and functional MRI analysis, improving the accuracy and accessibility of hippocampal subfield segmentation.

Methods

Participant recruitment

Twenty healthy volunteers (10 males and 10 females) aged 18–25 years were recruited from university. Inclusion criteria were: 1) no history of brain injury, neurological, or psychiatric disorders; 2) a Mini-Mental State Examination (MMSE)34 score ≥ 24; 3) absence of contraindications to MRI scanners, such as claustrophobia or the presence of metallic implants. All participants provided written informed consent, including permission for open sharing of anonymized data. Ethical approval was granted by the Science and Ethics Committee of the School of Biological Science and Medical Engineering in Beihang University (Application No. 20140304), in accordance with the Declaration of Helsinki.

Additional demographic information collected included handedness (all participants were right hand dominant, confirmed using the Edinburgh Handedness Inventory35) and years of education (mean = 17.25 years, SD = 0.69).

Ten questionnaires were administered in the following order: Basic Information Questionnaire, Edinburgh Handedness Inventory (EHI), MMSE, Self-Rating Depression Scale (SDS)36, Self-Rating Anxiety Scale (SAS)37, Pittsburgh sleep quality index (PSQI)38, Wechsler Adults Intelligence Scale (WAIS-RC)39, Raven’s Progression Matrices (RPM), The Schutte Self Report Emotional Intelligence Test (SSEIT)40, and The Trail Making Test (TMT)41. Detailed demographic and questionnaire data are presented in Table 1.

Table 1 Demographic information and questionnaire results of the 20 participants.

Image acquisition

MRI data were acquired from each participant using both a 3 T Prisma and a 7 T MAGNETOM MR scanner (Siemens Healthineers, Erlangen, Germany) on the same day at the Beijing MRI Center for Brain Research (BMCBR). A 32-channel receive and single-channel transmit head coil (Nova Medical, Wilmington, MA, USA) was used for 7 T scans, and a 64-channel phased-array head coil (Siemens Healthineers, Erlangen, Germany) was used for 3 T scans.

3 T sequences

T2-weighted turbo spin echo (TSE) sequence was obtained with a resolution of 0.9 × 0.9 × 1.9 mm3. The sequence parameters were: repetition time (TR) = 11050 ms, echo time (TE) = 94 ms, flip angle (FA) = 60°, bandwidth = 100 Hz/Px, and acquisition matrix size = 256 × 256 × 90. One volume acquired perpendicular to the hippocampus long axis.

An isotropic 1mm3 T1-weighted sagittal 3D magnetization-prepared rapid gradient echo (MPRAGE) sequence was acquired along the long axis of the hippocampus, with the following settings: TR = 2300 ms, inversion time (TI) = 1000 ms, TE = 2.26 ms, FA = 8°, and acquisition matrix size = 256 × 224 × 192.

Whole-brain resting-state fMRI was obtained using a 2D echo-planar imaging (EPI) sequence with the following scanning parameters: FA = 50°, TR = 600 ms, TE = 30 ms, resolution = 3.0 × 3.0 × 3.0 mm3, and acquisition matrix size = 80 × 80 × 48. One run with a total of 800 volumes was acquired.

The diffusion-weighted imaging (DWI) scan was acquired using a 2D EPI sequence with a slice thickness of 1.3 mm and the same spacing between slices. The parameters were: TR = 5500 ms, TE = 82 ms, and flip angle = 90°. The acquisition included a multiband acceleration factor of 3 and a parallel reduction factor in-plane of 2. The base resolution and acquisition matrix were both 162, with an echo train length of 61. The EPI factor used was 128.

7 T sequences

T2-weighted Sampling Perfection with Application-optimized Contrasts by using flip angle Evolution (SPACE) sequence: 3D, 0.4 × 0.4 × 1.0 mm³, TR = 3000 ms, TE = 387 ms, FA = 120°, bandwidth = 279 Hz/Px, matrix = 480 × 512 × 224. Two volumes acquired (with and without non-distortion correction, with and without a suffix of ‘ND’ in filename).

T1-weighted MPRAGE: Sagittal, isotropic 0.7 mm³, TR = 2600 ms, TI = 1050 ms, TE = 2.72 ms, FA = 5°, matrix = 320 × 320 × 256. Two volumes acquired (with and without non-distortion correction).

Resting-state fMRI (EPI): 3D, FA = 45°, TR = 1000 ms, TE = 21 ms, resolution = 1.5 × 1.5 × 1.5 mm³, matrix = 128 × 128 × 88. One run of 800 volumes.

Segmentation of hippocampal subfields

We followed the protocol established by Wisse et al.14, which integrates elements from pre-existing methods with new rules to ensure coverage of all subfields throughout the entire length of the hippocampal formation. The hippocampal formation subfields, including the ERC, SUB, CA1, DG-CA4, CA2, CA3, and the hippocampal tail, were manually delineated. Segmentation was performed bilaterally on every coronal section of the subjects’ ultra-high-resolution 7 T T2-weighted MRI scans using ITK-SNAP software42 (version 3.8.0, www.itksnap.org), where 7 T T2-weighted images have been upsampled to a 0.7 mm slice thickness utilizing B-spline interpolation, after Gaussian smoothing denoising and N4 bias field correction with Advanced Normalization Tools (ANTs). Figure 1 displays MRI images from four different modalities of a single subject, whereas Fig. 2 illustrates the manual segmentation results of the left hippocampus alongside its corresponding 3D reconstruction.

Fig. 1
figure 1

Paired T1-weighted and T2-weighted MRI scans at 3 T and 7 T acquired for a subject (sub-10), the left hippocampus of the subject was shown for multiple views and multiple modalities. The final two rows depict zoomed-in coronal views of the hippocampus from head to body at three different slices, comparing 3 T and 7 T T2-weighted images. The 3 T images show lower resolution and significant partial volume effects, while the 7 T images, with enhanced resolution, more clearly demonstrate the hippocampal internal architecture, outer boundaries, and inner boundaries between subfields, with the hypointense line appearing more consistent and visible compared to 3 T.

Fig. 2
figure 2

Manual segmentation of the hippocampal formation in subject 12. (a) A sagittal view illustrating references to the sequential coronal sections. Coronal images depicting the hippocampal formation and accompanying manual segmentation are presented from anterior to posterior, labeled as 2c-2j. (b) A 3D reconstruction viewed from an anterolateral angle highlights the segmented hippocampal subregions. (cj) For each coronal section, the top row displays the 7 T T2-weighted images, while the bottom row depicts manual delineation of hippocampal subfields. A detailed zoom-in view (seen in image 1 g) demonstrates the delineation of the border between CA2 and CA3 by drawing a virtual square. Abbreviations: ERC, entorhinal cortex; SUB, subiculum; CA, Cornu Ammonis; DG, dentate gyrus.

It should be noted that the implementation of the original protocol required adaptations for use with ITK-SNAP software. Specifically, the delineation of hippocampal subfields was adjusted to accommodate ITK-SNAP’s voxel-by-voxel approach, without modifying the underlying protocol principles. This is in contrast to the freehand spline drawing technique described in the reference protocol by Wisse et al.14. The subfields were manually traced on coronal images from anterior-to-posterior, beginning with Fig. 2c and continuing to 2j on coronal images. The head of the hippocampus is depicted in Fig. 2d–h, the body in Fig. 2i, and the tail in Fig. 2j. These modifications are detailed below, following the sequence of annotation.

  1. (i)

    The border between the SUB and CA1 in the most anterior slices of the hippocampus was determined by measuring the longest diameter from the medial to the lateral point (Fig. 2d,e). With the emergence of the DG, this border changed, and the SUB became laterally bordered by CA1 at the most medial point of the DG. Given the small size of the DG in early slices, a practical approach involved extrapolating backwards from subsequent levels where the DG was larger and examining the region from multiple perspectives, especially in the sagittal view.

  2. (ii)

    The delineation of CA2 and CA3 subfields commenced at a position 1.4 mm anterior to the separation point of the uncus from the hippocampus on coronal images, where only the fimbria exhibited attachment to both structures as per the previous protocol. To ensure accuracy and consistency, delineation started on a slice located two slices ahead, assuming a slice thickness of 0.7 mm (Fig. 2g,h).

  3. (iii)

    As outlined in the protocol14, the CA2-CA3 border was established by defining the medial side of a virtual square. This square was positioned with its lateral side touching the border of CA1 and its superior side aligned along the superior border of CA2. To ensure accuracy and consistency during the segmentation process using ITK-SNAP, a paintbrush size of 4 was employed. The delineation of the virtual square between CA2 and CA3 was accomplished using 4 × 4 voxel squares in most instances, as this size matched the thickness of CA2 at its boundary with CA3 (Fig. 2g).

  4. (iv)

    The delineation of hippocampal subfields encompassed the entire length of the long axis of the hippocampal formation. However, due to the limited visibility of the total length of the fornix beyond a certain section, reliable delineation of fused subfields was not feasible. To address this limitation and maintain consistency, the last 4–6 slices were labeled as “Tail” (Fig. 2j) in accordance with a previously published protocol14.

Data Records

The dataset is available at Figshare+43. The raw MRI data are stored in.ima format within the ‘rawdata_DICOM_3T/7T’ directories. The rawdata DICOM 7 T data are further divided into four parts, each containing raw DICOM data for 5 subjects. Additionally, the acquired MRI scans have been converted from DICOM to the Neuroimaging Informatics Technology Initiative (NIfTI) format and organized in accordance with the Brain Imaging Data Structure (BIDS) format by employing the BIDScoin Python application (version 4.3.0)44. These files are stored in the ‘rawdata_BIDS_3T/7T’ directories.

Hippocampal subfield segmentation related files can be found in the ‘hippo_subfield\7T_T2-weighted_0.7_for_subfield_delineation’ directory. The ‘hippo_subfield\hippo_label’ directory contains manual segmentation labels for seven hippocampal subregions for each subject. Detailed information on segmentation labels, including label values and colors for each hippocampal subfield is documented in ‘hippo_subfield\subfield label.txt’.

The ‘MRIQC_3T/7T’ directories store the results of quality control assessments. For each participant, the ‘MRIQC_3T/7T\sub*’ directories contain \anat and \func subdirectories, which hold image quality metric reports for T1-weighted, T2-weighted, rfMRI scans. These quality metrics, available in both.html and.json formats, are instrumental in evaluating data quality and provide estimates of motion, SNR, and intensity non-uniformities, supplemented with visual reports in.svg format in the ‘\figure’ subdirectory.

Notably, due to heavy head motion during the first scans, the 3T T2-weighted images for two participants (sub 06 and 12) were rescanned. Only the datasets from these subsequent sessions are retained in the ‘rawdata\BIDS’ directory for further quality evaluation. Figure 3 illustrates the directory structure of our dataset.

Fig. 3
figure 3

The directory structure of the dataset. (a) Overview of all the main folders included in the shared dataset. (b) The folder ‘T1_T2w_0.7_for_subfield_delineation’ contains 7 T T2-weighted images of each participant, which were used for hippocampal subfield delineation. (c) Contents of the ‘hippo_label’ folder, including subfield label files of each subject. (d) The directory structure for the 3T-7T paired dataset, conforming to BIDS standards. (e) Detailed image quality reports for T1-weighted, T2-weighted, and resting-state fMRI data, generated with MRIQC, are included. (f) The ‘rawdata_DICOM_3T’ and ‘rawdata_DICOM_7T’ folders contain anonymized DICOM data for 3 T and 7 T, respectively, without additional processing.

Technical Validation

Hippocampal subfield segmentation on 7 T T2-weighted high resolution structural images

High-resolution 7T T2-weighted structural images were acquired to enable precise manual segmentation of hippocampal subfields. High image quality ensured the visibility of crucial anatomical landmarks, facilitating accurate delineation in all participants. To assess inter- and intra-rater reliability, two experienced medical experts (L.C. and X.D.) independently segmented the hippocampal subfields of ten subjects (50% of the sample) on two separate occasions, one month apart. This procedure established proficiency with the segmentation protocol. Subsequently, these raters collaboratively produced the final manual segmentation for all twenty subjects, reaching consensus on any discrepancies through discussion.

Reliability of the hippocampal subfield segmentations was evaluated using inter- and intra-rater reliability measures: Intraclass Correlation Coefficients (ICC)45 and Dice Similarity Coefficients (DSC)46. These metrics were calculated for each subfield, separately for the left hemisphere, right hippocampus, and bilaterally.

Intra-rater reliability, reflecting the consistency of individual rater performance, was evaluated using both ICCs and DSCs. ICCs were calculated as:

$${\rm{ICC}}=\frac{M{S}_{\text{R}}-M{S}_{\text{E}}}{M{S}_{\text{R}}+(M{S}_{\text{C}}-M{S}_{\text{E}})/n}$$

Where MSR represents the mean square between subjects. MSE (mean square error) represents the unexplained variance attributable to random error. MSC represents the mean square between the two independent tracing sessions. n represents the number of subjects. DSCs were calculated as:

$$\mathrm{DSC}=\frac{2\times |X\cap Y|}{|X|+|Y|}$$

where X and Y represent the voxel sets from the two tracing sessions for a given subject, |X ∩ Y| denotes the number of overlapping voxels, and |X| and |Y| represent the total number of voxels in each segmentation. For intra-rater reliability, both ICCs and DSCs ranged from 0 to 1, with higher values indicating greater reliability. We considered ICC values of 0.75–0.90 as good, and values ≥ 0.90 as excellent.

Inter-rater reliability, reflecting the agreement between the two independent raters, was similarly assessed using ICCs and DSCs. The ICC calculation remained consistent, with MSC now representing the mean square between the two raters. The DSC calculation also remained unchanged. For inter-rater reliability, both ICCs and DSCs ranged from 0 to 1, with higher values indicating greater agreement.

Table 2 presents the inter- and intra-rater reliability analyses. The results align with those of the reference protocol14. For intra-rater reliability, our enhanced analysis yielded a DSC of 0.810 for the ERC segment, slightly below the 0.83 reported by Wisse et al.14. Similarly, the CA2 segment’s DSC was 0.80, marginally lower than the 0.83 reported by Wisse et al.14. Conversely, our DSC values for the CA1 and SUB segments(0.875, respectively) closely matched the 0.85 and 0.84 values reported by Wisse et al.14 Regarding intra-rater ICCs, our scores for the ERC, SUB, and CA1 segments are 0.78, 0.96, and 0.97, respectively, marginally below the 0.80, 0.97, and 0.98 reported by Wisse et al.14 The inter-rater reliability findings mirrored the intra-rater results but exhibited slightly lower values.

Table 2 ICC and DSI of manual segmentations, intra-rater reliability of a single manual rater (L.C.), and interrater reliability of 2 independent manual raters.

Image quality metrics (IQMs) for MRI data

All participants underwent an initial quality assessment of their neuroimaging data. We ensured that the datasets conformed to the BIDS standard using the BIDS-validator (version 1.14.3; available at https://github.com/bids-standard/bids-validator). Prior to quality assessment of the T1-weighted, T2-weighted, and rfMRI sequences, we used MRIQC32 to generate subject-specific visual quality assessment reports. These reports facilitated a comprehensive evaluation of processing quality. Detailed IQMs, calculated for all structural and functional imaging series using MRIQC are included in this data release. Six representative IQMs, presented in Fig. 4, are detailed below.

Fig. 4
figure 4

Violin plots illustrating the distribution of the six Image Quality Metrics (IQMs) across different acquisition protocols. Metrics include (a) Signal-to-Noise Ratio (SNR); (b) Entropy Focus Criterion (EFC); (c) Contrast-to-Noise Ratio (CNR); (d) Standardized DVARS (DVARS_std). (e) Total Signal-to-Noise Ratio (tSNR); and (f) Mean Framewise Displacement (FD-mean). Data collected at 3 T is shown in orange, while 7 T data is shown in purple. Asterisks (****) indicate that the p-value of the group difference is less than 0.001, while ‘ns’ denotes non-significant differences. For panels (ac), statistically significant differences between 3 T and 7 T data are observed, except for T2-weighted images. The FD-mean metric shows no significant group difference between 3T and 7T resting-state functional MRI acquisitions.

Spatial anatomical and functional Metrics:

  1. (i)

    Signal-to-Noise Ratio (SNR)47: SNR was calculated within each tissue mask including gray matter (GM), white matter (WM), and cerebrospinal fluid (CSF) masks. And the whole-brain averages (means of GM, WM, and CSF values) are reported in Fig. 4. Higher values indicate superior image quality. The estimation may be provided with only one foreground region in which the noise is computed as follows:

    $${\rm{SNR}}=\frac{{\mu }_{F}}{{\sigma }_{F}\sqrt{N/(N-1)}}$$

    where μF is the mean intensity of the foreground and σF is the standard deviation of the same region. N is the number of voxels in foreground mask.

  2. (ii)

    Entropy Focus Criteria (EFC)48: EFC was derived from the Shannon entropy of voxel intensities relative to the maximum potential entropy for an image of similar size. Lower EFC indicates less ghosting and motion-induced blurring due to head movement during scanning, corresponding to enhanced image quality.

    $${\rm{EFC}}=\left(\frac{N}{\sqrt{N}}\log {\sqrt{N}}^{-1}\right){\rm{E}}$$

    where E is expressed as:

    $${\rm{E}}=-\mathop{\sum }\limits_{j=1}^{N}\frac{{x}_{j}}{{x}_{\max }}\mathrm{ln}\left[\frac{{x}_{j}}{{x}_{\max }}\right]$$

    Where N is the total number of voxels (i.e., independent voxels in the image), xj represents the intensity value of the j-th voxel, typically the grayscale value or signal intensity of that voxel in the image. xmax is the total energy of all voxels, calculated as: \({x}_{\max }=\sqrt{\mathop{\sum }\limits_{j=1}^{N}{x}_{j}^{2}}\)

.

Spatial anatomical Metrics:

  1. (iii)

    Contrast-to-Noise Ratio (CNR)47: Specific to structural MRI, CNR quantifies the distinguishability between tissue types (e.g., GM and WM). Higher CNR values indicate better tissue contrast.

    $${\rm{CNR}}=\frac{\left|{\mu }_{{\rm{GM}}}-{\mu }_{{\rm{WM}}}\right|}{\sqrt{{\sigma }_{B}^{2}+{\sigma }_{{\rm{WM}}}^{2}+{\sigma }_{{\rm{GM}}}^{2}}}$$

    where σB is the standard deviation of the noise distribution within the air (background) mask. μWM is the mean of signal within the WM mask, μGM is the mean signal within the GM mask. σWM is the standard deviation within the WM mask. σGM is the standard deviation within the GM mask.

Temporal functional Metrics:

  1. (iv)

    Standardized DVARS49: This metric reflects the average voxel-wise change in signal intensity between consecutive rfMRI volumes, normalized with the standard deviation of the temporal difference time series. Lower standardized DVARS values indicate better image quality. For time point t:

    $${{\rm{DVARS}}}_{t}=\sqrt{\frac{1}{N}\sum _{i}{\left[{x}_{i,t}-{x}_{i,t-1}\right]}^{2}}$$

    Where N is the number of voxels and xi,t is the signal intensity for voxel i.

  2. (v)

    Mean Framewise Displacement (FD)50: FD estimates head motion by summing the absolute translational and rotational displacements across the x, y, and z axes for each time point. Lower values indicate less head motion.

    $${{\rm{FD}}}_{t}=\left|\Delta {d}_{x,t}\right|+\left|\Delta {d}_{y,t}\right|+\left|\Delta {d}_{z,t}\right|+\left|\Delta {\alpha }_{t}\right|+\left|\Delta {\beta }_{t}\right|+\left|\Delta {\gamma }_{t}\right|$$

    Where \(\Delta {d}_{x,t}\), \(\Delta {d}_{y,t}\), and \(\Delta {d}_{z,t}\) are the translations along the x, y, and z axes, respectively, and \(\Delta {\alpha }_{t}\), \(\Delta {\beta }_{t}\), and \(\Delta {\gamma }_{t}\) are the rotations around the x, y, and z axes, respectively.

  3. (vi)

    Temporal SNR (tSNR)51: tSNR was computed as the mean BOLD signal intensity across the brain volume divided by its standard deviation within each voxel, across each run and for each subject. Higher values indicate better quality.

    $${\rm{tSNR}}=\frac{\langle S{\rangle }_{t}}{{\sigma }_{t}}$$

    where \(\langle S{\rangle }_{t}\) is the average BOLD signal (across time), and \({\sigma }_{t}\) is the corresponding temporal standard-deviation map.

Figure 4 displays IQMs for T1-weighted, T2-weighted, and rfMRI data acquired at 3 T and 7 T field strengths. Except where indicated as non-significant (ns), all other within-modality comparisons between 3 T and 7 T showed statistically significant differences (Fig. 4a–c). Figures 4d–f demonstrate significant differences (p < 0.001) in DVARS and tSNR for rfMRI. Specifically, 7 T T2-weighted images generally exhibited significantly higher SNR than 3 T images, indicating improved overall signal quality. However, increased variance at 7 T suggests greater susceptibility to magnetic field inhomogeneities, potentially compromising signal quality in specific local regions compared to 3 T. EFC values remained relatively consistent across modalities, with 7 T T1-weighted images showing slightly better results, indicating lower image complexity and higher quality. For rfMRI, 3 T data exhibited a higher standard deviation of DVARS, reflecting greater motion artifacts. In contrast, 7 T data showed more stable inter-volume signal changes. Minimal differences in motion artifacts were observed between 3 T and 7 T rfMRI, as indicated by mean FD; however, 7 T displayed slightly lower values, suggesting less movement. Conversely, 3 T rfMRI showed significantly higher tSNR, reflecting greater signal stability over time.

Usage Notes

The dataset, accessible via Figshare43, provides paired whole-brain T1-weighted, T2-weighted, and rfMRI scans acquired at both 3 T and 7 T field strength, along with manual hippocampal subfield annotations for the 7 T T2-weighted images. This unique combination of multi-contrast and multi-field strength data supports a wide range of machine learning applications.

The paired data enables diverse deep learning workflows, including cross-domain tasks such as 3T-to-7T image synthesis, which translates lower-resolution 3 T scans into high-resolution 7T-like images. This enhances image detail and quality for downstream applications. The high-quality manual annotations of hippocampal subfields on the 7 T T2-weighted images serve as reliable ground truth for training and evaluating segmentation models, particularly valuable given the importance of high-resolution detail in hippocampal substructures. This facilitates the development of advanced segmentation algorithms applicable to high-resolution imaging and studies of hippocampal morphology in health.

The multi-modality nature of the dataset further allows for multi-modal research workflows integrating structural and functional brain imaging. Researchers can explore relationships between brain anatomy and functional connectivity by aligning data across T1-weighted, T2-weighted, and rfMRI modalities, enabling joint representation learning and in-depth investigation of brain structure-function interactions. This makes the dataset particularly well-suited for multi-modal deep learning applications.

While this dataset offers a valuable benchmark for comparing 3 T and 7 T imaging, it’s crucial to acknowledge its limitations. The relatively small sample size, comprising only healthy young adults, restricts the generalizability of models trained solely on this dataset. Data augmentation is therefore recommended, particularly for deep learning applications. Furthermore, researchers studying disease or diverse age groups should supplement this dataset with additional data to enhance model robustness. Despite these limitations, the dataset provides a critical baseline for understanding fundamental hippocampal structure before considering age- or disease-related variations.

Clinical applications of generated images require caution due to potential anatomical inaccuracies. While these images can augment existing imaging modalities, they should not replace established diagnostic procedures but serve as supplementary tools to enhance diagnostic insights, especially when conventional high-resolution images are unavailable. Furthermore, continuous validation and assessment processes are necessary to ensure reliability and accuracy.

Furthermore, several technical limitations inherent to 7 T MRI must be considered. Although 7 T MRI enhances signal and contrast, challenges like field inhomogeneities and signal variability need to be acknowledged, especially in precise structural imaging. In-plane dephasing and signal loss at tissue-air interfaces can introduce artifacts, particularly when imaging structures near the skull base, like the temporal lobe. As shown in Fig. 1, substantial intensity variations and loss, particularly in the temporal lobe of 7 T T2-weighted images, result from susceptibility effects and magnetic field inhomogeneity, especially at gray matter-CSF boundaries. While these artifacts minimally affect hippocampal visualization and segmentation (the dataset’s primary focus), they can impact overall image quality and potentially affect whole-brain image generation. Users are advised to exercise caution when using whole-brain data for segmentation or generative tasks, as signal loss may occur in specific regions, particularly in the temporal lobes on 7 T T2-weighted images for certain subjects. The absence of 7 T DWI sequences reflects common technical constraints at 7 T, including shorter relaxation times, increased magnetic field inhomogeneity, and higher SAR, which prevented the acquisition of high-quality DWI data with our scanner and available sequences/software.

Therefore, users should critically assess the impact of sample size and imaging artifacts when interpreting results to ensure that the conclusions drawn align with the dataset’s scope and limitations. However, we emphasize that the high-quality manual annotations, consistent protocols, and paired 3T-7T scans offer a valuable resource for early-stage model development and evaluation. Future dataset extension could include larger, more diverse samples, encompassing pathological cases, to improve generalizability.