Introduction

Quantitative computed tomography (QCT) is widely used to measure spinal bone mineral density (BMD) for diagnosing osteoporosis. QCT calculates BMD using 3D data, avoiding interference from overlapping blood vessels and organs1. Previous studies have shown that BMD measured by QCT is more accurate than dual-energy X-ray absorptiometry, mainly due to better correlation with clinical indicators and more sensitive to changes in bone mineral metabolism1,2,3,4.

However, several factors can cause errors in BMD measurements based on QCT, such as imaging equipment, acquisition parameters, patient variables, calibration methods, etc.5. Currently, 120-kVp polychromatic X-rays are predominantly used in QCT examinations, with beam hardening being a major contributing factor to measurement errors. Based on the attenuation characteristics of high and low energy X-rays, dual-energy CT (DECT) can synthesize virtual images similar to those obtained by monochrome X-rays at different energy levels, which is called virtual monochromatic images (VMI) or virtual monoenergetic images6. The VMIs at 65–75 keV has been shown to have similar attenuation compared to 120-kVp polychromatic images, with less image noise and improved contrast-to-noise ratio7. Several studies have demonstrated the superiority of DECT in bone mineral density measurements, as well as the accuracy of BMD values based on spectral data8,9. While VMI has shown potential for artifact reduction in CT-based BMD assessment, its systematic integration with established phantom-calibrated QCT protocols remains underexplored, particularly regarding the optimization of energy levels for minimizing measurement variability across diverse clinical scenarios.

In this paper, we compared the BMD measurements of conventional 120-kVp polychromatic images with VMIs at 60–76 keV by using a vertebral phantom with known hydroxyapatite (HA) concentration, and explored the optimal VMI level for BMD measurement based on QCT. Meanwhile, we simulated different imaging protocol and subject body types by varying the radiation dose and the size of the region of interest (ROI) to verify whether these factors have an impact on the optimal VMI level.

Materials and methods

As a phantom study, this work did not involve any animals or subjects, so it did not require approval from the local ethics committee.

Phantom

An anthropomorphic European Spine Phantom (ESP, QRM-50100, QRM GmbH, Moehrendorf, Germany) was used10. The ESP comprises HA-containing epoxy resin, which can be modeled as HA and water for X-ray attenuation analysis11. To cover the full physiological range of BMD for spongy and cortical bone across all age groups, the phantom contains three inserts representing trabecular bone compartments: first lumbar vertebra (L1), with a HA concentration of 50 mg/cm3; second lumbar vertebra (L2), with a HA concentration of 100 mg/cm3; and third lumbar vertebra (L3), with a HA concentration of 200 mg/cm3. CT images of ESP are shown in Fig. 1, with L1, L2, and L3 successively from top to bottom, showing the difference in HA concentration among the three vertebral inserts.

Fig. 1
figure 1

European spine phantom (ESP). Photograph (a); Axial (b), Sagittal (c), and Coronal (d) CT tomographic images.

QCT calibration and image acquisition

Imaging procedures were performed using a 256-detector-row CT scanner (Revolution CT, GE Healthcare, Waukesha, WI, USA) equipped with rapid tube voltage switching capability. Both conventional and dual-energy acquisition modes were employed for ESP imaging. The conventional acquisition protocol utilized a fixed 120-kVp tube voltage. Dual-energy acquisitions employed instantaneous tube voltage switching between 80 and 140 kVp, and set precise VMI reconstruction parameters for each scan (the reconstruction energy level range is 60–76 keV, and only the image of one energy level is reconstructed for each scan). CT scans were performed at three distinct volume CT doses index (CTDIvol) levels: 12 mGy, 15 mGy, and 24.4 mGy. These values were selected to represent the first, second, and third quartiles of radiation dose distributions specified for lumbar spine diagnostic protocols under Chinese National Occupational Health Standards12. Comprehensive scanning parameters are detailed in Table 1.

Table 1 Protocols for quantitative computed tomography in this study.

Prior to each experimental scan, system calibration was performed according to manufacturer specifications. The standard QCT calibration phantom (model 4, Mindways, Austin, TX, USA) underwent sequential conventional (120 kVp) and dual-energy (80/140 kVp) protocol scans. Acquired DICOM data were subsequently processed through the quality control module of the QCT software (CliniQCT, version 5.0, Mindways, Austin, TX, USA). Successful calibration was confirmed by system-generated “PASS” verification. Before each scan begins, Model4 needs to be used to calibrate the subsequent scan parameters. After the calibration is passed, an experimental scan of ESP is conducted. In the dual-energy scanning mode, each time the VMI reconstruction parameters are changed, a new calibration scan is required.

All the scanning protocols were scanned five times repeatedly in the experimental procedures. The average of the measured values of the five scanning data was included in the statistics to ensure the reliability of the data.

BMD measurement

All the scanning data were transmitted successively to the QCT software for BMD measurement. For each scan protocol, two sizes of ROI were used to measure BMD: the default size (249.8 mm2) and the maximum size (470 mm2). All ROIs were centrally placed in the bone trabecular compartment, ensuring that the position did not shift. The software automatically calculates BMD values based on 3D data according to the outlined ROI, and all BMD values are recorded.

Statistical analysis

All statistical analyses were performed using IBM SPSS Statistics (V22.0, IBM Corporation, Armonk, NY, USA), GraphPad Prism (V9.3.1, GraphPad Software, Boston, USA) and R software (V4.4.0, R Foundation for Statistical Computing, Vienna, Austria).

All quantitative data were tested for normality using the Shapiro–Wilk test, with quantitative data conforming to normal distribution expressed as mean ± standard deviation, quantitative data not conforming to normal distribution expressed as median (interquartile range). The Wilcoxon signed rank test was used to compare BMD measurements between different ROI sizes (default and maximum) at the same radiation dose. The one-sample T-test was used to compare BMD measurements with true HA concentrations. Spearman’s correlation coefficient (rs) described the relationship between VMI energy levels and BMD measurements. Univariate linear models were used to compare BMD differences between radiation doses (12 mGy, 15 mGy, and 24.4 mGy) at the same ROI size.

Next, ordinary least squares (OLS) regression was used to explore the optimal VMI level under the influence of ROI size and radiation dose on BMD measurements. Considering the stability of BMD values measured at the same voltage, we further developed an estimate, mean squared error (MSE): MSE = B2 + V. In this formula, V represents variance, overpowering the previous estimate by combining both bias and variance. We compared the MSE of VMIs at different keV level and determined the optimal VMI level, that is, the VMI level with the lowest MSE.

In this study, the test level α = 0.05, that is, P < 0.05 was considered to have statistical difference.

Results

In this study, we measured BMD of three lumbar inserts based on all CT images (both 120-kVp images and VMIs at 60–76 keV) with different ROI sizes and radiation doses. The measurement results are shown in Fig. 2 and Table 2. The statistical results in Table 2 showed that differences in ROI and radiation dose lead to differences in BMD measurements (P < 0.001), and BMD measurements may be affected by ROI and radiation dose. Figure 2 shows BMD measurements for all imaging protocols (scan mode, ROI size, radiation dose). For VMIs at 60–76 keV, BMD measurements gradually decrease as energy levels increase (rs: − 0.990 to − 0.998, P < 0.001, Table 3).

Fig. 2
figure 2

BMD measurements based on 120-kVp images and VMIs at 60–76 keV with different ROI sizes and radiation doses. The dashed lines represent the true HA concentration of the corresponding vertebral insert. BMD bone mineral density, HA hydroxyapatite, kVp peak tube voltage, ROI region of interest, VMI virtual monochromatic image.

Table 2 Comparison of the BMD measurements based on all CT images with different ROI sizes and radiation doses.
Table 3 Spearman’s correlation coefficient between VMI levels and BMD measurements.

The mean BMD values measured based on 120-kVp images and VMIs at 72–76 keV are listed in Table 4, and we conducted a one-sample T-test between them and the real HA concentration (Table S1). One-sample T-test results showed that BMD measurements based on 120-kVp images were higher than the true HA concentration of the corresponding vertebral inserts (P < 0.001).

Table 4 BMD measurements based on 120-kVp images and VMIs at 72–76 keV.

There was no statistical difference between BMD values measured based on part of VMIs and the true HA concentration (Table S1): (1) For default ROI and 12 mGy, BMD measurements based on VMIs at 73–74 keV showed no statistical difference with HA concentration in the L1 (P74keV = 0.244, P73keV = 0.684); (2) For default ROI and 12 mGy, there was no statistical difference between BMD measurements based on 75-keV VMI and HA concentration in L2 (P = 0.139); (3) For default ROI and 12 mGy, there was no statistical difference between BMD measurements based on 74-keV VMI and HA concentration in L3 (P = 0.976); (4) For default ROI and 15 mGy, there was no statistical difference between BMD measurements based on VMIs at 73–74 keV and HA concentration in L1 (P74keV = 0.193, P73keV = 0.467); (5) For default ROI and 24.4 mGy, there was no statistical difference between BMD measurements based on 73-keV VMI and HA concentration in L1 (P = 0.08); (6) For default ROI and 24.4 mGy, there was no statistical difference between BMD measurements based on 75-keV VMI and HA concentration in L2 (P = 0.289); (7) For max ROI and 12 mGy, there was no statistical difference between BMD measurements based on VMIs at 73–74 keV and HA concentration in L1 (P74keV = 0.210, P73keV = 0.092); (8) For max ROI and 12 mGy, there was no statistical difference between BMD measurements based on 74-keV VMI and HA concentration in L3 (P = 0.235); (9) For max ROI and 15 mGy, there was no statistical difference between BMD measurements based on VMIs at 73–74 keV and HA concentration in L1 (P74keV = 0.071, P73keV = 0.910); (10) For max ROI and 15 mGy, there was no statistical difference between BMD measurements based on 74-keV VMI and HA concentration in L3 (P = 0.054); (11) For max ROI and 24.4 mGy, there was no statistical difference between BMD measurements based on 75-keV VMI and HA concentration in L2 (P = 0.846).

To maintain control over variables and ensure measurement stability, OLS regression was used to explore the optimal VMI level. The results of OLS regression showed that 74-keV VMI had the lowest MSE with different ROI sizes and radiation doses (Fig. 3). The lowest MSE indicates that the disparity between bone mineral density (BMD) values obtained from the 74-keV VMI and the actual hydroxyapatite (HA) concentration is minimal, thereby signifying optimal measurement stability.

Fig. 3
figure 3

The MSE of BMD measurements based on VMIs at 60–76 keV. BMD bone mineral density, MSE mean squared error, ROI region of interest, VMI virtual monochromatic image.

Discussion

In this phantom study, we found that BMD measurements based on clinically commonly used 120-kVp images were slightly higher than HA concentrations in vertebrae. In addition, 74-keV VMI provided the most accurate and stable measurement of BMD, which provide immediately applicable guidance for centers utilizing existing QCT workflows.

Despite emerging alternatives, QCT is widely used in diagnosing and studying osteoporosis because it can measure the true volumetric BMD without being affected by other overlapping organs4. There are three commonly used QCT systems for BMD measurement13: (1) Synchronous calibration phantom-based system; (2) Asynchronous calibration phantom-based system; (3) Self-calibration phantom-less system. Synchronous calibration phantom-based systems require a phantom to be placed under the subject’s body during scanning to correct the formula for attenuation values and BMD measurements, but are generally more susceptible to beam hardening and scattering effects14. The asynchronous calibration phantom-based system, used in our study, performs phantom-based calibration before scanning, thus avoiding errors caused by phantoms during scanning15. Self-calibration phantom-less systems are a recent research trend, using the subject’s own fat and muscle attenuation as the reference to calculate BMD values16,17. However, regardless of the QCT system used, achieving a 100% match between BMD measurements and true HA concentration is still challenging due to the inherent limitations of CT imaging principles.

In routine clinical QCT, BMD is measured with polychromatic X-rays (120-kVp)18. Due to the beam hardening effect, lower energy photons are preferentially absorbed by tissues as X-rays traverse the human body, leads to an elevation in the average energy of the rays and instability in CT values. Similarly, in our study, although the QCT system software was calibrated with the standard calibration phantom (model 4) at the beginning of this study, there was still a deviation in the BMD measurement of ESP based on 120-kVp images. This inconsistency could stem from differences in the volumes of the two models and the varying impact of beam hardening effects caused by peripheral substances5,19. Consequently, our study revealed that 120-kVp images overestimated BMD values, which may also occur in routine clinical examinations.

VMIs reconstructed by DECT based on high and low energy datasets can effectively reduce the beam-hardening artifacts and anti-correlated noise, and VMIs at 65–75 keV have similar image quality and attenuation compared with 120-kVp images, suggesting that VMI has application value in BMD measurement based on QCT20,21,22,23. In light of these observations, we opted to explore a broader energy range (60–76 keV) as far as possible for exploration under the premise that the work load was feasible. Our results showed that VMIs at specific energy levels are able to obtain more accurate BMD measurements than 120-kVp images at different ROI sizes and radiation doses. In particular, we identified the 74-keV VMI as offering optimal measurement stability while maintaining acceptable accuracy. This finding may provide reference for subsequent relevant research and clinical applications.

In addition, our study identified ROI size and radiation dose as influential factors affecting BMD measurements, which is consistent with the results of previous studies5. Due to the change of radiation dose, the attenuation value of bone trabecula will change, and the BMD value based on the attenuation value will change24. Changes in ROI affect the number of voxels included in the calculation, and the attenuation value of some voxels is affected by a partial volume effect, which can also lead to a bias in BMD values25. These observations underscore the importance for clinicians to minimize measurement errors attributed to these factors when utilizing QCT for BMD assessment, aiming for greater stability and accuracy.

Beyond virtual monochromatic imaging, emerging technologies such as photon-counting detector CT (PCD-CT) and artificial intelligence (AI) promise to further enhance QCT-based BMD measurement accuracy. PCD-CT, with its superior energy resolution and material decomposition capabilities, minimizes beam-hardening artifacts and enables precise calcium quantification (r = 0.99 in hydroxyapatite phantoms), as demonstrated in osteoporosis models26,27. These advancements align with our findings on VMI energy optimization (e.g., 74 keV), suggesting synergistic potential for integrating spectral imaging techniques. Meanwhile, AI-driven approaches, including deep-learning image reconstruction, improve spatial resolution and reduce noise without compromising BMD accuracy28. While our study focused on optimizing VMI energy levels for conventional DECT, future work could combine PCD-CT’s spectral sensitivity with AI-enhanced protocols to address limitations in clinical standardization. Such integration may establish next-generation QCT systems capable of simultaneous structural and quantitative bone assessment, bridging the gap between phantom-based research and patient-specific applications.

There are some limitations to this study. First, our findings are based on a specific CT system. Different manufacturers and types of DECT systems have been shown to have different BMD measurement accuracy29,30. Furthermore, while emerging PCD-CT technology has shown promise in osteoporosis screening through simultaneous 3D structural imaging and calcium-water component analysis27, its performance relative to DECT-derived VMIs remains unvalidated in human subjects. Future studies should integrate PCD-CT’s material decomposition capabilities with VMI energy optimization strategies to develop more robust QCT workflows. Therefore, the optimal VMI energy level for other DECT systems requires further validation. Second, our study only considers VMIs at 60–76 keV, as it has previously been reported that VMIs obtained in this range are comparable to conventional 120-kVp images. In fact, the maximum range of VMI levels that can be reconstructed by DECT is 40–200 keV31. Although BMD measurements decrease with increasing energy levels according to our study, it is not known whether this change applies to VMI level beyond 60–76 keV. Finally, when applying the findings of this study clinically, it is important to note that BMD measurements in clinical subjects tend to be influenced by more factors (position, adipose tissue, etc.), which may alter the optimal VMI level32,33.

In conclusion, when measuring BMD values based on QCT, VMI can reduce the measurement error and improve the accuracy compared with the conventional 120-kVp images. In particular, the 74-keV VMI demonstrated superior measurement stability and reliable accuracy in this study. Our findings provide immediately applicable guidance for centers utilizing existing QCT workflows while establishing foundational knowledge for future spectral CT applications.