Introduction

Parasellar cavernous hemangiomas (CHs) are relatively rare intracranial-extraaxial vascular malformation with unknown etiology, accounting for 2–3% of all cavernous sinus tumor1. In recent years, advances in neuroradiology techniques improve the ability to detect it2,3,4. Dural cavernous angiomas occurred in parasellar cavernous sinus often share a similar appearance with parasellar meningiomas on conventional MR imaging (MRI) and exhibit a dural tail sign on enhanced-T1weighted (T1WI) imaging occasionally5. They were misdiagnosed as meningiomas commonly6,7,8,9, and the misdiagnosis rate was as high as 66.7–87.5%1,10.

Althouth parasellar CHs were benign, clinical symptoms such as headache and cranial nerve deficits may arise due to progressive tumor growth and mass effect11. The management of parasellar CHs remains a challenge for neurosurgeons due to the complex neurovascular structures of the cavernous sinus. The incidence of uncontrollable and massive hemorrhage during surgery and neurovascular function injury was high and even death12. Stereotactic radiosurgery (SRS) could alleviate symptoms and effectively reduce surgical complication12, attaining long-term CHs control13,14. However, SRS increases the risk of adhesion between meningiomas and surrounding tissues, which is not the preferred method for meningiomas. Surgical resection is considered to be an effective strategy for the treatment of parasellar meningiomas15, and SRS is an adjuvant treatment for residual or recurrent meningiomas after surgery5,16,17. Consequently, accurate preoperative diagnosis for parasellar CHs and meningiomas is crucial for individualized treatment decisions.

In recent years, advanced functional imaging features were explored to provide information for improving diagnostic accuracy, including a description of compactness of tumor cell arrangement, cerebral blood perfusion, and vascular proliferation characteristics. Pathologically, CHs can be classified as type A, B and C18. Type A was sponge-like with intact pseudocapsule; type B was mulberry-like with the pseudocapsuel incomplete or absent; and type C was composed of both mulberry-like composition and sponge-lied composition. Parasellar meningiomas are mostly meningothelial subtype19. They are obvious enhancement and hyperperfusion, with a significantly lower minimum apparent diffusion coefficient (min ADC) compared to parasellar CHs20. These provide valuable information for the identification. However, sometimes its clinical application is limited due to the following reasons: (1) the gradual “filling” features on dynamic contrast-enhanced MRI (DCE-MRI) help in the diagnosis of cavernous hemangioma, which was different from meningomas. However, type A CH, accounting for about 40% of all parasellar CHs21, is composed of thin-walled large lumen sinusoids with scanty intervening connective tissue. It shows marked homogeneous enhancement than type B and C18,22, which is similar to meningiomas; (2) identification by perfusion status is typically incomplete23. Type B cavernous hemangioma contains ample solid parenchyma and well-formed vasculature and connective tissue. It has high CBF values and is easily misdiagnosed as meningiomas20,22; (3) poor imaging effect on diffusion-weighted imaging (DWI) of parasellar lesions was inevitably, due to the low signal-to-noise ratio and magnetic susceptibility artifacts caused by skull base bone and nasal containing gas; (4) although DCE-MRI has certain value in differential diagnosis, it is inevitable to inject exogenous contrast agents. Which limits its use in specific populations of pregnant women18,20,24. The previous reports showed that parasellar CHs and meningoma were both the most frequently diagnosed parasellar disease during pregnancy25. Therefore, the exploration based on conventional MR without contrast agents is more expected.

Radiomics has become an attractive technique in recent years. It is a powerful tool for constructing decision-support models based on conventional or functional imaging for extracting large amounts of image features and quantitative data analysis26. However, to our knowledge, its application in differentiating parasellar CHs from meningioma has not been reported27,28,29,30. The present study extracted a large panel of radiomics features from T1-weighted images (T1WI), T2-weighted images (T2WI), contrast-enhanced T1-weighted images (CE-T1WI), diffusion-weighted imaging (DWI), and apparent diffusion coefficient (ADC) imaging data involving 96 patients with parasellar CHs and meningiomas. This study aimed to construct an MRI-based radiomics model as a noninvasive preoperative prediction method to facilitate the differentiation of parasellar CHs from meningiomas.

Materials and methods

Patients

Radiological and clinical databases of 96 cases of parasellar CHs and meningiomas from Second Hospital of Dalian Medical University, Jining NO. 1 People's Hospital, and First Affiliated Hospital of Wenzhou Medical University were retrospectively reviewed between January 2010 and September 2019. This retrospective study was approved by the ethics review board of Second Hospital of Dalian Medical University, Jining NO. 1 People's Hospital, and First Affiliated Hospital of Wenzhou Medical University. The requirement for informed consent was waived by our Review Board owing to the retrospective nature of the current study. The methods in the current study were performed in accordance with the relevant guidelines and regulations. Inclusion criteria included the following: (1) patients pathologically confirmed and/or clinically diagnosed with parasellar cavernous hemangioma or meningioma; (2) preoperative multi-parametric MRI scans including T1WI, T2WI, CE-T1WI, DWI, and ADC data were acquired; and (3) patients with no treatment history before magnetic resonance examination. Patients were excluded if (1) clinical data were incomplete; (2) they received any treatment before the MRI examination, and (3) MR image quality was suboptimal. As a result, 40 cases of parasellar CHs and 56 cases of parasellar meningiomas were included in the study. The flowchart for patient selection is presented in Fig. 1.

Figure 1
figure 1

Flowchart for patient selection.

MR image acquisition and data management

MR examinations were performed in 37 and 59 patients using 1.5T (HDXT, GE Healthcare, USA) and 3.0T (Siemens, Verio, Germany) MR scanners, respectively. The MR scan parameters are summarized in Table 1. CE-T1WI was acquired after administration of 0.1 mmol/kg of gadolinium-based contrast material (Gadovist; Bayer, Leverkusen, Germany). Diffusion-weighted images were transferred to a post-processing workstation to obtain ADC maps. MR data for T1WI, T2WI, and CE-T1WI were acquired for all patients. DWI was obtained for 27 patients with cavernous hemangiomas and 32 patients with meningiomas. ADC maps were obtained for 22 patients with cavernous hemangiomas and 25 patients with meningiomas. All T1WI, T2WI, DWI, ADC, and CE-T1WI data were selected for texture analysis.

Table 1 MRI protocol.

Tumor segmentation

The radcloud platform (Huiying Medical Technology Beijing Co., Ltd, https://mics.huiyihuiying.com/#/) was used to manage the imaging and clinical data and to perform subsequent radiomics statistical analysis. To minimize the MRI intensity variations, we normalized the intensity of the image using the following formula:

$$f(x) = \frac{{s(x - \mu_{x})}}{{\delta_{x}}}$$

x indicates the original intensity; f(x) indicates the normalized intensity; μ refers to the mean value; σ indicates the variance; s is an optional scaling, by default, it is set to 131.

All lesions in the training set were manually delineated by a junior radiologist on contiguous T2WI slices and then copied to the corresponding T1WI, CE-T1WI, DWI, and ADC maps for each slice. The first and last image layers were excluded to reduce the partial volume effect in all of the following series. The volume of interest (VOI) was manually adjusted to avoid interference from magnetic sensitivity artifacts. A senior radiologist with 10 years of experience reviewed all contour lines and decided on the tumor boundaries when no consensus was reached. Next, the computer automatically generated a three-dimensional VOI. Both radiologists were double-blinded to both clinical and pathological information. Figure 2 depicts a schematic of the radiomics workflow.

Figure 2
figure 2

Radiomics workflow.

Feature extraction and selection

A total of 1409 quantitative imaging features were extracted from MR images using the Radcloud platform32. All of these features were classified into four categories26,33. (1) first order statistic: these features quantitatively described the intensity distribution of voxels in MR images, but did not involve the spatial arrangement of voxels; (2) shape-based: these features reflected the shape of the depicted region; (3) texture: texture analysis quantified the variation of features within gray levels and described the statistical information related to the spatial distribution of gray levels or voxel intensities. This analysis was generally performed by second- or higher-order statistical methods that quantified the heterogeneity within the lesion. These features included gray level run length matrix, (GLRLM), gray level co-occurrence matrix (GLCM), and gray level size zone matrix, (GLSZM); (4) high order features: high order features were obtained using statistical methods after filtering the images. They included Laplacian of Gaussian, wavele, square, square root, and logarithm.

In order to avoid over-fitting and improve the generalization ability of the model, variance threshold, select K best, and LASSO algorithm were used to select the optimal features (Fig. 3). A variance threshold of 0.8 was used in the variance threshold method to remove variance eigenvalues smaller than 0.8. The select K best was chosen to remove features without a statistically significant difference (p > 0.05). For the LASSO model, the L1 regularizer was used as the cost function with a cross-validation error value of 5 and a maximum number of iterations of 1000. The LASSO algorithm was used to find the best alpha in each sequence, calculate the coefficients, and obtain the most relevant features.

Figure 3
figure 3

Dimension reduction analysis and feature selection for T2WI. (a) Variance threshold method was used to select 486 features from 1409 radiomics features (variance threshold = 0.8); (b) 145 feat ures were retained using select K best (P value < 0.05); (ce) 145 features were retained using LASSO algorithm method. Eight eigenvalues were retained.

Model training and validation

The present study constructed radiomics-based models using KNN, SVM, LR, RF, XGBoost, and DT classifiers. The radiomic features after a three-dimensional dimensionality reduction were used as the dataset. Then, 80% of the datasets were randomly selected to build the training set and the remaining 20% were used as the validation set to evaluate the accuracy of the models.

Neuroradiologist evaluation

Subsequently, two neuroradiologists (with 5 and 10 years of experience, respectively) made a diagnosis based on the characteristics of parasellar cavernous hemangiomas and meningiomas in conventional MR images (T1WI, T2WI, and CE-T1WI), including size, signal intensity on T2WI and DWI (hyperintensity, isointensity, hypointensity), morphology (roundish, irregular and spindle), the spatial relationship with the peripheral blood vessels (encapsulation, compression, close to, separation), and enhancement characteristics (homogeneous and heterogeneous). Signal intensities were recorded according to the Elster scoring criteria34. The lesion diameters detected by ADC maps, T2WI, and other sequences were recorded to compare the recognition rate of each sequences. The maximum values were taken as the focus size in this study. The two neuroradiologists were blinded to the clinical and pathology data of specific cases, but knew the patients were parasellar CHs or meningoma.

Statistical analysis

The present study compared and analyzed the area under the receiver operating curve (ROC) curve with 95% confidence interval (CI), sensitivity, specificity, and accuracy of each classifier based on the results of different MR sequence tests. Model stability was evaluated using the F-score value. The larger the F-score value, the better the stability of the model. The lesion detection rate on different MR images was also analyzed, and the relationship between lesion diameter and the detection rate on ADC maps was statistically evaluated using the SPSS 22.0 software (SPSS, Inc, Chicago, IL). Long-distance cut-off values for the Yoden index findings were obtained based on the data sensitivity and specificity. The performance of the two neuroradiologists was evaluated using ROC curve analysis and compared to the performance of the final radiomics models.

Results

Clinical and MRI characteristics

The baseline clinical factors and the semantic image analysis of 96 patients are reported in Table 2. In univariate analyses, signal intensity on T2WI and DWI, morphology, the enhancement pattern and the spatial relationship with the peripheral blood vessels showed statistical significance between cavernous hemangiomas and meningiomas (χ2 = 35.521, P = 0.000; χ2 = 9.731, P = 0.008, χ2 = 7.636, P = 0.022, and χ2 = 13.253, P = 0.004, respectively). No significant differences in age, sex, and size were observed between cavernous hemangiomas and meningiomas (P = 0.186, P = 0.420 and P = 0.212, respectively). In multivariate analyses, signal intensity on T2WI , signal intensity on DWI , and the enhancement pattern were demonstrated as independent predictors of semantic features of MRI scans (Table 2). All lesions were detectable on conventional MR images (Fig. 4). The detection rate was 18/22 for cavernous hemangiomas and 19/25 for meningiomas on ADC maps. The area under the curve (AUC) for the detection rate was 0.881 (95% CI 0.790–0.972), with an accuracy, sensitivity, and specificity of 74.2%, 67.3%, and 100%, respectively (Fig. 5c). The mean diameter was approximately 2.74 ± 0.98 cm, with a critical value of 2.25 cm for the diameter on ADC maps.

Table 2 Baseline characteristics and semantic image analysis of the population study.
Figure 4
figure 4

(ae) Images of parasellar cavernous hemangioma in a 54-year-old woman. (fj) Images of parasellar meningioma in a 58-year-old woman. MRI protocol included (a, f) axial T2-weighted images, (b, g), axial T1-weighted images, (c, h) diffusion-weighted images, (d, i) apparent diffusion coefficient maps, and (e, j) contrast-enhanced T1-weighted images. Cavernous hemangioma exhibited hyperintensity on T2-weighted images, hypointensity on T1-weighted images, DWI, and ADC map, and CE-T1WI showed homogeneous enhancement. Meningioma exhibited slightly hyperintensity on T2-weighted images, slightly hypointensity on T1-weighted images, DWI, and ADC map, and CE-T1WI showed homogeneous enhancement.

Figure 5
figure 5

Receiver operating characteristic (ROC) curves of MRI model (a, AUC = 0.805), diagnostic efficiency of two neuroradiologists (b, the AUC of Reader 1 = 0.756, the AUC of Reader 2 = 0.545), and ADC map detection rate (c, AUC = 0.881).

The AUC of the MRI model (0.805) were lower than those of the radiomics (Fig. 5a). The AUCs for the two neuroradiologists were 0.756 (95% CI 0.654–0.858) for reader 1 (Fig. 5b) and 0.545 (95% CI 0.430–0.659) for reader 2 (Fig. 5b). When comparing diagnostic performance, the radiomics classifier had a significantly higher AUCs than the two neuroradiologists (P < 0.001).

Model assessment

After three-dimensionality reductions, eight out of 1409 features were selected based on T2WI (Table 3). Features based on other sequences are listed in Supplementary Tables S1S4.

Table 3 Description of selected radiomic features with their associated feature group and filter based on T2WI.

The diagnostic performance of the prediction models is summarized in Tables 4 and 5. After removing all over-fitting results for recognizable lesions, the T2WI-based radiomics model with KNN and SVM classifiers was more effective in identifying parasellar cavernous hemangiomas from meningiomas (Fig. 6).

Table 4 Performance of KNN classifier radiomics models in differentiating parasellar cavernous hemangiomas from meningiomas in the validation set.
Table 5 Performance of SVM classifier radiomics models in differentiating parasellar cavernous hemangiomas from meningiomas in the validation set.
Figure 6
figure 6

ROC curves for the optimal classifier. (a) ROC curve for KNN model based on T2WI with AUC = 0.93; (b) ROC curve for SVM model based on T2WI with AUC = 0.88; (c) ROC curve for KNN model based on ADC maps with AUC = 0.83; (d) ROC curve for SVM model based on ADC maps with AUC = 0.81.

Discussion

The present study established an accurate classifier to distinguish parasellar cavernous hemangiomas from meningiomas by integrating a large panel of radiomic features. An efficient classifier was obtained by comparing five MRI sequences from 1.5 T and 3.0 T MR scanners at three medical imaging centers, bolstering its generalizability. Through radiomic and artificial evaluation, T2WI and DWI sequences were of great value in the differentiation of parasellar CHs and meningoma, outperforming the enhanced-T1WI. And T2WI is more universal applicable for its less artifacts and higher detection rate of parasellar lesion. MRI-based radiomic models would be a potential method for differentiating parasellar CHs from meningomas.

In this study, imaging characteristics of parasellar CHs and meningomas were analyzed. It was found that the signal intensity on DWI and T2WI, and the enhancement mode in contrast-enhanced MR imaging had advantages in the differention of them. The previous study reported that the facilitated diffusion on DWI could differentiate parasellar CHs from other lesions35. In this study, ADC sequences had a good practical value in constructing radiomics models. However, the detection rate of parasellar CHs and meningoma in DWI and ADC maps was about 78.7% (37/47), with a cut-off diameter of 2.25 cm. Which affects the clinical application of this technology. Well, the detection rate of T2WI, T1WI, and CE-T1WI was 100%, which was more conducive to the establishment of radiomics models. This study proposes for the first time that the signal intensity on T2WI is also significant for the identification. It was characterized by a high signal-to-noise ratio and homogeneity27,36. The radiomics model constructed based on T2WI had a high diagnostic accuracy and stability in distinguishing parasellar hemangiomas and meningiomas, which provides a methodological basis for diagnosis when advanced functional and enhanced MR are difficult to carry out. The progressive contrast “filling in” in the tumors can aid in differentiating between them, which was reported in the previous studies and suggested the diagnosis of cavernous hemangiomas27,37. However, contrary to our general view, the accuracy of the radiomics model based on CE-T1WI was low than T2WI and ADC, although it was improved in different ways. This might be influenced by different types of cavernous hemangiomas and meningiomas22,37,38, which is worthy of further study.

Radiomics can provide additional metabolic and biological information in addition to the traditional MRI metrics. Gray contrast, uniformity, depth, and texture roughness have been used to study tumor grading, prediction of genomic information, and differentiation of lesion and non-lesion images39,40,41. The present study found that higher-order features could better reflect the degree of tumor heterogeneity and texture information. A GLSZM can quantify gray-level zones in an image to reflect tumor heterogeneity at a local scale. The coefficient of High Gray-Level Zone Emphasis was the largest, which measured the distribution of the higher gray-level values. Larger values indicated a larger proportion of high gray-level values and size zones in the image42. Tumor heterogeneity usually reflected the gray contrast variation of the image. Therefore, the GLSZM was more sensitive in distinguishing parasellar cavernous hemangiomas from meningiomas.

Different classifier algorithms may lead to different results. The present results suggested that the radiomics models combined with SVM and KNN classifiers had better diagnostic performance in distinguishing between parasellar cavernous hemangiomas and meningiomas. SVM has been proposed by Cortes et al. in 1995 as a binary classifier based on supervised learning43,44. The critical concept of SVM involves the use of a hyperplane to define decision boundaries to separate different classes of data points. This technique finds support vectors with a high discrimination and maximizes the interval between classes. It has good adaptability and discrimination ability. The K-nearest neighbor (KNN) method is mostly used for image classification. This object classification is based on the distance between its neighbors and is mainly used to solve regression and classification problems. By selecting the KNN points of a sample when the nearest neighbors belong to a certain category, the sample is determined to belong to that category. Several previous studies have demonstrated KNN's excellent and stable performance using different datasets, which was similar to the present result45,46,47. Consistent with our study, other classifiers also suffer from over-fitting. This is manifested by the fact that the training set is too accurate, while the validation set cannot achieve the expected ideal results. In addition, there are too many feature dimensions, parameters, and noise, which lead to a too-perfect prediction of the fitted function in the training set. However, the prediction results in the new data test set were low. In the present study, SVM and KNN classifiers were suggested for use as radiological diagnostic models to distinguish between parasellar cavernous hemangiomas and meningiomas.

There are several limitations in the present study. First, the sample size was relatively small and needs to be further explored. Second, different types of parasellar cavernous hemangiomas and meningiomas were not considered. Third, the differential diagnosis mainly focused on parasellar hemangiomas and meningiomas. Other parasellar tumors that are relatively easy to diagnose were not included in the study.

In conclusion, the proposed T2WI-based radiomics model combining SVM and KNN classifiers showed favorable predictive efficacy in the preoperative differential diagnosis between parasellar cavernous hemangiomas and meningiomas. It had more general applicability in complementing conventional imaging modalities and as an alternative to functional imaging. Moreover, the more readily available T2WI could provide higher detection rates and more texture features. Other imaging modalities based on T2WI for differentiating parasellar cavernous hemangiomas and meningiomas need to be explored.