Introduction

As the most common primary intracranial tumors, meningiomas account for 25.2% of all intracranial tumors1. According to the 2021 World Health Organization (WHO) Classification of Central Nervous System Tumors, meningiomas are categorized into grades I, II, and III2. The European Neuro-Oncology Society Guidelines for the Diagnosis and Treatment of Meningiomas recommended that the main treatment for meningiomas is surgical resection3. In cases without obvious clinical symptoms, WHO grade I meningiomas can be monitored with regular follow-up. However, some symptoms, such as cognitive impairment, necessitate surgical resection. For WHO grade II or III meningiomas, surgery is mandatory and may be complemented with radiotherapy depending on the patient’s condition, or other options, including cytotoxic chemotherapy, hormone therapy, targeted therapy, etc., may be explored4. Therefore, accurately distinguishing tumor grades is crucial. MRI is the preferred method for evaluating patients with meningioma5, but reliance on visual analysis for medical image interpretation—limited to tumor shape and grayscale information—has led to some controversy in grading meningiomas due to its subjectivity and instability6,7. The advancement of artificial intelligence, particularly in image processing, offers significant support for differential diagnosis, grading, typing, and the prediction of prognosis for meningiomas.

As an emerging discipline, radiomics can objectively and quantitatively capture some valuable tumor characteristics that are difficult to identify with the naked eyes. The method for evaluating tumor heterogeneity involves image analysis8,9,10 that which can extract high-dimensional data and quantitative image features from medical images. Through machine learning algorithms, researchers can mine information related to tumor pathophysiology to diagnose diseases and predict prognosis. This research encompasses four main processes, namely data acquisition, image segmentation, feature extraction and quantification, as well as feature selection and model establishment. In previous studies, image analysis has performed well in tumor diagnosis and grading. Furthermore, it has been used to predict tumor gene mutations and protein expression differences, subsequently reflecting tumor heterogeneity and microenvironment. This, in turn, guides the selection of targeted therapy and prognosis11. Meningiomas exhibit many pathological subtypes, and the imaging manifestations of different subtypes, meningiomas, and other intracranial tumors are often similar. Conventional imaging techniques struggle to differentiate them, and the tumors’ malignancy levels and prognoses vary significantly. The accuracy of MRI in distinguishing high-grade meningiomas (HGM) from low-grade meningiomas (LGM) has yet to be definitively established, and there has been no systematic evaluation and summary of the value of image analysis in this distinction. Addressing this gap is crucial for providing an effective method for tumor identification. The ongoing development of image analysis offers an opportunity to address this challenge and potentially improve diagnostic precision. To the best of our knowledge, this study is the first meta-analysis aimed at examining the diagnostic value of MRI radiomics in distinguishing HGM from LGM, aiming to guide the clinical diagnosis of meningiomas.

Materials and methods

The study strictly adhered to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) (2020) guidelines.

Inclusion and exclusion criteria

Study type

This meta-analysis includes both prospective and retrospective diagnostic accuracy trials.

Study subjects

The study focuses on patients diagnosed with HGM and LGM.

Diagnostic methods

Evaluated diagnostic approaches include radiomics and image texture analysis. The benchmarks for comparison are puncture tissue biopsy, intraoperative frozen pathological biopsy, or postoperative histological pathological biopsy.

Outcome indicators

Assessment parameters comprise sensitivity (SEN), specificity (SPE), positive likelihood ratio (PLR), negative likelihood ratio (NLR), diagnostic odds ratio (DOR), and the area under the curve (AUC) of the summary receiver operating characteristic (SROC) curve, along with an evaluation of publication bias.

Exclusion criteria

(1) Non-English literature; (2) literature published multiple times; (3) literature with incomplete data or where data cannot be extracted; (4) animal studies, case reports, and conference literature.

Literature search strategy

A comprehensive search was conducted in PubMed, EMbase, Web of Science, and The Cochrane Library databases from the inception of each database to December 31, 2023 to identify relevant studies on grading meningiomas using MRI radiomics methods. The search strategy utilized a combination of subject headings and free words, which means to carefully tailor to each database’s specific characteristics. Concurrently, the references of the included studies were thoroughly reviewed to further enrich our collection of relevant information. Key search terms in English included “radiomics”, “texture analysis”, “meningiomas”, and “MRI”.

Literature screening and quality evaluation

The screening of literature retrieval, data extraction, and quality evaluation were meticulously carried out independently by two physicians. In the event of disputes, a decision was rendered by the chief physician, boasting over 15 years of work experience. The data extraction included: (1) essential information of the included studies, such as the first author, publication time, country of study, age, gender, and sample size; (2) detailed extraction of true-positive (TP), true-negative (TN), false-positive (FP) and false-negative (FN) values. In instances where the precise TP, TN, FP, FN data were not disclosed, SPE, SEN, PLR, and NLR were utilized to retrograde extrapolation to calculate these results. Quality evaluation was conducted using an image radiomics quality score (RQS). The risk of bias was meticulously assessed using the Quality Assessment Tool for Diagnostic Accuracy Studies (QUADAS)-2.

Statistical analysis

The statistical software used in this study included Stata 14.0 and RevMan 5.3. The Cochran-Q and I2 tests were utilized to identify heterogeneity between studies. In cases where the heterogeneity between groups was low (I2 < 50%), fixed effect models were employed to synthesize indicators; conversely, when the heterogeneity between groups was significant (I2 ≥ 50%), random effect models were used for indicator synthesis. Quality analysis of the included literature was conducted using RevMan 5.3. Additionally, Stata 14.0 facilitated the derivation of the combined SEN, SPE, PLR, NLR, DOR, and their respective 95% CIs, crafted the SROC curve, and calculated the corresponding AUC. The diagnostic value was assessed based on the AUC magnitude. The AUC of 0.7–0.9 indicated moderate diagnostic accuracy, whereas AUC > 0.9 signified high diagnostic accuracy. Deek’s test was conducted to evaluate publication bias, with P < 0.05 signaling a statistically significant difference. Subgroup analysis was also conducted using Stata 14.0.

Results

Literature screening process and results

During the preliminary search, a total of 434 relevant articles were identified. Following a meticulous review process, 21 studies were ultimately selected for inclusion. A detailed depiction of the literature screening process and results is illustrated in Fig. 1.

Fig. 1
figure 1

Flow chart of the meta-analysis. The flowchart was created following the PRISMA 2020 guidelines.

Basic characteristics of included studies

Totally, 21 studies with 2253 meningioma patients were included, among which 607 patients had HGM and 1646 had LGM. Among these, five studies focused on patients from Korea, Europe, and the USA, whereas the remaining studies featured patients from China. Detailed information is provided in Table 1.

Table 1 Baseline characteristics of the included studies.

Quality evaluation and bias risk assessment results of included studies

The quality and risk of bias in the 21 included studies were rigorously evaluated using the diagnostic meta-analysis quality assessment tool QUADAS-2. The outcomes of this qualitative evaluation are depicted in Figs. 2 and 3. Most studies were assessed to possess a low risk of bias, accompanied by minimal concerns regarding their applicability. The quality evaluation content and results included in the study, as shown in Supplementary Table 1, were conducted using the Radiomics Quality Score (RQS)32.

Fig. 2
figure 2

Methodological quality summary graphs.

Fig. 3
figure 3

Methodological quality of individual studies.

Meta-analysis results

SEN and SPE

The pooled SEN (I2 = 76.66) and SPE (I2 = 80.37) of the studies overall were relatively high, so a random effect model was used for meta-analysis. The SEN and SPE of the overall studies were 0.82 (95% CI 0.74–0.88) and 0.85 (95% CI 0.81–0.89), respectively (Fig. 4).

Fig. 4
figure 4

Forest plot of the sensitivity and specificity of magnetic resonance imaging radiomics for the diagnosis of Meningiomas.

PLR and NLR

Due to the high heterogeneity of both the PLR (I2 = 67.54) and NLR (I2 = 80.07), a random effects model was used for meta-analysis. The PLR and NLR of the overall study were 5.64 (95% CI 4.17–7.64) and 0.21 (95% CI 0.14–0.31), respectively (Fig. 5).

Fig. 5
figure 5

Forest plot of the PLR and NLR of magnetic resonance imaging radiomics in the diagnosis of Meningiomas.

DOR

Due to the high heterogeneity of DOR (I2 = 100), a random effects model was used for meta-analysis. The overall DOR of the study was 26.66 (95% CI 14.42–49.27; Fig. 6).

Fig. 6
figure 6

Forest plot of the DOR of magnetic resonance imaging radiomics for the diagnosis of Meningiomas. Diagnostic odds ratio.

The AUC of the integrated SROC curve was 0.91(95% CI 0.88–0.93). The results indicated that MRI radiomics had high diagnostic value in distinguishing the grading of meningiomas (Fig. 7).

Fig. 7
figure 7

SROC curve of magnetic resonance imaging radiomics for the diagnosis of Meningiomas. SROC summary receiver operating characteristic, SENS sensitivity, SPEC specificity, AUC area under the curve.

Publication bias

Publication bias was examined, and a Deeks funnel plot was drawn (Fig. 8). Quantitative analysis showed no statistically significant difference (P = 0.82), which suggested that there was no publication bias.

Fig. 8
figure 8

Deeks’ funnel plot of magnetic resonance imaging radiomics for the diagnosis of Meningiomas. ESS effective sample size.

Subgroup analysis

To investigate the source of heterogeneity, subgroup analysis was conducted based on country, modeling method, sample size, MRI sequence, and whether clinical factors were combined. According to Table 2, the results of each subgroup analysis were generally in line with the overall results, and there was no significant difference in heterogeneity between subgroups and the overall heterogeneity (all P values were greater than 0.05).

Table 2 Heterogeneity analysis results included in the study.

Discussion

Due to different phenotypes of tumors, tumor cell subsets are not consistent in terms of cell proliferation rate, morphology, karyotype, cell surface markers (antigenicity), biochemical metabolism, and invasion and metastasis ability. Imaging features were selected through high-throughput computers, and target data were classified using mathematical models, aiming to explore the correlation between imaging features and tumors at the cellular and molecular levels, which is currently a hot topic in medical research. Based on this, the concept of radiomics emerged in 201210. The concept of radiomics is the comprehensive analysis of features using tools, such as MaZda, MATLAB, TexRAD, MISSTA, CAD, and FireVoxel, which can provide new quantitative imaging markers without the need for new acquisition devices or tracers, and it has stronger diagnostic capabilities than ordinary inspection methods. To systematically evaluate the accuracy of radiomics in distinguishing HGM and LGM, the author conducted this meta-analysis. The results proved that the combined SEN and SPE of MRI radiomics were 0.82 and 0.85, respectively, and PLR and NLR were 5.64 and 0.21, respectively. The AUC was 0.91, which indicated that MRI radiomics has high diagnostic value in the grading of meningiomas. Previous meta-analyses33 systematically reviewed and evaluated the methodological quality of studies using radiomics for the diagnosis and prediction of intracranial meningiomas. The overall quality score and percentage quality score of radiomics have moderate to good inter-reader reproducibility. This meta-analysis showed an overall AUC of 0.88 (95% CI 0.84–0.93) with a standard error of 0.02. The AUC value in this study was 0.91 (95% CI 0.88–0.93), which surpassed that of previous studies. The primary focus of this study was the classification of HGM and LGM based on MRI radiomics. Importantly, this study not only complemented prior research but also underscored the significance of MRI radiomics.

Compared with WHO grade 1 meningiomas, HGM is potentially malignant, with a high risk of local recurrence, poor prognosis, and low overall survival rate. Therefore, the accurate preoperative prediction of meningioma recurrence and brain invasion becomes extremely important. According to a multi-center study21, a low spherical shape of meningiomas is associated with a high local recurrence rate and low survival rate. The RF model, which integrates image-based features with clinical features, demonstrates the highest predictive performance, with an AUC of 0.78. Zhang et al.8 used a SVM prediction model based on T1-weighted imaging (T1WI) enhancement and T2-weighted imaging (T2WI) sequences to foresee the potential of meningioma invasion before surgery. The research results highlighted that the AUC reached as high as 0.82. Wang et al.34 employed MRI radiomics features to develop a model based on preoperative contrast-enhanced T1WI and T2WI magnetic resonance imaging, aiming to precisely predict meningioma venous sinus infiltration before surgery. The areas under the curve for the training and validation groups were 0.86 and 0.82, respectively, which was helpful for selecting surgical approaches and predicting prognosis. In summary, precise preoperative prediction of meningioma grading holds significant importance.

The included literature used a variety of MRI sequences, and functional imaging sequences can provide more information on physiology, hemodynamics, and other information that conventional sequences cannot provide. However, unconventional sequences are not widely used in clinical practice as conventional MRI sequences. Image-based radiomics data that were extracted from conventional MRI seems to be a more practical and direct solution. How to standardize image-based radiomics in conventional sequences may help its clinical application.

The entire process of image-based radiomics research involves repeatability and quality control, including data acquisition, feature extraction and selection, modeling, and validation. Particularly, patient inclusion and exclusion criteria can lead to different usability analyses for different populations, especially for small cohort studies. Therefore, it is necessary to adopt standardized inclusion and exclusion criteria and use open dataset to proceed model validation and improve the reproducibility and quality control of radiomics research35. In total, 76.2% (16/21) of the literature included in this article underwent external validation, thus enhancing the reproducibility and quality control of MRI radiomics and mitigating the risk of false positives and overly optimistic results. Currently, the application of MRI radiomics in meningiomas is still in its initial stage. The small sample size involved in most studies may lead to overfitting of the model. What’s more, 57.1% (12/21) of the studies included in this article took measures to avoid overfitting. In the future, additional prospective, multi-center, and large-scale studies are required to enhance the applicability of findings across diverse populations. Due to differences in scanning sequences, feature extraction algorithms, and screening methods, radiomics features exhibit significant variations. To address these challenges, future research should follow standardized guidelines for radiomics research and incorporate the RQS scoring system. RQS, as a quality assessment tool in the field of radiomics, can provide objective quality evaluation standards for various stages of research, further improve the transparency and overall quality of research, and ensure the reproducibility and reliability of research results. By strictly implementing scoring criteria such as RQS, radiomics research will be able to better standardize its development process and gain wider application in clinical practice.

To explore the sources of heterogeneity, subgroup analysis was conducted in this study. There was no significant difference, which highlighted that the results of this study are reliable and highly representative. However, this study has certain limitations. (1) Most of the radiomics studies included in this research were single-center and retrospective in nature (20/21), and thus the case number was relatively small. Due to the lack of sufficient data to train the models, the statistical results may lack generalizability and reproducibility, with selection bias, which could be one of the reasons for the high accuracy observed in this study. Therefore, prospective large-sample multi-center collaborative studies are required in future research to provide reliable evidence for the clinical application of radiomics. (2) Although radiomics models help in distinguishing between HGM and LGM, they involve a variety of analytical methods. In terms of modeling approaches, various factors, such as model generalizability, feature selection methods, lack of standardization in model selection and evaluation, and model interpretability and robustness, could all impact data accuracy. (3) The included literature exhibited high heterogeneity, and subgroup analyses did not identify the sources of this heterogeneity, but was only preliminarily explored. (4) It is challenging to standardize the specifics of imaging techniques or algorithms, which leads to differences in model generalizability, challenges in reproducibility, and difficulties in clinical application. These discrepancies not only affect the integration of research findings in the field of radiomics but also hinder the application of these models in clinical practice. However, using specific imaging techniques or algorithmic details, such as high-quality imaging data or strategies to avoid overfitting during model training, the high accuracy of MRI radiomics in distinguishing between HGM and LGM can be further explained.

Radiomics provides a new and effective approach for preoperative assessment of meningioma malignancy, especially in cases where traditional imaging analysis struggles to accurately differentiate tumor grades. Meanwhile, this technology offers clinicians more precise diagnostic information for the development of personalized treatment plans, such as optimizing surgical strategies and evaluating the necessity of postoperative radiotherapy. To further enhance its clinical value, the current focus of research should be on standardizing radiomics techniques. Standardization encompasses all stages, from data acquisition and feature extraction to model validation, which could not only improve the reproducibility of radiomics research results but also enhance cross-center comparability. Such standardization will lay a solid foundation for the widespread application of radiomics in clinical practice. We suggest that future studies should focus on establishing unified technical standards and guidelines to ensure consistency in research outcomes and promote the adoption of this technology in clinical settings. At the same time, future research may benefit from using updated evaluation tools, such as CLEAR36 and METRICS37, which can help further improve the transparency and methodological rigor of radiomics research.

Conclusion

In summary, although additional research is required to validate the most suitable techniques, radiomics holds promise as an accurate tool for the identification of the grading of meningiomas. The establishment of a standardized application of radiomics will significantly enhance the diagnostic precision and the accuracy of clinical decision-making for the grading of meningiomas in the future.