Abstract
This study aimed to evaluate diffuse large B-cell lymphoma (DLBCL) patients who have refractory/relapsed disease and characterize the heterogeneity of DLBCL using patient-level radiomics analysis based on 18F-FDG PET/CT. A total of 132 patients diagnosed with DLBCL who underwent 18F-FDG PET/CT before receiving treatment were selected for the final study. Patient-level volumes of interests (VOI) were extracted from PET/CT images, and 328 radiomics features were extracted subsequently. 8 radiomics features were selected using the Least Absolute Shrinkage and Selection Operator (LASSO) algorithm to calculate the radiomics score (rad-score). Additionally, a total of 64 potential ML classifiers were generated based on 8 distinct supervised learning algorithms. The combined model that integrates rad-scores, clinical features and standard PET parameters demonstrates excellent performance; Specifically, ML models based on Naive Bayes have the greatest predicted values (AUC = 0.73). The patient-level radiomics features were subjected to unsupervised non-negative matrix factorization (NMF) clustering analysis to identify 3 radiomics subtypes. Cluster 1 exhibited a substantially higher prevalence of refractory/relapsed DLBCL compared to Clusters 2 and 3 (P < 0.05). Moreover, Cluster 1 showed a significantly higher frequency of advanced Ann Arbor stage, high international prognostic index, and bulk disease (all P < 0.05). In conclusion, Radiomics scores and radiomics subtypes derived from patient-level data offer significant predictive value and phenotypic information for patients with refractory/relapsed DLBCL.
Similar content being viewed by others
Introduction
Diffuse large B-cell lymphoma (DLBCL) is the predominant aggressive subtype of non-Hodgkin lymphoma subtype in adults, characterized by widespread involvement and diagnostic challenges1. The conventional treatment for DLBCL is R-CHOP immunochemotherapy (which include the rituximab, cyclophosphamide, doxorubicin, vincristine, and prednisone), which aims to target cancer cells through various mechanisms, including direct cell killing and immune system regulation2. Although a significant number of patients, including those at advanced stages, achieve long-lasting survival through this approach, 10–15% patients encounter refractory disease and 20–30% relapse after an initial response3,4. Since rituximab is commonly administered during first-line chemotherapy, refractory or relapsed DLBCL patients may face worse outcomes than those in the pre-rituximab era5. Although the International Prognostic Index (IPI) can predict DLBCL patient outcomes, it falls short in accurately identifying individuals who are at risk of developing refractory or recurrent disease6,7. It is vital to promptly identify patients with refractory/relapsed DLBCL. And explore the alternative treatment options, such as salvage therapy, or clinical trials of novel drugs, ultimately enhancing their prognosis4.
DLBCL often involves multiple organs throughout the body, and these findings may be overlooked or misclassified on CT or MRI scans, especially when involvement is mild or atypical. 18F-FDG PET/CT is commonly employed for initial DLBCL staging and assessing first-line therapy response due to its effective to characterize tumor glucose metabolism and detect lymph node and extranodal involvement8. However, FDG-PET could produce false positive results due to inflammation. Conventional PET semi-quantitative parameters, such as maximum standardized uptake value (SUVmax) and total metabolic tumor volume (MTV), have been associated with the DLBCL prognosis9,10. Nonetheless, they are limited in fully describing subtle metabolic heterogeneity within targeted lesions. Tumor heterogeneity encompasses the variability in phenotypic characteristics of cancer cells, including cell structure, gene expression, metabolism, and metastasis capacity11. This complex phenomenon reflects not only genomic instability and epigenetic variability but also variations in prognostic outcomes among different tumor types12,13. Radiomics is an emerging era that involves extracting and analyzing vast amounts of quantitative data from medical images, such as CT or PET14. In the field of PET/CT radiomics in particular, the workflow necessitates detailed specification of key technical parameters to ensure methodological reproducibility, a critical requirement for both research validation and clinical translation15. These metrics derived from images effectively capture intra-tumoral and inter-tumoral heterogeneity associated with molecular and cellular characteristics16,17.
Despite awareness of the poor outcomes in patients with refractory/relapsed DLBCL, there is currently no efficient predictive tool used in clinical practice. At present, refractory/relapsed DLBCL is mainly explained through collaboration between nuclear medicine physicians and clinical physicians. Several studies18,19 focus solely on radiological characteristics of the lesion with the highest metabolic activity or largest volume, despite that a substantial proportion of DLBCL patients lack a distinct primary lesion, as the disease often disseminated throughout the body. Given the significant heterogeneity in patients and tumors, radiomics analyzing at the patient level may provide a more accurate representation of the unique variations in disease. This study aimed to evaluate the predictive value of baseline 18F-FDG PET/CT patient-level radiomics for refractory/relapsed DLBCL populations, and to investigates whether unsupervised learning-based radiomics subtypes could effectively characterize heterogeneity among DLBCL patients.
Results
Patient characteristics
The clinical characteristics of the patients are shown in Table 1, with the majority suffering from advanced diseases, elevated lactate dehydrogenase levels, and high IPI scores. Among the analyzed patients, 44 (33.33%) had refractory or relapsed DLBCL, while 88 (66.67%) did not. Statistically significant differences were found in Ann Arbor stage (P = 0.025), ECOG PS (P = 0.026), IPI (P = 0.009), and bulk disease (P < 0.001) between the refractory/relapsed DLBCL (RR) and non-refractory/relapsed DLBCL (NRR) groups. Table 2 provides a comprehensive list the PET characteristics for the patients. The RR and NRR groups exhibited significant differences in DmaxVox (P = 0.036), MTV (P < 0 0.001), and TLG (P = 0.006).
Radiomics score development
By using the LASSO regression model, we refined 8 original features with an optimal λ value of 0.0686 (Fig. 1A and B). The rad-score was derived based on the corresponding coefficients of 8 radiomics features selected (Fig. 1C). For a detailed breakdown of radiomics features and their coefficients, see Table S1 in the Supplementary Materials. The median value was 0.76, and the RR group exhibited a significant higher rad-scores (P < 0.001) compare to the NRR group. Rad-scores were shown in a bar chart, and their distribution is displayed in Fig. 1D.
Construction of radiomics scores. Radiomics features selection using the Least Absolute Shrinkage and Selection Operator (LASSO) regression with 10-fold cross-validation. (A) The optimal Lambda value was identified by the minimum mean-squared error (MSE) and by the minimum MSE within one standard error. (B) LASSO coefficient profiles of the radiomics features. (C) Specific coefficients corresponding to each radiomics feature obtained by selection. (D) Waterfall plot obtained from radiomics scores of all patients.
Machine learning model performance
MTV had a strong correlation with TLG (r = 0.94, P < 0.001). Given the established prognostic superiority of MTV (Figure S1), this parameter was prioritized for integration into the multivariate predictive modeling framework9,19. The clinical model was constructed based on significant clinical features (Ann Arbor stage, ECOG PS, IPI, Bulk disease). Conventional PET features (MTV and DmaxVox) were used to construct PET prediction models. The above models were used to predict refractory/relapsed DLBCL. To prevent overfitting of the composite model, the combinations of variables were further screened. Ultimately, the clinical-PET model was built using IPI, bulk disease, MTV, and DmaxVox with rad-scores being added to produce the combined model. For a detailed description of the prediction models included in this study, see Table S2 in the Supplementary Materials.
Figure 2 presents the performance evaluation of the 64 ML models for predicting refractory/relapsed DLBCL using 8 algorithms (with or without oversampling). For a detailed of the predictors for all models, see Table S3 in the Supplementary Materials. The combined model based on Naive Bayesian Algorithm produced the highest AUC (0.73) and the highest accuracy (0.69) among all ML models with the oversampling applied. In the clinical model, Naive Bayes algorithm had the highest AUC (0.68), and Rpart had the highest accuracy (0.69). In the PET model, the KNN algorithm with oversampling had the highest AUC (0.7). Additionally, the three ML algorithms (Log Reg, LDA, and KNN) produced the highest accuracy (0.67). The clinical-PET Model, yielded the best AUC with Log Reg and LDA algorithms, and both accuracy improved with oversampling.
The efficiency of ML. Box plots, showing performance of each model and eight machine learning models (with and without oversampling) for predicting outcomes in test set. Horizontal line in box indicated median; ends of box indicate IQR; ends of whiskers indicate interdecile range; and points beyond whiskers indicate outliers. (A) AUC values of clinical models. (B) AUC values of PET models. (C) AUC values of clinical-PET models. (D) AUC values of Combined models.
Radiomics subtype characterization
We examined the statistical association between patients in the RR and NRR groups. A total of 29 differential radiomics features were selected after conducting the difference analysis across groups. Using cophenetic, dispersion and silhouette curves (Fig. 3), we identified 3 robust clusters (radiomics subtypes), with 50, 65 and 17 patients in clusters 1, 2 and 3, respectively. The incidence of refractory/relapsed DLBCL was significantly higher in Cluster 1 (25 of 50 [50%]) compare to Cluster 2 (16 of 65 [24.6%]) and Cluster 3 (3 of 17 [17.6%]) groups (p = 0.006) (Fig. 4A).
The results of NMF. Identification of DLBCL radiological subgroups based on differential radiomics features. (A) Consensus matrix heatmap with K ranging from 2 to 10. (B) Relationship between cophenetic, dispersion, evar, residuals, rss, silhouette, and sparseness coefficients with respect to the number of clusters.
Comparison between radiomics subtypes. Differences in radiomics subgroups. (A) The Sangi diagram shows the population grouped by RR/NRR, radiomics subgroups, IPI, and other characteristics, showing the patterns of different groups. (B) Heatmap with clinical characteristics of the 3 clusters.
Clinical characteristics grouped into clusters are summarized in Fig. 4B. For a detailed of Clinical characteristics, see Table S4 in the Supplementary Materials. In cluster 1, 82% patients (41out of 50) had a higher frequency of Bulk disease, which exhibited a significant higher occurrence compared with clusters 2(2 of 65 [3.1%]) and 3(1 of 17 [5.9%]) (P < 0.001). Similarly, the cluster1 had the highest frequency of High IPI (30 of 50 [60%]) compare to cluster 2 (25 out of 65[38.5%]) and cluster 3 was 5 of 17 (29.4%) (P = 0.026). There was significant difference in Ann Arbor stage between the 3 clusters (P = 0.025), with a higher frequency of high Ann Arbor stage in cluster 1 (36 of 50 [72%]) than in cluster 3(6 of 17 [35.3%]). There was no significant difference in other clinical features between the 3 clusters (all P > 0.05).
Conventional PET features grouped into clusters are summarized in Table 3. There were no significant differences between the 3 clusters, only for SUVstd and COV (all P > 0.05). Cluster 1 had the highest median fraction of SUVmax (20.36 [IQR = 12.91–26.94], P = 0.011) and SUVmean (7.70 [IQR = 5.78–10.21], P = 0.011). There was significant difference in Dmax among the three clusters(P = 0.008), with a higher median fraction of Dmax in cluster 1 than in cluster 3. Similarly, the median fraction of DmaxVox was highest in cluster 1 (41.35 [IQR = 28.07–56.72], P = 0.001). Cluster 1 had a higher median fraction of MTV (520.63 [IQR = 348.75–1174.71], P < 0.001) and TLG (4373 [IQR = 2298–7737.25], P < 0.001) compared to clusters 2 and 3.
Discussion
The study highlights the significance of patient-level radiomics analysis using baseline 18F-FDG PET/CT in predicting refractory/relapsed DLBCL. This finding supports the hypothesis that whole-body PET/CT imaging may offer additional metabolic information for tumor heterogeneity evaluation and potentially guide risk stratification. The study was based on a retrospective cohort of 132 patients, and the developed combination of rad score, clinical data, and PET features generally performed better than clinical and/or PET models alone. It is suggested that rad-scores may provide additional value for predicting refractory/relapsed DLBCL by conventional factors such as IPI and MTV. Among all ML models evaluated, the combined model based on the Naive Bayes algorithm had the highest efficiency. Unsupervised learning was employed to analyze radiomics data from DLBCL patients in an unbiased manner, resulting in the effective categorization of patients in 3 radiomics subtypes using NMF. Multiple clinical and conventional PET parameters were significantly different between different radiomics subtypes. Higher frequency of refractory/relapsed DLBCL was found in cluster 1.
The heterogeneity observed in DLBCL patients is likely attributed to biological diversity, and both clinical and molecular heterogeneity may have an impact on the prognosis of patients. Identifying predictive biomarkers of refractory/relapsed DLBCL is crucial for treatment selection and clinical decision support. Currently, most research has focused on biomarkers identified in biological samples, such as peripheral blood and tumor tissue20,21,22. Rushton et al.23 sequenced circulating tumor DNA from 135 refractory/relapsed DLBCL and found that TP53 and KMT2D were mutated in the majority of these patients, indicating a role in first-line treatment for drug resistance. Metabolic reprogramming such as glycolysis, plays a crucial role in the formation of tumors. The heterogeneity in tumor glucose uptake is potentially visualizable by 18F-FDG PET/CT13. In our study, we computed two parameters to characterize the tumor heterogeneity: rad-score and COV.
Figure 1 demonstrates that the rad-scores, comprised of multiple radiomics features, were significantly different between groups and more accurately represented heterogeneity. This finding aligns with a recent study identifying radiomics signature and IPI as independent risk factors for 2-year PFS and OS of DLBCL24. However, as shown in Table 2, there was no significant difference in COV among the groups. Previous studies have demonstrated that COV is capable of measuring tumor heterogeneity and has the potential to serve as a prognostic risk factor for primary mediastinal large B-cell lymphoma patients35. In our research, the region of interest was based on patient-level analysis rather than focusing on a single lesion, this approach may account for the inadequacy of a basic measurement such as COV in capturing the diverse metabolic characteristics of the patients. As DLBCL often involves multiple lymph nodes and extranodular masses and exhibiting a high heterogeneity of intertumoral and intratumoral 18F-FDG uptake within the same individual. Based on these findings, we suggest that utilizing patient-level rad-scores derived from PET/CT scans may provide a more accurate representation of individual variations.
DLBCL, a systemic malignancy without typical primary tumor, is usually diagnosed through biopsy of a single lesion in routine clinical practice. Thus, exploring tumor heterogeneity over the entire tumor volume may be more relevant to patient outcomes than a single lesion in DLBCL25. However, measuring total MTV in DLBCL remains challenging compared to other tumors like lung cancer. Furthermore, analysis and interpretation of 18F-FDG PET images are usually performed by experienced physicians. While a standardized whole-body PET imaging segmentation method has yet to be established for DLBCL. This may have some impact on the robustness of PET features. Barrington et al.34 compared several segmentation methods on PET images of DLBCL and determined that automatically estimating the total MTV by SUV 4.0 to be a feasible approach. Eertink et al.26 examined 6 different semi-automatic segmentation methods for radiomics analysis, discovering that the classification performance of radiomics features remained consistent, regardless the variations in specific radiomics feature values obtained through the different segmentation methods. In our study, we employed segmentation with SUV 4.0, complemented by manual modifications made by experienced physicians, to achieve the whole lesion segmentation. This approach boasts a high segmentation rate, with over half of the patients requiring only simple clicking to remove uptake in the brain, urinary tract, and heart. However, the automated process was not entirely reliable in some cases, particularly when dealing with neighboring lesions or areas with high physiological uptake in the gastrointestinal tract, which necessitated time-consuming user interaction.
ML models based on radiomics have increasingly been incorporated into cancer imaging research. Our study demonstrates that the rad-scores calculated by LASSO was correlated with refractory/relapsed DLBCL. The patient-level model based on radiomics score has a high value in predicting refractory/relapsed DLBCL. Similarly, recent research also supports the potential of these radiomics features in predicting progression after 2 years for DLBCL patients. However, variations in both the extracted features and the quantity of characteristics makes it difficult to directly compare different studies27,28. Another study exploring the analysis of multiple single lesions in vivo has revealed that a random forest model may effectively identify primary treatment failure in refractory DLBCL patients7. Our study had a majority of patients without refractory/relapsed disease, resulting in an imbalance in the results, which we corrected by creating synthetic samples. Similar to the Jiang et al.29 study, we compared the combined model based on multiple ML algorithms with other models, such as clinical model and PET model. Figure 2 and Table S3 showed that the combined model based on Naive Bayes algorithm had the highest AUC, supporting the feasibility of combining radiomics features with clinical predictors. A study was conducted to predict the relapsed or refractory disease status of 251 Hodgkin lymphoma patients with radiomics features extracted from baseline PET scans, and the results showed that the model constructed with 5 radiomics features was superior to MTV, TLG30. In contrast, to prevent radiomics features from overwhelming clinical indicators, our study combined multiple radiomics features to construct rad-score, which was then combined with other factors to improve the clinical applicability of the model. The metrics in the combined model include multimodal data for both clinical and images. Considering that the complex biological processes of DLBCL may occur at various scales, integrating clinical and imaging factors could offer a more comprehensive understanding of disease characteristics and enhance predictive efficacy and accuracy.
Supervised methods primarily identify candidate biomarkers of known phenotypes, while unsupervised methods seek to uncover new subgroups and potentially detect relevant variables. Most studies on tumor subtypes focus on genomics, with only a few exploring radiomics31. The identification of radiomics subtypes refers to the process where patients or diseases are stratified into different groups based on quantitative imaging features, which could have implications for diagnosis, prognosis, and personalized treatment. Perez-Johnston et al.32 identified 4 radiomics clusters in I-stage lung adenocarcinoma CT scans and investigated clinical-pathological characteristics and genomic associations between the different clusters. As shown in Fig. 3,, we divided DLBCL patients into 3 radiomics subtypes using NMF based on differential radiomics features of baseline PET/CT, which suggested different efficacy profiles and tumor heterogeneity. These 3 radiomics subtypes exhibited significant differences in multiple clinical and PET factors. Half of the DLBCL patients in cluster 1 had refractory/relapsed disease, implying this subtype may have a poor prognosis; in contrast, the frequency of refractory/relapsed DLBCL in cluster 3 was below average. Among clinical factors, cluster 1 had more advanced stage patients, higher IPI scores, and a higher incidence of bulk disease; In terms of PET, cluster 1 also has a high median of MTV, TLG and DmaxVox. These factors have previously been associated with refractory/relapsed DLBCL in other studies and our results. Interestingly, cluster 2 had the lowest median of SUV conventional parameters (SUVmax, SUVmean, SUVstd). Given the heterogeneity of DLBCL, our findings suggest that radiomics clusters could be utilized for DLBCL patient classification, potentially enabling more personalized treatment approaches in the future.
This study has a few limitations. This study is a retrospective study with a small sample size, patient selection bias cannot be eliminated. It should be noted note that our results were not verified by an independent cohort of participants, although we did evaluate their reliability through internal validation using cross-validation. Second, radiomics were extracted from the total target area at the patient level, but total MTV measurement methods were inconsistent across different working groups. The possible impact of this technical issue on the generality of the conclusions. Furthermore, current molecular genetic studies have proposed DLBCL subtypes with poor prognosis, such as activated B-cell subtypes or MYC oncogene rearrangements. However, only a subset of patients in our cohort underwent subtype analysis. Further investigation is needed to determine if imaging biomarkers derived from 18F-FDG PET are associated with different DLBCL subtypes. Finally, this study did not consider the possibility of identifying feature selection frequency during cross validation, which may have an impact on model performance.
In this study, we developed machine learning models based on radiomics scores that were able to distinguish refractory/relapsed DLBCL patients by baseline 18F-FDG PET/CT; Additionally, radiological phenotypic analysis based on NMF is used to obtain relevant categories of DLBCL patients with significantly different risk of refractory/relapsed development. In summary, our findings demonstrate the potential of patient-level radiomics analysis as a method for evaluating refractory/relapsed DLBCL from both supervised and unsupervised ML perspectives, potentially contributing to more personalized treatment approaches in the future.
Methods
Patients
This study has been approved by the Medical Ethics Committee of the First Hospital of Shanxi Medical University and obtained informed consent from patients (No. KYLL-2024-280). The study strictly followed the Declaration of Helsinki. We retrospectively enrolled patients diagnosed with DLBCL who underwent 18F-FDG PET/CT examination at the First Hospital of Shanxi Medical University between August 2017 and December 2022 by querying the hospital’s medical record system. The study was carried out in accordance with the principles outlined in the Declaration of Helsinki through ethical clearance and obtained informed consent from all patients. The eligibility criteria mandated that patients fulfilled the following conditions: (1) having a pathologically confirmed diagnosis of DLBCL; (2) underwent clinical treatment with R-CHOP or a similar regimen; (3) exhibiting 18F-FDG-avid lesions on PET scans; and (4) over 18 years of age. Exclusion criteria: (1) Prior therapeutic interventions preceding PET/CT imaging acquisition; (2) Incomplete imaging datasets or lesions below 10 cc volumetric threshold for reliable quantification; (3) Insufficient clinical documentation for comprehensive analysis. Patients were ineligible if they had prior treatment before PET/CT scans, had incomplete or low-quality images that were challenging to analyze, or had no valid clinical data. Of the 143 patients included in the study, 4 patients without complete clinical data were excluded; To ensure stability and interpretability of imaging features, 7 patients were excluded (including 1 patient had diffuse liver involvement, 2 patients had diffuse inflammatory hypermetabolism postoperatively, and 4 had lesions smaller than 10 cc). This yielded a total of 132 patients who were analyzed. Figure 5 presents a schematic study flowchart.
Schematic study flowchart.
All patients underwent a comprehensive medical history assessment, laboratory tests, and an 18F-FDG PET/CT scan. Additional information that was collected included: age, Ann Arbor stage, Eastern Cooperative Oncology Group (ECOG) performance status, lactate dehydrogenase (LDH) levels, and extranodal involvement. Tumor masses exceeding a diameter of 7.5 cm were categorized as bulky disease. Refractory DLBCL was defined as progressive or stable disease with first-line treatment of chemotherapy greater than 4 cycles or later treatment of 2 cycles3,4. Relapsed disease reflects the appearance of new lesions after attainment of complete response3,4.
PET/CT scanning protocol
All patients were scanned using a GE Healthcare Discovery MI PET/CT scanner (GE Healthcare, USA) after fasting for a minimum of 6 h and maintaining their blood glucose levels below 200 mg/dl. The 18F-FDG was administered intravenously at a dose of 2.96–4.44 MBq (0.08–0.12 mCi) per kilogram of body weight. Following a waiting period of 50–60 min in a quiet environment, a CT scan was performed from the top of the skull to the middle of the femur, with a tube voltage of 120 kV, a tube current of 60–230 mA, and 3.75 mm slice thickness. PET acquisition was then subsequently performed over the same range, with 5–7 bed positions acquired for 3 min/bed. The matrix size of PET/CT images is 256 × 256, with voxel sizes of 0.98 × 0.98 × 3.3 mm (CT) and 5.5 × 5.5 × 3.3 mm. The PET data were automatically attenuated using the CT data to obtain PET reconstruction and PET/CT fusion images, which then were stored on the workstation in DICOM format.
Image segmentation and feature extraction
The 18F-FDG PET/CT images were evaluated by two nuclear medicine specialists (10 and 20 years’ experience receptively) who were blinded to patient outcomes. To minimize the inter-observer variability, all images were analyzed by the same observer with LIFEX 7.3 software33, and confirmed by an experienced senior nuclear medicine specialist. PET image volume of interest (VOI) segmentation was performed using the semi-automated segmentation module in the LIFEX. The lesion delineation protocol comprised three sequential optimization phases: (1) Primary segmentation was performed using semi-automated algorithm with dual-threshold parameters (SUVmax ≥ 4.0; minimum volume 3 mL) to generate initial volumetric contours; (2) Systematic refinement involved manually excluding physiological 18F-FDG biodistribution patterns (myocardial, cerebral, renal parenchymal, and urinary bladder activity) alongside non-malignant tracer retention sites; (3) Complementary augmentation addressed potential under-segmentation through expert-guided inclusion of subthreshold metabolic foci (SUVmax 2.5–4.0) and small volume metabolic lesions (The volume size 1–3 ml) exhibiting morphospatial continuity with primary lesions34. All VOIs were merged to form a consolidated patient-level VOI for PET. The patient-level VOIs of PET were verified with fused PET/CT images and transferred to corresponding CT slices.
For extranodal involvement, liver, lung, and bone marrow involvement were considered only if there was a focal uptake. Homogeneous bone marrow uptake was excluded from the tumor volume, while splenic involvement was defined as focal uptake or diffuse uptake higher than 150% of the liver background.
The standardized uptake value (SUV) was used as a scaling factor for the voxel values, based on the net injection tracer dose per kilogram of body weight. Five conventional patient-level SUV features were retrieved without any resampling: SUVmax, SUVmean, SUVstd, MTV and total lesion glycolysis (TLG). In addition, dissemination characteristics (Dmax and DmaxVox) and heterogeneity parameters (COV) were calculated. Dmax refers to the euclidean distance between the centre locations of two lesions that are farthest apart; While DmaxVox represents the distance between the two lesions farthest away using the outermost voxels. For the patient had only one lesion, the Dmax and DmaxVox values were both recorded as 0 cm. COV, an intuitive and straightforward parameter, was defined as the ratio of SUVstd to SUVmean within an automatically segmented tumor volume to estimate heterogeneity of 18F-FDG uptake35.
Prior to identifying radiomics features, we preprocessed both PET and CT images, including resampling all PET images to a voxel size of 1 × 1 × 1 mm³ using bilinear interpolation and discretizing PET images with a fixed bin count of 6436. CT images were subjected to the same resampling procedure employing a fixed bin count of 256. Based on previous research24 radiomics feature extraction was performed on patient level VOI. The patient level VOI quantification was performed through LIFEx software’s integrated whole-volume preservation algorithm (“save all in one” function), enabling comprehensive radiomic characterization across all metabolically active tumor subregions. A total of 328 radiomics features were extracted, including grayscale histogram parameters, grayscale co-occurrence matrix (GLCM, Haralick), neighborhood grayscale difference matrix (NGLDM, Amadasum), grayscale run length matrix (GLRLM, Xu), and grayscale zone length matrix (GLZLM, Thibault). (comprehensive descriptions of these textures were available at http://www.lifexsoft.org). These radiomics features comply with the feature definition described by the Imaging Biomarker Standardization Initiative (IBSI)37.
Radiomics analysis and model construction
Min-Max normalization [x* = (x − min)/(max − min), max is the maximum value of the sample data, and min is the minimum value of the sample data] was applied to all radiomics features. All radiomics features were scaled to within [0,1] to reduce the influence of different dimensions. To minimize overfitting, highly correlated features (with correlation coefficients greater than 0.9) were removed. Then, the LASSO regression algorithm was employed to further select the remaining features. LASSO adds L1 regularization terms to the least squares algorithm to reduce overfitting. The Lambda parameters were optimized using a 10-fold cross-validation technique. The most optimal Lambda value was determined for which the cross-validation error was within one standard or error of the minimum. To identify relevant features that may differentiate refractory/relapsed DLBCL, coefficients of irrelevant features were set to zero. The selected radiomics features were used to compute rad-score for each patient.
ML algorithms may handle data with complex non-linear relationships to clinical outcomes. We developed prediction models for clinical features, conventional PET features, rad-scores, and various combinations of features using 8 different machine learning (ML) methods (Table S5): Logistic Regression (Log Reg), Linear Discriminant Analysis (LDA), Quadratic Discriminant Analysis (QDA), Naive Bayes, K-Nearest Neighbor (KNN), Rpart, RandomForest, and Xgboost. The stratified 5-fold cross-validation method was utilized to assess the generalizability of the model. We trained on 80% of the data and validated on a 20% unseen subset in each cross-validation fold. Up until each fold was utilized as a test set, this procedure was repeated. Oversampling of patients with refractory/relapsed DLBCL was performed to correct for imbalance in the classification of patients with refractory/relapsed DLBCL and non-refractory/relapsed DLBCL28. We created 44 positive samples and 0 negative samples using SMOTE, resulting in a 1:1 ratio of positive to negative samples in the total population. We performed all pre-processing and optimization steps on each training fold to prevent test data from contaminating the trained model. To evaluate model performance, we calculated the area under the ROC curve (AUC), accuracy, sensitivity, and specificity for each model using ROC curve analysis.
Radiomics subtypes analysis
To investigate whether patient-level radiomics features reflect heterogeneity among patients with DLBCL, we analyzed radiomics features between groups and considered those with a p value of < 0.05 to be differential. We then used non-negative matrix decomposition (NMF) in unsupervised clustering to identify the radiomics matrix of DLBCL patients, an approach that allows identification of the appropriate number of clusters based on differences in tumor texture features. Radiomics subtypes have been constructed using NMF to differentiate patients with different risk levels based on similarities and differences in tumor texture features. To identify this optimal number of clusters, a range of K values is typically explored. Various evaluation metrics, such as cophenetic or dispersion, can be calculated for each K value to assess the quality and separation of the resulting clusters. The range is often determined based on prior knowledge of the dataset or through empirical observation. We set k to range from 2 to 10 and divided DLBCL samples into different clusters. Finally, we examined the correlation between clinical and PET features of each radiomics subtype obtained by clustering.
Statistical analysis
All statistical analyses were conducted using R (version 4.0) and SPSS software (version 25.0, IBM). We assessed the difference in related clinical and PET characteristics between the refractory/relapsed and non-refractory/relapsed DLBCL groups using either the Chi-square or Mann-Whitney U tests, depending on appropriateness. We used the Chi-square or Kruskal-Wallis H test to compare characteristics across different radiomics subtypes. All statistical comparisons were two-sided, and p < 0.05 was considered statistically significant.
Data availability
The datasets used and/or analyzed during the current study available from the corresponding author on reasonable request.
Code availability
The custom code and machine learning algorithms we use are open source, and readers can access the custom code through GitHub (DOI: 10.5281/zenodo.15087401).
References
Cai, W., Zeng, Q., Zhang, X. & Ruan, W. Trends analysis of Non-Hodgkin lymphoma at the national, regional, and global level, 1990–2019: results from the global burden of disease study 2019. Front. Med. (Lausanne). 8, 738693 (2021).
Lugtenburg, P. J. & Rituximab -CHOP with early rituximab intensification for diffuse large B-Cell lymphoma: A randomized phase III trial of the HOVON and the nordic lymphoma group (HOVON-84). J. Clin. Oncol. 38, 3377–3387 (2020).
Raut, L. S. & Chakrabarti, P. P. Management of relapsed-refractory diffuse large B cell lymphoma. South. Asian J. Cancer. 3, 66–70 (2014).
Friedberg, J. W. Relapsed/Refractory Diffuse Large B-Cell Lymphoma. Hematology Am Soc Hematol Educ Program. 498 (2011). (2011).
Fuji, S. et al. Analysis of real-world data in patients with relapsed/refractory diffuse large B cell lymphoma who received salvage chemotherapy in the rituximab era. Ann. Hematol. 100, 2253–2260 (2021).
Ruppert, A. S. et al. International prognostic indices in diffuse large B-cell lymphoma: a comparison of IPI, R-IPI, and NCCN-IPI. Blood 135, 2041–2048 (2020).
Santiago, R. et al. CT-based radiomics model with machine learning for predicting primary treatment failure in diffuse large B-cell lymphoma. Transl Oncol. 14, 101188 (2021).
Wirth, A. & Mikhaeel, N. G. PET response-guided radiotherapy for advanced DLBCL? Blood 137, 866–867 (2021).
Cottereau, A. S. Risk stratification in diffuse large B-cell lymphoma using lesion dissemination and metabolic tumor burden calculated from baseline PET/CT†. Ann. Oncol. 32, 404–411 (2021).
Ferrari, C. et al. Lesion-to-Liver SUVmax ratio to improve the prognostic value of the end of treatment PET/CT in diffuse large B-Cell lymphoma. J. Clin. Med. 11, 5541 (2022).
Pe’er, D. et al. Tumor heterogeneity. Cancer Cell. 39, 1015–1017 (2021).
Miao, Y., Medeiros, L. J., Li, Y., Li, J. & Young, K. H. Genetic alterations and their clinical implications in DLBCL. Nat. Rev. Clin. Oncol. 16, 634–652 (2019).
Ceriani, L. et al. Metabolic heterogeneity on baseline 18FDG-PET/CT scan is a predictor of outcome in primary mediastinal B-cell lymphoma. Blood 132, 179–186 (2018).
Lambin, P. Radiomics: extracting more information from medical images using advanced feature analysis. Eur. J. Cancer. 48, 441–446 (2012).
Alessandro, S. Challenges and limitations in applying radiomics to PET imaging: possible opportunities and avenues for research[. J] Comput. Biol. Med. 179, 0 (2024).
Sollini, M. et al. Methodological framework for radiomics applications in hodgkin’s lymphoma. Eur. J. Hybrid. Imaging. 4, 9 (2020).
Parvez, A., Tau, N., Hussey, D., Maganti, M. & Metser, U. 18F-FDG PET/CT metabolic tumor parameters and radiomics features in aggressive non-Hodgkin’s lymphoma as predictors of treatment outcome and survival. Ann. Nucl. Med. 32, 410–416 (2018).
Aide, N., Fruchart, C., Nganoa, C., Gac, A. C. & Lasnon, C. Baseline 18F-FDG PET radiomic features as predictors of 2-year event-free survival in diffuse large B cell lymphomas treated with immunochemotherapy. Eur. Radiol. 30, 4623–4632 (2020).
Ceriani, L. et al. SAKK38/07 study: integration of baseline metabolic heterogeneity and metabolic tumor volume in DLBCL prognostic model. Blood Adv. 4, 1082–1092 (2020).
Meriranta, L. et al. Molecular features encoded in the ctdna reveal heterogeneity and predict outcome in high-risk aggressive B-cell lymphoma. Blood 139, 1863–1877 (2022).
Jiménez-Cortegana, C. et al. Circulating myeloid-derived suppressor cells and regulatory T cells as immunological biomarkers in refractory/relapsed diffuse large B-cell lymphoma: translational results from the R2-GDP-GOTEL trial. J. Immunother Cancer. 9, e002323 (2021).
Greenawalt, D. M. et al. Comparative analysis of primary versus relapse/refractory DLBCL identifies shifts in mutation spectrum. Oncotarget 8, 99237–99244 (2017).
Rushton, C. K. et al. Genetic and evolutionary patterns of treatment resistance in relapsed B-cell lymphoma. Blood Adv. 4, 2886–2898 (2020).
Zhang, X. et al. A novel analytic approach for outcome prediction in diffuse large B-cell lymphoma by [18F] FDG PET/CT. Eur. J. Nucl. Med. Mol. Imaging. 49, 1298–1310 (2022).
Lue, K. H. et al. Prognostic value of baseline radiomic features of 18F-FDG PET in patients with diffuse large B-Cell lymphoma. Diagnostics (Basel). 11, 36 (2020).
Eertink, J. J. et al. Quantitative radiomics features in diffuse large B-Cell lymphoma: does segmentation method matter?? J. Nucl. Med. 63, 389–395 (2022).
Eertink, J. J. et al. 18F-FDG PET baseline radiomics features improve the prediction of treatment outcome in diffuse large B-cell lymphoma. Eur. J. Nucl. Med. Mol. Imaging. 49, 932–942 (2022).
Eertink, J. J. et al. Comparing lesion and feature selections to predict progression in newly diagnosed DLBCL patients with FDG PET/CT radiomics features. Eur. J. Nucl. Med. Mol. Imaging. 49, 4642–4651 (2022).
Jiang, C. et al. Optimal PET-based radiomic signature construction based on the cross-combination method for predicting the survival of patients with diffuse large B-cell lymphoma. Eur. J. Nucl. Med. Mol. Imaging. 49, 2902–2916 (2022).
Milgrom, S. A. et al. A PET radiomics model to predict refractory mediastinal hodgkin lymphoma. Sci. Rep. 9, 1322 (2019).
Li, Z. et al. Gene set-based identification of two immune subtypes of diffuse large B cell lymphoma for guiding immune checkpoint blocking therapy. Front. Genet. 13, 1000460 (2022).
Perez-Johnston, R. et al. CT-based radiogenomic analysis of clinical stage I lung adenocarcinoma with histopathologic features and oncologic outcomes. Radiology 303, 664–672 (2022).
Nioche, C. et al. LIFEx: a freeware for radiomic feature calculation in multimodality imaging to accelerate advances in the characterization of tumor heterogeneity. Cancer Res. 78, 4786–4789 (2018).
Ferrández, M. C. et al. Combatting the effect of image reconstruction settings on lymphoma. EJNMMI Res. 12, 44 (2022).
Watabe, T. et al. Intratumoral heterogeneity of F-18 FDG uptake differentiates between Gastrointestinal stromal tumors and abdominal malignant lymphomas on PET/CT. Ann. Nucl. Med. 26, 222–227 (2012).
Pfaehler, E. et al. Experimental multicenter and multivendor evaluation of the performance of PET radiomic features using 3-Dimensionally printed Phantom inserts. J. Nucl. Med. 61, 469–476 (2020).
Zwanenburg, A. et al. The image biomarker standardization initiative: standardized quantitative radiomics for High-Throughput image-based phenotyping. Radiology 295, 328–338 (2020).
Funding
This research was funded by the National Natural Science Foundation of China (NSFC 81971655); Shanxi Province Graduate Research Innovation Project (2023KY366).
Author information
Authors and Affiliations
Contributions
Caozhe Cui: Writing – original draft, Software, Methodology, Data curation. Jianbo Cao: Writing - review & editing, Validation. Yayuan Li: Writing - review & editing, Visualization, Resources. Boren Jia: Visualization, Validation. Ning Ma: Visualization, Validation. Xiaomeng Li: Validation, Supervision. Meng Liang: Visualization, Validation. Mingxia Hou: Visualization, Validation. Yanan Zhang: Investigation, Formal analysis. Hongliang Wang: Supervision, Resources, Investigation. Zhifang Wu: Writing - review & editing, Validation, Supervision, Resources, Investigation.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Cui, C., Cao, J., Li, Y. et al. A proof of concept study of 18F-FDG PET/CT patient-level radiomics identify refractory/relapsed diffuse large B-cell lymphoma. Sci Rep 15, 33914 (2025). https://doi.org/10.1038/s41598-025-08223-8
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-025-08223-8







