Introduction

Lung cancer is a global health concern, ranking as the second most commonly diagnosed cancer and the leading cause of cancer-related deaths in 20201,2,3,4. Among the histological subtypes of lung cancer, lung adenocarcinoma is the most prevalent5,6. Over the years, advancements in research have led to the identification of distinct pathological subtypes of lung adenocarcinoma, including atypical adenomatous hyperplasia (AAH), adenocarcinoma in situ (AIS), minimally invasive adenocarcinoma (MIA), and invasive adenocarcinoma (IAC)7,8. Recently, AAH and AIS have been recognized as precursor lesions to lung adenocarcinoma9.

Studies have demonstrated that patients with precursor lesions and MIA have a favorable prognosis, with a 5-year disease-free survival rate of up to 100% following surgery10,11,12,13,14,15. However, it has been observed that the invasive pattern of MIA can impact its prognosis. MIA with poorly differentiated invasive patterns, such as micropapillary or solid predominant patterns, is associated with a higher risk of recurrence or the development of metachronous adenocarcinoma, necessitating closer monitoring and additional treatment post-surgery16. The progression from AIS to MIA signifies a significant increase in disease severity. Therefore, accurate prediction of the pathological subtypes of lung adenocarcinoma holds great clinical significance for treatment decisions and prognosis.

On computed tomography (CT) images, early-stage lung adenocarcinomas often manifest as ground glass opacities (GGOs)17,18,19,20. GGOs are characterized by hazy increased lung opacity while preserving the bronchial and vascular margins21. Based on density, GGOs can be further classified as pure ground glass opacity (pGGO) or mixed ground glass opacity (mGGO)22. Numerous studies have explored the diagnostic criteria and performance of CT imaging features in identifying the pathological subtypes of lung adenocarcinoma23,24,25. Most studies found that the appearance of invasive component mainly results in the solid component in CT imaging. However, most studies have included the IAC group where the area of invasive components is generally large enough to causes many visible features in CT imaging. Compared with IAC, the area of invasive component in MIA is rather small, which could only results in subtle changes in imaging that are difficult to be observed, especially in pGGO. Therefore, most of the CT features that could help pathological diagnosis were attributed to IAC instead of MIA. The difference between MIA and precursor lesions in CT imaging is still left to be less known. So, in our study, we focus on MIA and precursor lesions that appeared as pGGOs. Our study aimed to compare the semantic features we observed and the shape features and first-order features obtained from artificial intelligence among MIA and precursor lesion, which could help reveal the differences in CT imaging between the two subtypes. And we hope to determine the diagnostic value of these imaging features, which could offer new understanding of distinguishing MIA from precursor lesions. The imaging finding might be a great help in improving precise treatment of MIA and precursor lesions.

Methods

Research subjects

We retrospectively reviewed the clinical, pathological, and imaging data of 1521 patients who were diagnosed with lung adenocarcinoma or precursor lesions at our hospital between January 1, 2015, and March 1, 2021. These cases were selected based on specific inclusion and exclusion criteria. This retrospective study was reviewed and approved by the Ethic Committees of the Third Affiliated Hospital of Sun Yat-Sen University and informed consent was waived. All experiments were performed in accordance with relevant guidelines and regulations.


Inclusion criteria:

  1. 1.

    Patients who underwent surgical resection for lesions and were pathologically diagnosed with atypical adenomatous hyperplasia (AAH), adenocarcinoma in situ (AIS), or minimally invasive adenocarcinoma (MIA).

  2. 2.

    Patients who had a CT scan of the lung within 1 month before surgery.

  3. 3.

    Patients who did not receive any anti-tumor therapy before surgery.

  4. 4.

    Patients whose lesions appeared as pure ground-glass opacities (pGGOs) on CT.

  5. 5.

    Patients with peripheral lesions.

  6. 6.

    Patients with complete clinical information available in the electronic medical record system.


Exclusion criteria:

  1. 1.

    Patients who only underwent nodules pathological biopsy or bronchial biopsy, or had an uncertain pathological diagnosis.

  2. 2.

    Patients with poor-quality CT imaging due to significant motion artifacts.

  3. 3.

    Patients whose CT images could not be analyzed by the artificial intelligence scientific research platform system.

A total of 127 patients were included in the study, among which 139 cases had pGGOs. Twelve patients had two pGGOs each, while the remaining patients had a single pGGO (Supplemental Table S1). The pGGOs were further divided into two groups: AAH/AIS group and MIA group (Fig. 1).

Fig. 1
figure 1

Flow chart of inclusion.

Clinical data collection

Clinical information, including patient name, gender, age, main symptoms, hospitalization number, examination number, pathology number, pathological diagnosis description, pathology type, date of pathology report, interval between the last CT scan before surgery and surgery, invasion of pleura, lymph nodes or blood vessels, surgical method, and smoking history, was collected from the electronic medical record system. The pathological diagnosis of each lesion, made by one pathologist, was reviewed and confirmed by another senior pathologist.

CT scan and imaging features acquisition

CT scans were performed using a 320-slice Aquilion ONE CT scanner (Toshiba Medical Systems, Tokyo, Japan) and an IQon Spectral scanner (Philips Healthcare, Best, the Netherlands). The patients were positioned supine with their arms raised and head advanced. All scans were performed at the end of inspiration while holding the breath. The CT protocol included the following parameters: 120 kV, automatic tube current modulation (180 mA-440 mA), iterative reconstruction technique, detector 160/40 mm, collimation 0.5/0.625 mm, pitch 1.5, matrix 512 × 512, and rotation time 0.5/0.27 s. The reconstructed axial layer thickness and layer distance were 1 mm. Images were reconstructed using adaptive iterative dose reduction (AIDR 3D, Toshiba Medical Systems) or hybrid iterative reconstruction (iDose4, Philips Healthcare).

The CT images were independently reviewed by two radiologists who were blinded to the pathological type. The results were confirmed by a senior radiologist. The lung window was set to a window level of -600 HU and a window width of 1,600 HU, while the mediastinal window was set to a window level of 40 HU and a window width of 300 HU. pGGOs are defined as hazy increased lung opacities while preserving the bronchial and vascular margins and show no identifiable solid attenuation on lung window or soft-tissue on mediastinal window. The following semantic features of pGGOs were evaluated: lobe location, shape (round or not), margin (clear or blurred), spiculation, lobulation, vacuole sign, air bronchogram, pleural indentation, vascular convergence sign, and pleural thickening. Spiculation was defined as a radial, unbranched, straight, fine, and short line that extended from the edge of the lung nodule. Lobulation was defined as a lobulated opacity formed by the arcuated and convex contour of the lung nodule. Vacuole sign was defined as a 1–2 mm punctate transparent shadow within the lung nodule. Air bronchogram was defined as the presence of air-filled (low-attenuated) bronchi in the background of low-air, high-attenuated pulmonary nodules. Pleural indentation was defined as the invagination, shrinkage, and shifting of the visceral pleura towards the lung nodules. Vascular convergence sign was defined as the appearance of one or more vessels drawn to or passing through the lesion or truncated at the edge of the lesion.

The CT images were uploaded to the Dr. Wise® artificial intelligence-assisted diagnosis system (Shenrui Medical, Beijing, China) to automatically segment the region of interest and extract shape features and first-order features, including maximum diameter, 3D maximum diameter, maximum cross-sectional area, volume, maximum radiodensity, minimum radiodensity, mean radiodensity, variance, kurtosis, skewness, surface area, compactness, sphericity, and entropy. Any discrepancies in the interpretation of CT images between the two radiologists were resolved through mutual consultation to obtain a final consistent result.

Statistical analysis

Statistical analysis of the clinical data and CT imaging feature data was performed using IBM SPSS Statistics, version 25.0 (SPSS, Inc., Chicago, Ill., USA). The interobserver agreement on pGGOs was assessed by calculating Cohen’s Kappa coefficients. The normality of quantitative data was evaluated using the Shapiro-Wilk test. Chi-square analysis, t-test, and Mann-Whitney U test were used to analyze the statistical differences in imaging features between the groups. Logistic regression was performed to compare the two groups and evaluate the risk of invasiveness. A forward elimination process was carried out in multivariate analysis. Receiver operating characteristic (ROC) analysis was used to evaluate the diagnostic accuracy of imaging features for different pathological subtypes. A p-value < 0.05 was considered statistically significant.

Results

In this study, a total of 139 pGGOs were divided into two groups: AAH/AIS group (80 cases) and MIA group (59 cases). Reliability of interobserver agreement on pGGOs was good (Kappa = 0.934). Among the 127 patients included in the study, 34 were male (26.77%) and 93 were female (73.23%). The average interval between the last CT scan before surgery and surgery was 11.20 ± 7.34 days. All AAH lesions and most AIS/MIA lesions were resected due to their adjacency to other resected invasive adenocarcinomas. Some AIS and MIA lesions were resected based on suspicious morphology (spiculation, lobulation, vacuole sign, etc.), persistent size ≥ 10 mm, or increase in size (Supplemental Table S2). There were no significant differences in clinical features between the two groups (Table 1).

Table 1 Clinical characteristics of 115 patients diagnosed as MIA or precursor lesion with single pGGO.

Regarding CT imaging features, there was a statistically significant difference in spiculation between the two groups, while no differences were observed in the rest of the CT imaging signs (Table 2; Fig. 2). The maximum radiodensity, minimum radiodensity, mean radiodensity, variance, and skewness were found to be significantly different between the two groups. Specifically, pGGOs of AAH or AIS were more likely to show lower intensity, smaller variance, and larger skewness compared to MIA (Table 3). Univariate logistic regression analysis revealed that the levels of maximum radiodensity, minimum radiodensity, mean radiodensity, variance, skewness, and entropy were significantly associated with the pathological subtypes. Multivariate logistic regression analysis further identified maximum radiodensity and skewness as independent risk factors for MIA.

Table 2 Semantic features of 139 cases of pGGOs diagnosed as MIA or precursor lesions.
Fig. 2
figure 2

CT images of pGGOs. (a) A 38-year-old male with an AIS lesion in the posterior segment of the right upper lobe, appearing as a non-circular and blurred-margin pGGO with air bronchogram. (b) A 63-year-old female with an AIS lesion in the outer segment of the right middle lobe, appearing as a round and blurred-margin pGGO with spiculation and vascular convergence sign. (c) A 44-year-old male with an MIA lesion in the outer basal segment of the right lower lobe, appearing as a non-circular and clear-margin pGGO with spiculation, lobulation and vascular convergence sign. (d) A 28-year-old female with an MIA lesion in the posterior basal segment of the right lower lobe, appearing as a non-circular and clear-margin pGGO with pleural indentation. (e) A 43-year-old male with an MIA lesion in the outer segment of the right middle lobe, appearing as a round and blurred-margin pGGO with spiculation, vacuole sign and vascular convergence sign. AIS, adenocarcinoma in situ. MIA, minimally invasive adenocarcinoma. pGGO, pure ground-glass opacity.

Table 3 Shape features and first-order features of 139 cases of pGGO diagnosed as MIA or precursor lesions.

pGGOs with a maximum radiodensity of -430.00~-220.00 Hu and > -220.00 Hu had a 3.109-fold and 12.955-fold increased risk of MIA, respectively, compared to pGGOs with radiodensity < -430.00 Hu. Similarly, pGGOs with a skewness of -0.07~-0.02 had a 0.136-fold increased risk of MIA compared to pGGOs with skewness < -0.07 Hu (Table 4). The area under the curve (AUC) of the predicted value from the multivariate logistic regression was 0.747 (95% CI, 0.666–0.816), which was significantly larger than the AUC of maximum radiodensity (0.698 (95% CI, 0.614–0.773), p = 0.035) and skewness (0.608 (95% CI, 0.522–0.690)). However, there was no significant difference between maximum radiodensity and skewness (p = 0.070) (Fig. 3).

Table 4 Logistic regression model for differentiate MIA from precursor lesions.
Fig. 3
figure 3

The ROC curve of maximum radiodensity, skewness and predicted value of multivariate Logistic regresson to distinguish AAH / AIS from MIA in pGGOs. AAH, atypical adenomatous hyperplasia. AIS, adenocarcinoma in situ. MIA, minimally invasive adenocarcinoma. pGGO, pure ground-glass opacity. ROC curve, receiver operating characteristic curve.

Discussion

In this study, we observed a higher proportion of female patients compared to male patients, which is consistent with previous studies12,26,27. This may be attributed to the increasing incidence of lung adenocarcinoma in women in recent years28. The age distribution of patients in our study is similar to previous studies, indicating a decreasing age of onset of lung cancer12,29.

The majority of patients in our study chose sublobectomy as it has a similar prognosis to lobectomy among the three pathological subtypes and allows for the preservation of more normal lung tissue with good function30,31.

The development from atypical adenomatous hyperplasia (AAH), adenocarcinoma in situ (AIS), and minimally invasive adenocarcinoma (MIA) to invasive adenocarcinoma (IAC) is a continuous evolution process8. In lesions with the pathological subtypes of AAH or AIS, tumor cells proliferate along the alveolar wall, leading to an increase in the thickness of the alveolar wall without collapse of the alveolar cavity. As tumor cells infiltrate and fibrotic components increase, an area of invasive adenocarcinoma measuring ≤ 5 mm in diameter appears, eventually developing into MIA32,33. These pathological changes can affect the texture, density, and morphology of the lesion.

Our study found that spiculation was more likely to appear in pGGOs of MIA compared to precursor lesions. This may be attributed to the direct infiltration of the tumor into adjacent bronchovascular sheaths in MIA. Previous studies have also highlighted the importance of morphology features of ground-glass opacities (GGOs) in predicting invasiveness34,35,36. However, in our study, most semantic features were not significantly different between precursor lesions and MIA. This may be because MIA lesions begin to appear with invasive components and increased tumor heterogeneity, resulting in subtle pathological changes that do not manifest visibly in morphology.

Our study also found that differences between precursor lesions and MIA were primarily attributed to intensity features, indicating that subtle progression in pathology may be manifested by intensity features to some extent. Our study demonstrated that the levels of maximum radiodensity and skewness were independent risk factors for pathology. Furthermore, the combination of these two features significantly improved the diagnostic accuracy compared to using them individually. A pGGO with higher maximum radiodensity and smaller negative skewness is more likely to be MIA, possibly due to increased consolidation. This is consistent with previous studies36,37,38. However, it is important to note that CT features with significant differences did not demonstrate high diagnostic efficiency in our study, similar to previous research39. This suggests that further research utilizing radiomics and extracting more comprehensive imaging features from CT images is necessary to overcome the limitations of conventional imaging diagnosis and assist in classification and diagnosis.

According to multiple studies, both MIA and AIS have good prognosis. It has been shown shown 100% 5-year disease-free survival in both tumor types when completely resected10,11,12,13,14,15. However, during the process of AIS developing into MIA, there is an appearance of invasive component which is attributed to progression of invasiveness. Though the area of invasive component is quite small, as the same as it’s in invasive adenocarcinoma, it could show different growth patterns in MIA, such as lepidic, acinar, papillary, micropapillary and solid pattern. Among these growth patterns, papillary pattern is found to be related to intermediate prognosis40. Whereas micropapillary and solid patterns are considered to be high-grade pattern because of their association of poor prognosis. It has been found that micropapillary growth is associated to lymphatic and pleural invasion and nodal metastases which are adverse prognostic factors13,29. Even a small amount of micropapillary pattern could lead to significantly worse overall survival41. In this case, if the invasive component in MIA appears high-grade growth pattern, it might not show the same prognosis as the common types do. There have been few studies that have investigated the prognosis of MIA with different invasive components. Jia et al. found a few cases of MIA showed papillary, micropapillary and solid pattern and all the cases showed recurrence or metachronous adenocarcinoma. Among these lesions, there is one that appears as pGGO showed papillary pattern16. Also, they found significant differences in the expressions of p53, Ki67 and cyclin D1 and EGFR mutation rate between the AIS and MIA groups, which might associate with differences of tumor progression. In addition, Honda et al. found a pGGO containing micropapillary component and the same type of component was also found in mediastinal lymph nodes metastases42. Even though this lesion is invasive adenocarcinoma, this finding can still provide the prove of the possibility of existence of micropapillary component in pGGO and it’s associated with lymph nodes metastases. Considering that the good prognosis of MIA being shown in other studies might be due to the low occurrence rate of these unique types of MIA, these factors that could affect prognosis should not be neglected when clinical decisions on MIA being made. Therefore, our findings in differences between MIA and precursor lesions in pGGOs might have significant value on adjusting follow-up strategy or clinical treatment. The CT features with significant differences between MIA and precursor lesions could make a great help of improving the diagnostic accuracy of pathology and assisting precise management of patients.

This study has several limitations. Firstly, it is a single-center study, which may introduce selection bias and limit the generalizability of the results. Additionally, the relatively small number of cases in this study may have affected the accuracy of the statistical analysis and the clinical application of the research conclusions. Furthermore, the combination of the AAH and AIS groups in our study prevented us from exploring the differences in imaging features between AAH and the other two pathological subtypes. Also, all AAH lesions and most AIS/MIA lesions were resected due to their adjacency to other resected invasive adenocarcinomas, which might cause selection bias.

In conclusion, our study analyzed the correlation between CT imaging features of pGGOs and the pathological subtypes of AAH/AIS and MIA. We found significant differences in intensity features between the two groups, and maximum radiodensity and skewness were identified as independent risk factors for pathology. However, these features did not exhibit strong diagnostic efficiency. Future research should focus on utilizing radiomics and extracting more comprehensive imaging features to improve the accuracy of diagnosis and classification.