|
1.INTRODUCTIONLung cancer, particularly non-small cell lung cancer (NSCLC), remains a major global health challenge due to its high incidence and mortality rates [1, 2]. NSCLC includes subtypes such as squamous carcinoma, pulmonary adenocarcinoma, and large cell carcinoma, accounting for approximately 85% of all lung cancer cases [3-5]. Among these, pulmonary adenocarcinoma can be further classified into adenocarcinoma in situ (AIS), minimally invasive adenocarcinoma (MIA), and invasive adenocarcinoma (IAC), each requiring different treatment approaches. Computed tomography (CT) imaging is a primary diagnostic tool for lung adenocarcinoma. However, distinguishing between the subtypes of pulmonary mini-nodules can be challenging due to limited resolution and the sheer volume of images needing interpretation [6]. Therefore, a non-invasive, repeatable, and quantitative method for the classification of these subtypes would be highly beneficial in clinical practice. Despite the life-saving benefits of early CT detection, this imaging modality has several limitations, including high rates of detection of indeterminate pulmonary nodules [7-10]. Radiomics, which involves the extraction of quantitative features from medical images, offers a promising approach to this challenge [10-14]. This approach allows doctors to gain a more comprehensive view of lesions and enables quantitative, high-throughput analyses of lesion development and progression [15-19] which can reflect both biological and medical image information of lesions, providing valuable insights for diagnosis, prognosis, and disease prediction[20-25]. This study aims to develop a model for the classification of pulmonary adenocarcinoma subtypes using radiomics and a random forest algorithm. The integration of advanced feature selection techniques with a robust classification model is anticipated to significantly improve diagnostic accuracy 2.MATERIALS AND METHODSThe study was approved by the Ethics Board of Shanghai University of Medicine & Health Sciences. The Ethics Committee gave up the written informed consent of the patient because it was a retrospective experiment and did not involve patient privacy. 2.1Data collection and preprocessingA total of 295 lesions were detected in 289 patients at the Shanghai Public Health Clinical Center (hospital 1). Additionally, in a recent examination, 127 patients with 138 lesions were re-selected at the same hospital (hospital 1.1). The basic patient information is detailed in Table 1. The selection criteria included: (1) patients who had undergone thin-slice chest CT imaging, (2) patients who had undergone an unenhanced chest CT exam, and (3) patients diagnosed with lung adenocarcinoma by doctors. Among these patients, one pGGN was detected in 187 patients and two in 6 patients. All lesions were pathologically diagnosed as 174 MIA, 54 AIS, and 67 IAC. Table 1.Demographic characteristics
Ages and size are shown as mean ± standard deviation; M male, F female *p value < 0.05 For the study, 236 lesions were allocated to the training set, while 59 were included in the testing set. Two to eight images with clear features were selected from the CT scans of each lesion, resulting in 853 CT images in the training set and 213 in the testing set. Additionally, 138 lesions diagnosed as 75 MIA, 36 AIS, and 27 IAC from two other hospitals were included. One to three images with clear features were selected from the CT scans of each lesion in this new dataset, yielding 267 CT images in the validation set. The patient selection flowchart is depicted in Figure 1. 2.2CT scanningAn unenhanced chest CT exam was performed during the entire lung scan of each patient with a United-Imaging 760 CT device (42-126 mA 120 kV, slice thickness of 1 mm), a Siemens Emotion 16 CT device (34-123 mA, 130 kV, 1 mm) and a Siemens Perspective 77427 CT device (155mA, 110kV, 1mm), under a 512×512 resolution. 2.3Feature extractionA semi-automated, unsupervised method was employed to segment the region of interest (ROI) for each CT image[26]. This segmentation was subsequently confirmed by a radiologist with a decade of experience in chest CT interpretation. Radiographic characteristics were independently evaluated by two experienced thoracic radiologists, with 6 and 13 years of experience in chest CT interpretation, respectively, who were blinded to the pathological results. Discrepancies between the observers were resolved through consensus. The radiographic characteristics analyzed for each lesion included: (1) margin (clear, blurred), (2) lobulation (absent, present), (3) spiculation (absent, present), (4) pleural attachment, including pleural tag and indentation (absent, present), (5) air bronchogram (absent, present), (6) vessel change (absent, present), and (7) bubble lucency (absent, present) (Table 2). Table 2.The Parameters of lesions in datasets
A total of 1,068 features were extracted from the ROI using commercial software (PyRadiomics 3.0.1)[27]. These features included tumor size, shape, first-order statistical descriptors (histogram features), and high-order texture features (gray level co-occurrence matrix and gray level run length). The original images were normalized prior to feature extraction. 2.4Feature selectionGiven that the initial number of features extracted was close to the number of patients, a systematic feature selection process was implemented to avoid overfitting. The feature set was divided into radiographic characteristics and radiomic features. All radiographic characteristic features were preserved due to their clinical relevance. For the radiomic features, an initial screening was conducted where features with low variance were removed as they provide minimal information and can introduce noise. Furthermore, highly correlated features (correlation coefficient > 0.9) were identified, and one feature from each pair was removed to reduce multicollinearity. Feature importance was then assessed using the ’RandomForest’ [28] package to compute the ’MeanDecreaseGini’ metric, ranking features in decreasing order of their importance scores. To determine the optimal subset of radiomic features, cross-validation was employed, calculating the root mean squared error (RMSE) for various feature subsets and identifying the subset that minimized RMSE. This iterative process ensured the retention of the most predictive features. Additionally, the correlation of each feature in the optimal radiomic feature set was analyzed to ensure that the selected features were not redundant and provided unique information. The final selected features included original_shape2D_Perimeter, squareroot_firstorder_90Percentile, original_shape_Maximum2DDiameterRow, wavelet.LL_glcm_ClusterShade, wavelet.HL_glszm_SmallAreaLowGrayLevelEmphasis, original_shape2D_Sphericity, diagnostics_Image.original_Mean, wavelet.LL_firstorder_Minimum, and logarithm_firstorder_90Percentile. The optimal radiomic feature subset was then combined with the radiographic characteristics feature set to form a comprehensive training feature set, which was used for model training. This thorough feature selection process ensured that the final model included only the most relevant and non-redundant features, thereby enhancing its predictive performance and robustness. 2.5Model training and validationBased on the preliminary training of all features, the optimal values of parameters ’mtry’ and ’ntree’ required by the random forest model were calculated (mtry=20, ntree=900, Figure 3). Then, on the premise of the same parameters, the random forest model was trained with those three feature sets respectively (training feature set as model 1, radiographic characteristics feature set as model 2 and the optimal radiomic feature set as model 3). At the same time, SVM, Logistic Regression and C5.0 algorithm were used to verify the results. 2.6Statistical analysisQuantitative data were presented as mean ± SD or median (25th–75th percentile), while qualitative data were described as counts (n). Comparisons between groups for qualitative variables were performed using Fisher’s exact test, and comparisons for quantitative variables were performed using either a t-test or the Wilcoxon test. A p-value of less than 0.05 was considered statistically significant. The Area Under the Curve (AUC) represents the degree of separability and serves as a summary measure of the accuracy of a quantitative diagnostic test. The Receiver Operating Characteristic (ROC) curve, a commonly used graph to summarize the performance of a classifier across all possible thresholds, is generated by plotting the true positive rate (y-axis) against the false positive rate (x-axis). The ROC curve and AUC were used to evaluate the diagnostic performance, precision, and discrimination accuracy of the models. All statistical analyses were performed using R statistical software (http://www.Rproject.org, version 3.6.0) with the following packages: ’randomForest’, ’ggplot2’, ’caret’, ’pROC’, ’e1071’, and ’reshape2’. 2.7ResultsThe performance of the radiomics model constructed using random forest was evaluated by precisions and accuracies. The ROC curves of random forest models based on three feature set are shown in Figure 4. Because there are three types of lesions, the subsequent evaluation is to combine two of them into one and compare them with the remaining one (MIA represents the comparison between MIA and IAC & AIS, AIS represents the comparison between AIS and IAC & MIA, IAC represents the comparison between IAC and MIA & AIS). For the testing dataset, the AUC scores of model 1, model 2, and model 3 were 0.974, 0.483, and 0.835, respectively. The sensitivity values of model 1, model 2, and model 3 at the optimum critical point were 0.873(MIA:0.967, AIS:0.825, IAC:0.826), 0.814(MIA:0.866, AIS:0.750, IAC:0.826), and 0.641(MIA:0.902, AIS:0.378, IAC:0.642). The specificity values of model 1, model 2, and model 3 at the optimum critical point were 0.936(MIA:0.837, AIS:0.994, IAC:0.976), 0.897(MIA:0.791, AIS:0.965, IAC:0.934), and 0.829(MIA:0.589, AIS:0.972, IAC:0.925). For validation dataset, the AUC scores of model 1, model 2, and model 3 were 0.964, 0.915, and 0.841, respectively. The sensitivity values of model 1, model 2, and model 3 at the optimum critical point were 0.867(MIA:0.940, AIS:0.837, IAC:0.824), 0.825(MIA:0.880, AIS:0.756, IAC:0.838), and 0.648(MIA:0.927, AIS:0.245, IAC:0.772). The specificity values of model 1, model 2, and model 3 at the optimum critical point were 0.932(MIA:0.855, AIS:0.986, IAC:0.955), 0.906(MIA:0.803, AIS:0.949, IAC:0.965), and 0.798(MIA:0.521, AIS:0.986, IAC:0.886). The performance of the random forest models based on the three feature sets are shown in the Table 3. Table 3.The performance of random forest models in testing and validation datasets
At the same time, SVM, Logistic Regression and C5.0 algorithm used the same training dataset and training feature set to train the model. Evaluate the performance of the model on the testing dataset. For SVM model, the AUC score was 0.734. The sensitivity values were MIA:0.921, AIS:0.750, IAC:0.348. The specificity values were MIA:0.547, AIS:0.959, IAC:0.976. For Logistic Regression model, the AUC score was 0.780. The sensitivity values were MIA:0.850, AIS:0.750, IAC:0.587. The specificity values were MIA:0.709, AIS:0.971, IAC:0.892. For C5.0 model, the AUC score was 0.700. The sensitivity values were MIA:0.843, AIS:0.475, IAC:0.522. The specificity values were MIA:0.558, AIS:0.965, IAC:0.886. The details are shown in the table 4. Table 4.The performance of validation models in testing datasets
3.DISCUSSIONThis study demonstrates the potential of using a random forest-based classification model incorporating radiomics and radiographic features for the differentiation of pulmonary adenocarcinoma subtypes. The key findings indicate that the combination of radiomic features with traditional radiographic characteristics significantly enhances the accuracy of lung adenocarcinoma classification compared to using either feature set alone. The random forest model achieved high AUC scores for both training and validation datasets, highlighting its robustness and reliability. The model’s ability to effectively integrate radiomic features, which capture complex imaging patterns not visible to the naked eye, with conventional radiographic features, underscores the importance of a multifaceted approach in medical imaging analysis. This combined feature set resulted in a more comprehensive representation of the lesions, improving the model’s diagnostic performance. Radiomics offers a transformative approach by converting medical images into high-dimensional data that reflect underlying tumor biology. The study’s findings support the utility of radiomics in clinical settings, providing a non-invasive and reproducible method for lesion characterization. The enhanced ability to classify subtypes of pulmonary adenocarcinoma can lead to more personalized treatment strategies, potentially improving patient outcomes. Despite the promising results, several limitations were noted. The study was conducted at a single center with a relatively small sample size, which may limit the generalizability of the findings. Potential biases in the dataset due to retrospective data collection and selection criteria may affect the model’s performance. Additionally, the semi-automatic segmentation method, while effective, still involves manual steps that could introduce variability. The generalizability of the model to different populations or imaging protocols remains uncertain due to the specific CT imaging protocols used in this study. Future research should aim to develop fully automated segmentation techniques to further standardize the analysis and include multi-center data to validate the model across diverse patient cohorts and imaging conditions. Furthermore, the study’s feature selection process, while rigorous, still resulted in a large number of features that require manual assessment by clinicians. Incorporating additional clinical data and expanding the radiomics feature set could enhance the model’s accuracy and clinical applicability. To address these limitations and improve the model’s utility, future studies should focus on multi-center collaborations to validate the findings across diverse populations and imaging equipment. Increasing the dataset size and diversity would enhance the model’s robustness and applicability in different clinical scenarios. Additionally, integrating clinical data such as patient history and genetic markers could provide a more holistic approach to lung adenocarcinoma classification. This study highlights the significant potential of radiomics and machine learning in enhancing lung cancer diagnostics. The developed model represents a step forward in the integration of advanced imaging techniques with artificial intelligence, paving the way for more precise and personalized cancer care. Further research and technological advancements will be crucial in fully realizing the benefits of this innovative approach in clinical practice. 4.CONCLUSIONSIn conclusion, we developed an automatic pulmonary adenocarcinoma grading model utilizing the most significant radiomics features through a random forest method. This model demonstrates potential in aiding rapid clinical diagnosis and triage. Future research should focus on extracting additional radiomics parameters and incorporating patient clinical characteristics to enhance model accuracy. ETHICS DECLARATIONSEthics approval and consent to participate. This article does not contain any studies with human participants or animals performed by any of the authors. Ethical approval for this study was obtained from Shanghai Public Health Clinical Center. REFERENCESMcGuire, and S.,
“Geneva, Switzerland: World Health Organization, International Agency for Research on Cancer, WHO Press, 2015,”
Advances in Nutrition An International Review Journal, 7.2
(2016), 418
–419 Google Scholar
El Kinany K, Huybrechts I, Kampman E,
“Concordance with the World Cancer Research Fund/American Institute for Cancer Research recommendations for cancer prevention and colorectal cancer risk in Morocco: A large, population-based case-control study,”
Int J Cancer, 145
(7), 1829
–1837
(2019). https://doi.org/10.1002/ijc.v145.7 Google Scholar
Donahue JM, Morse CR, Wigle DA,
“Oncologic efficacy of anatomic segmentectomy in stage IA lung cancer patients with T1a tumors,”
Ann Thorac Surg, 93
(2), 381387
–387388
(2012). https://doi.org/10.1016/j.athoracsur.2011.10.079 Google Scholar
Travis WD, Brambilla E, Noguchi M,
“International Association for the Study of Lung Cancer/American Thoracic Society/European Respiratory Society: international multidisciplinary classification of lung adenocarcinoma: executive summary,”
Proc Am Thorac Soc, 8
(5), 381
–385
(2011). https://doi.org/10.1513/pats.201107-042ST Google Scholar
He J, Hu Y, Hu M, Li B.,
“Development of PD-1/PD-L1 Pathway in Tumor Immune Microenvironment and Treatment for Non-Small Cell Lung Cancer,”
Sci Rep, 5 13110
(2015). https://doi.org/10.1038/srep13110 Google Scholar
Sun Z, Ng K.H, & Sarji S.A.,
“Is utilisation of computed tomography justified in clinical practice? Part IV: applications of paediatric computed tomography,”
Singapore medical journal, 51
(6), 457
–463
(2010). Google Scholar
Alahmari SS, Cherezov D, Goldgof D, Hall L, Gillies RJ, Schabath MB,
“Delta Radiomics Improves Pulmonary Nodule Malignancy Prediction in Lung Cancer Screening,”
IEEE Access, 6 77796
–77806
(2018). https://doi.org/10.1109/Access.6287639 Google Scholar
Choi W, Nadeem S, Alam SR, Deasy JO, Tannenbaum A, Lu W.,
“Reproducible and Interpretable Spiculation Quantification for Lung Cancer Screening,”
Comput Methods Programs Biomed, 2020 105839 Google Scholar
Balagurunathan Y, Schabath MB, Wang H, Liu Y, Gillies RJ.,
“Quantitative Imaging features Improve Discrimination of Malignancy in Pulmonary nodules,”
Sci Rep, 9
(1), 8528
(2019). https://doi.org/10.1038/s41598-019-44562-z Google Scholar
Xu Y, Lu L, E LN,
“Application of Radiomics in Predicting the Malignancy of Pulmonary Nodules in Different Sizes,”
AJR Am J Roentgenol, 213
(6), 1213
–1220
(2019). https://doi.org/10.2214/AJR.19.21490 Google Scholar
Yang X, He J, Wang J,
“CT-based radiomics signature for differentiating solitary granulomatous nodules from solid lung adenocarcinoma,”
Lung Cancer, 125 109
–114
(2018). https://doi.org/10.1016/j.lungcan.2018.09.013 Google Scholar
Ortiz-Ramon R, Larroza A, Arana E, Moratal D.,
“A radiomics evaluation of 2D and 3D MRI texture features to classify brain metastases from lung cancer and melanoma,”
Annu Int Conf IEEE Eng Med Biol Soc, 2017 493
–496
(2017). Google Scholar
Kalpathy-Cramer J, Mamomov A, Zhao B,
“Radiomics of Lung Nodules: A Multi-Institutional Study of Robustness and Agreement of Quantitative Imaging Features,”
Tomography, 2
(4), 430
–437
(2016). https://doi.org/10.18383/j.tom.2016.00235 Google Scholar
Zhao W, Zhang W, Sun Y,
“Convolution kernel and iterative reconstruction affect the diagnostic performance of radiomics and deep learning in lung adenocarcinoma pathological subtypes,”
Thorac Cancer, 10
(10), 1893
–1903
(2019). https://doi.org/10.1111/tca.v10.10 Google Scholar
Aerts HJ, Velazquez ER, Leijenaar RT,
“Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach,”
Nat Commun, 5 4006
(2014). https://doi.org/10.1038/ncomms5006 Google Scholar
Constanzo J, Wei L, Tseng HH, El Naqa I.,
“Radiomics in precision medicine for lung cancer,”
Transl Lung Cancer Res, 6
(6), 635
–647
(2017). https://doi.org/10.21037/tlcr Google Scholar
Kalra M, Wang G, Orton CG.,
“Radiomics in lung cancer: Its time is here,”
Med Phys, 45
(3), 997
–1000
(2018). https://doi.org/10.1002/mp.2018.45.issue-3 Google Scholar
Shen C, Liu Z, Guan M,
“2D and 3D CT Radiomics Features Prognostic Performance Comparison in Non-Small Cell Lung Cancer,”
Transl Oncol, 10
(6), 886
–894
(2017). https://doi.org/10.1016/j.tranon.2017.08.007 Google Scholar
Wilson R, Devaraj A.,
“Radiomics of pulmonary nodules and lung cancer,”
Transl Lung Cancer Res, 6
(1), 86
–91
(2017). https://doi.org/10.21037/tlcr Google Scholar
Gillies RJK, Paul E, Hricak H.,
“Radiomics: Images Are More than Pictures, They Are Data,”
Radiology, 278
(2), 563
–77
(2016). https://doi.org/10.1148/radiol.2015151169 Google Scholar
Nardone V, Boldrini L, Grassi R, Franceschini D, & Desideri I.,
“Radiomics in the setting of neoadjuvant radiotherapy: a new approach for tailored treatment,”
Cancers, 13
(14), 3590
(2021). https://doi.org/10.3390/cancers13143590 Google Scholar
Dos Santos D, Dietzel M, Baessler B.,
“A decade of radiomics research: are images really data or just patterns in the noise?,”
Eur Radiol, 31
(1), 1
–4
(2021). https://doi.org/10.1007/s00330-020-07108-w Google Scholar
Bae JM, Jeong JY, Lee HY,
“Pathologic stratification of operable lung adenocarcinoma using radiomics features extracted from dual energy CT images,”
Oncotarget, 8
(1), 523
–535
(2017). https://doi.org/10.18632/oncotarget.v8i1 Google Scholar
Thawani R, McLane M, Beig N,
“Radiomics and radiogenomics in lung cancer: A review for the clinician,”
Lung Cancer, 115 34
–41
(2018). https://doi.org/10.1016/j.lungcan.2017.10.015 Google Scholar
Yang F, Simpson G, Young L, Ford J, Dogan N, Wang L.,
“Impact of contouring variability on oncological PET radiomics features in the lung,”
Sci Rep, 10
(1), 369
(2020). https://doi.org/10.1038/s41598-019-57171-7 Google Scholar
Ren H, Zhou L, Liu G,
“An unsupervised semi-automated pulmonary nodule segmentation method based on enhanced region growing,”
Quant Imaging Med Surg, 10
(1), 233
–242
(2020). https://doi.org/10.21037/qims Google Scholar
Griethuysen JJM, Fedorov A, Parmar C,
“Computational Radiomics System to Decode the Radiographic Phenotype,”
Cancer Res, 77
(21), e104
–e107
(2017). https://doi.org/10.1158/0008-5472.CAN-17-0339 Google Scholar
Svetnik V, Liaw A, Tong C, Culberson JC, Sheridan RP, Feuston BP.,
“Random forest: a classification and regression tool for compound classification and QSAR modeling,”
J Chem Inf Comput Sci, 43
(6), 1947
–1958
(2003). https://doi.org/10.1021/ci034160g Google Scholar
|