Open Access Paper
11 September 2024 The construction of the model of pulmonary adenocarcinoma classification based on radiomics and random forest
He Ren, Zhengguang Xiao, Yijie Li, Mengting Tu, Qi Sun, Ping Lu, Yejun Cao, Hanqing Chen, Ping Li
Author Affiliations +
Proceedings Volume 13270, International Conference on Future of Medicine and Biological Information Engineering (MBIE 2024); 132700C (2024) https://doi.org/10.1117/12.3048034
Event: 2024 International Conference on Future of Medicine and Biological Information Engineering (MBIE 2024), 2024, Shenyang, China
Abstract
Purpose: The present study aimed to construct classification models for pulmonary adenocarcinoma using computed tomography (CT)‐based radiomics features and random forest method. Methods: A total of 289 patients with 295 lung adenocarcinomas were included in this study. A total of 1066 CT images were extracted. The final data set was randomized into the training set and validation set at the ratio of 80%:20%. A total of 1082 features were captured from a semi‐automatic segmentation method segmented lesion of a CT image. 9 optimal radiomic features obtained from root mean squared error (REMS) through cross validation and 14 radiographic characteristic features were selected to construct a random forest classification model. At the same time, compared with the results of the Support Vector Machine (SVM), Logistic Regression and C5.0 algorithm. Results: The area under the curve (AUC) scores of training feature set, radiographic characteristics feature set, and the optimal radiomic feature set for testing dataset were 0.974, 0.483, and 0.835, respectively, and the corresponding AUC values for validation dataset were 0.964, 0.915, and 0.841, separately. Conclusion: The developed random forest‐based classification models using radiomics features and radiographic features of CT showed a relatively acceptable performance in lung adenocarcinoma and could assist clinical rapid diagnosis and triage.

1.

INTRODUCTION

Lung cancer, particularly non-small cell lung cancer (NSCLC), remains a major global health challenge due to its high incidence and mortality rates [1, 2]. NSCLC includes subtypes such as squamous carcinoma, pulmonary adenocarcinoma, and large cell carcinoma, accounting for approximately 85% of all lung cancer cases [3-5]. Among these, pulmonary adenocarcinoma can be further classified into adenocarcinoma in situ (AIS), minimally invasive adenocarcinoma (MIA), and invasive adenocarcinoma (IAC), each requiring different treatment approaches. Computed tomography (CT) imaging is a primary diagnostic tool for lung adenocarcinoma. However, distinguishing between the subtypes of pulmonary mini-nodules can be challenging due to limited resolution and the sheer volume of images needing interpretation [6]. Therefore, a non-invasive, repeatable, and quantitative method for the classification of these subtypes would be highly beneficial in clinical practice.

Despite the life-saving benefits of early CT detection, this imaging modality has several limitations, including high rates of detection of indeterminate pulmonary nodules [7-10]. Radiomics, which involves the extraction of quantitative features from medical images, offers a promising approach to this challenge [10-14]. This approach allows doctors to gain a more comprehensive view of lesions and enables quantitative, high-throughput analyses of lesion development and progression [15-19] which can reflect both biological and medical image information of lesions, providing valuable insights for diagnosis, prognosis, and disease prediction[20-25].

This study aims to develop a model for the classification of pulmonary adenocarcinoma subtypes using radiomics and a random forest algorithm. The integration of advanced feature selection techniques with a robust classification model is anticipated to significantly improve diagnostic accuracy

2.

MATERIALS AND METHODS

The study was approved by the Ethics Board of Shanghai University of Medicine & Health Sciences. The Ethics Committee gave up the written informed consent of the patient because it was a retrospective experiment and did not involve patient privacy.

2.1

Data collection and preprocessing

A total of 295 lesions were detected in 289 patients at the Shanghai Public Health Clinical Center (hospital 1). Additionally, in a recent examination, 127 patients with 138 lesions were re-selected at the same hospital (hospital 1.1). The basic patient information is detailed in Table 1. The selection criteria included: (1) patients who had undergone thin-slice chest CT imaging, (2) patients who had undergone an unenhanced chest CT exam, and (3) patients diagnosed with lung adenocarcinoma by doctors. Among these patients, one pGGN was detected in 187 patients and two in 6 patients. All lesions were pathologically diagnosed as 174 MIA, 54 AIS, and 67 IAC.

Table 1.

Demographic characteristics

 Full DatasetTraining & Testing DatasetValidation Datasetp value
Age52.26±1.5452.46±0.6451.78±3.02*
Long diameter9.69±0.159.73±0.189.60±0.25*
Short diameter7.62±0.127.65±0.157.56±0.21*
Sex176M,240F113M,176F63M 64F 

Ages and size are shown as mean ± standard deviation;

M male, F female

*

p value < 0.05

For the study, 236 lesions were allocated to the training set, while 59 were included in the testing set. Two to eight images with clear features were selected from the CT scans of each lesion, resulting in 853 CT images in the training set and 213 in the testing set. Additionally, 138 lesions diagnosed as 75 MIA, 36 AIS, and 27 IAC from two other hospitals were included. One to three images with clear features were selected from the CT scans of each lesion in this new dataset, yielding 267 CT images in the validation set. The patient selection flowchart is depicted in Figure 1.

Figure 1.

Flow chart of case sample selection

00172_PSISDG13270_132700C_page_3_1.jpg

2.2

CT scanning

An unenhanced chest CT exam was performed during the entire lung scan of each patient with a United-Imaging 760 CT device (42-126 mA 120 kV, slice thickness of 1 mm), a Siemens Emotion 16 CT device (34-123 mA, 130 kV, 1 mm) and a Siemens Perspective 77427 CT device (155mA, 110kV, 1mm), under a 512×512 resolution.

2.3

Feature extraction

A semi-automated, unsupervised method was employed to segment the region of interest (ROI) for each CT image[26]. This segmentation was subsequently confirmed by a radiologist with a decade of experience in chest CT interpretation.

Radiographic characteristics were independently evaluated by two experienced thoracic radiologists, with 6 and 13 years of experience in chest CT interpretation, respectively, who were blinded to the pathological results. Discrepancies between the observers were resolved through consensus. The radiographic characteristics analyzed for each lesion included: (1) margin (clear, blurred), (2) lobulation (absent, present), (3) spiculation (absent, present), (4) pleural attachment, including pleural tag and indentation (absent, present), (5) air bronchogram (absent, present), (6) vessel change (absent, present), and (7) bubble lucency (absent, present) (Table 2).

Table 2.

The Parameters of lesions in datasets

  Training datasetTesting datasetValidation dataset
 MIA473127150
CategoryAIS1304049
 IAC2504668
LobeLeft34585115
Right508128152
  S1+2 124S6 48S1+2 20S6 17S1+2 42S6 14
 LeftS3 73S8 13S3 22S8 3S3 22S8 3
  S4 20S9 29S4 3S9 4S4 4S9 14
  S5 15S10 23S5 4S10 12S5 5S10 11
Segment S1 98S6 58S1 24S6 16S1 22S6 21
  S2 121S7 10S2 34S7 1S2 38S7 5
 RightS3 88S8 54S3 16S8 9S3 22S8 14
  S4 28S9 35S4 13S9 8S4 17S9 7
  S5 10S10 6S5 4S10 3S5 6S10 0
MarginBlurred786195243
 Clear671824
LobulationPresent651162192
 Absent2025175
SpiculationPresent637156191
 Absent2165776
Pleural AttachmentPresent2365061
 Absent617163206
Air BronchogramPresent427114134
 Absent42699133
Vessel ChangePresent706187227
 Absent1472640
Bubble LucencyPresent1904948
 Absent663164219
Right lung, superior lobeRight lung, inferior lobe 
Apical segment [S1]Superior segment (Fowler) [S6]Left lung, superior lobe
Posterior segment [S2]Medial basal segment [S7]Apicoposterior segment [S1+2]
Anterior segment [S3]Anterior basal segment [S8]Anterior segment [S3]
Right lung, middle lobeLateral basal segment [S9]Superior lingular segment [S4]
Lateral segment [S4]Posterior basal segment [S10]Inferior lingular segment [S5]
Medial segment [S5]  

A total of 1,068 features were extracted from the ROI using commercial software (PyRadiomics 3.0.1)[27]. These features included tumor size, shape, first-order statistical descriptors (histogram features), and high-order texture features (gray level co-occurrence matrix and gray level run length). The original images were normalized prior to feature extraction.

2.4

Feature selection

Given that the initial number of features extracted was close to the number of patients, a systematic feature selection process was implemented to avoid overfitting. The feature set was divided into radiographic characteristics and radiomic features. All radiographic characteristic features were preserved due to their clinical relevance. For the radiomic features, an initial screening was conducted where features with low variance were removed as they provide minimal information and can introduce noise. Furthermore, highly correlated features (correlation coefficient > 0.9) were identified, and one feature from each pair was removed to reduce multicollinearity. Feature importance was then assessed using the ’RandomForest’ [28] package to compute the ’MeanDecreaseGini’ metric, ranking features in decreasing order of their importance scores. To determine the optimal subset of radiomic features, cross-validation was employed, calculating the root mean squared error (RMSE) for various feature subsets and identifying the subset that minimized RMSE. This iterative process ensured the retention of the most predictive features. Additionally, the correlation of each feature in the optimal radiomic feature set was analyzed to ensure that the selected features were not redundant and provided unique information. The final selected features included original_shape2D_Perimeter, squareroot_firstorder_90Percentile, original_shape_Maximum2DDiameterRow, wavelet.LL_glcm_ClusterShade, wavelet.HL_glszm_SmallAreaLowGrayLevelEmphasis, original_shape2D_Sphericity, diagnostics_Image.original_Mean, wavelet.LL_firstorder_Minimum, and logarithm_firstorder_90Percentile.

The optimal radiomic feature subset was then combined with the radiographic characteristics feature set to form a comprehensive training feature set, which was used for model training. This thorough feature selection process ensured that the final model included only the most relevant and non-redundant features, thereby enhancing its predictive performance and robustness.

Figure 2.

The optimal radiomic feature subset and its performance are obtained through cross validation. (a) represents the performance of cross validation. (b) represents the correlation diagram of 9 features. (c) represents the important value of 9 features.

00172_PSISDG13270_132700C_page_5_1.jpg

2.5

Model training and validation

Based on the preliminary training of all features, the optimal values of parameters ’mtry’ and ’ntree’ required by the random forest model were calculated (mtry=20, ntree=900, Figure 3). Then, on the premise of the same parameters, the random forest model was trained with those three feature sets respectively (training feature set as model 1, radiographic characteristics feature set as model 2 and the optimal radiomic feature set as model 3). At the same time, SVM, Logistic Regression and C5.0 algorithm were used to verify the results.

Figure 3.

Optimal parameters of random forest model. (a) represents the performance of the ‘mtry’. (b) represents the performance of the ‘ntree’.

00172_PSISDG13270_132700C_page_6_1.jpg

2.6

Statistical analysis

Quantitative data were presented as mean ± SD or median (25th–75th percentile), while qualitative data were described as counts (n). Comparisons between groups for qualitative variables were performed using Fisher’s exact test, and comparisons for quantitative variables were performed using either a t-test or the Wilcoxon test. A p-value of less than 0.05 was considered statistically significant.

The Area Under the Curve (AUC) represents the degree of separability and serves as a summary measure of the accuracy of a quantitative diagnostic test. The Receiver Operating Characteristic (ROC) curve, a commonly used graph to summarize the performance of a classifier across all possible thresholds, is generated by plotting the true positive rate (y-axis) against the false positive rate (x-axis). The ROC curve and AUC were used to evaluate the diagnostic performance, precision, and discrimination accuracy of the models.

All statistical analyses were performed using R statistical software (http://www.Rproject.org, version 3.6.0) with the following packages: ’randomForest’, ’ggplot2’, ’caret’, ’pROC’, ’e1071’, and ’reshape2’.

2.7

Results

The performance of the radiomics model constructed using random forest was evaluated by precisions and accuracies. The ROC curves of random forest models based on three feature set are shown in Figure 4. Because there are three types of lesions, the subsequent evaluation is to combine two of them into one and compare them with the remaining one (MIA represents the comparison between MIA and IAC & AIS, AIS represents the comparison between AIS and IAC & MIA, IAC represents the comparison between IAC and MIA & AIS).

Figure 4.

The performance of random forest model based on different feature sets on testing dataset and verification dataset. The yellow line represents the performance of the model in MIA compared with AIS&IAC. The blue line represents the performance of the model in AIS compared with MIA&IAC. The red line represents the performance of the model in IAC compared with MIA&AIS.

00172_PSISDG13270_132700C_page_7_1.jpg

For the testing dataset, the AUC scores of model 1, model 2, and model 3 were 0.974, 0.483, and 0.835, respectively. The sensitivity values of model 1, model 2, and model 3 at the optimum critical point were 0.873(MIA:0.967, AIS:0.825, IAC:0.826), 0.814(MIA:0.866, AIS:0.750, IAC:0.826), and 0.641(MIA:0.902, AIS:0.378, IAC:0.642). The specificity values of model 1, model 2, and model 3 at the optimum critical point were 0.936(MIA:0.837, AIS:0.994, IAC:0.976), 0.897(MIA:0.791, AIS:0.965, IAC:0.934), and 0.829(MIA:0.589, AIS:0.972, IAC:0.925).

For validation dataset, the AUC scores of model 1, model 2, and model 3 were 0.964, 0.915, and 0.841, respectively. The sensitivity values of model 1, model 2, and model 3 at the optimum critical point were 0.867(MIA:0.940, AIS:0.837, IAC:0.824), 0.825(MIA:0.880, AIS:0.756, IAC:0.838), and 0.648(MIA:0.927, AIS:0.245, IAC:0.772). The specificity values of model 1, model 2, and model 3 at the optimum critical point were 0.932(MIA:0.855, AIS:0.986, IAC:0.955), 0.906(MIA:0.803, AIS:0.949, IAC:0.965), and 0.798(MIA:0.521, AIS:0.986, IAC:0.886). The performance of the random forest models based on the three feature sets are shown in the Table 3.

Table 3.

The performance of random forest models in testing and validation datasets

 sensitivityprecisionspecificityf1
testingvalidationtestingvalidationtestingvalidationtestingvalidation
 MIA0.9670.940.8980.8920.8370.8550.9320.916
Model1AIS0.8250.8370.9710.9320.9940.9860.8910.882
 IAC0.8260.8240.9050.8620.9760.9550.8640.842
 MIA0.8660.8800.8590.8520.7910.8030.8630.866
Model2AIS0.7500.7560.8330.7710.9650.9490.7890.763
 IAC0.8260.8380.7760.8910.9340.9650.8000.864
 MIA0.9020.9270.7500.7130.5890.5210.8190.806
Model3AIS0.3780.2450.7370.8000.9720.9860.5000.375
 IAC0.6420.7720.7390.6470.9250.8860.6870.704

At the same time, SVM, Logistic Regression and C5.0 algorithm used the same training dataset and training feature set to train the model. Evaluate the performance of the model on the testing dataset. For SVM model, the AUC score was 0.734. The sensitivity values were MIA:0.921, AIS:0.750, IAC:0.348. The specificity values were MIA:0.547, AIS:0.959, IAC:0.976. For Logistic Regression model, the AUC score was 0.780. The sensitivity values were MIA:0.850, AIS:0.750, IAC:0.587. The specificity values were MIA:0.709, AIS:0.971, IAC:0.892. For C5.0 model, the AUC score was 0.700. The sensitivity values were MIA:0.843, AIS:0.475, IAC:0.522. The specificity values were MIA:0.558, AIS:0.965, IAC:0.886. The details are shown in the table 4.

Table 4.

The performance of validation models in testing datasets

  sensitivityprecisionspecificityf1
 MIA0.9210.750.5470.827
SVMAIS0.750.8110.9590.779
 IAC0.3480.80.9760.485
 MIA0.850.8120.7090.831
LOGISTICAIS0.750.8570.9710.8
 IAC0.5870.60.8920.593
 MIA0.8430.7380.5580.787
C5.0AIS0.4750.760.9650.585
 IAC0.5220.5580.8860.539
 MIA0.9670.8980.8370.932
Model1AIS0.8250.9710.9940.891
 IAC0.8260.9050.9760.864

3.

DISCUSSION

This study demonstrates the potential of using a random forest-based classification model incorporating radiomics and radiographic features for the differentiation of pulmonary adenocarcinoma subtypes. The key findings indicate that the combination of radiomic features with traditional radiographic characteristics significantly enhances the accuracy of lung adenocarcinoma classification compared to using either feature set alone.

The random forest model achieved high AUC scores for both training and validation datasets, highlighting its robustness and reliability. The model’s ability to effectively integrate radiomic features, which capture complex imaging patterns not visible to the naked eye, with conventional radiographic features, underscores the importance of a multifaceted approach in medical imaging analysis. This combined feature set resulted in a more comprehensive representation of the lesions, improving the model’s diagnostic performance.

Radiomics offers a transformative approach by converting medical images into high-dimensional data that reflect underlying tumor biology. The study’s findings support the utility of radiomics in clinical settings, providing a non-invasive and reproducible method for lesion characterization. The enhanced ability to classify subtypes of pulmonary adenocarcinoma can lead to more personalized treatment strategies, potentially improving patient outcomes.

Despite the promising results, several limitations were noted. The study was conducted at a single center with a relatively small sample size, which may limit the generalizability of the findings. Potential biases in the dataset due to retrospective data collection and selection criteria may affect the model’s performance. Additionally, the semi-automatic segmentation method, while effective, still involves manual steps that could introduce variability. The generalizability of the model to different populations or imaging protocols remains uncertain due to the specific CT imaging protocols used in this study. Future research should aim to develop fully automated segmentation techniques to further standardize the analysis and include multi-center data to validate the model across diverse patient cohorts and imaging conditions.

Furthermore, the study’s feature selection process, while rigorous, still resulted in a large number of features that require manual assessment by clinicians. Incorporating additional clinical data and expanding the radiomics feature set could enhance the model’s accuracy and clinical applicability.

To address these limitations and improve the model’s utility, future studies should focus on multi-center collaborations to validate the findings across diverse populations and imaging equipment. Increasing the dataset size and diversity would enhance the model’s robustness and applicability in different clinical scenarios. Additionally, integrating clinical data such as patient history and genetic markers could provide a more holistic approach to lung adenocarcinoma classification. This study highlights the significant potential of radiomics and machine learning in enhancing lung cancer diagnostics. The developed model represents a step forward in the integration of advanced imaging techniques with artificial intelligence, paving the way for more precise and personalized cancer care. Further research and technological advancements will be crucial in fully realizing the benefits of this innovative approach in clinical practice.

4.

CONCLUSIONS

In conclusion, we developed an automatic pulmonary adenocarcinoma grading model utilizing the most significant radiomics features through a random forest method. This model demonstrates potential in aiding rapid clinical diagnosis and triage. Future research should focus on extracting additional radiomics parameters and incorporating patient clinical characteristics to enhance model accuracy.

ETHICS DECLARATIONS

Ethics approval and consent to participate. This article does not contain any studies with human participants or animals performed by any of the authors. Ethical approval for this study was obtained from Shanghai Public Health Clinical Center.

REFERENCES

[1] 

McGuire, and S., “Geneva, Switzerland: World Health Organization, International Agency for Research on Cancer, WHO Press, 2015,” Advances in Nutrition An International Review Journal, 7.2 (2016), 418 –419 Google Scholar

[2] 

El Kinany K, Huybrechts I, Kampman E, “Concordance with the World Cancer Research Fund/American Institute for Cancer Research recommendations for cancer prevention and colorectal cancer risk in Morocco: A large, population-based case-control study,” Int J Cancer, 145 (7), 1829 –1837 (2019). https://doi.org/10.1002/ijc.v145.7 Google Scholar

[3] 

Donahue JM, Morse CR, Wigle DA, “Oncologic efficacy of anatomic segmentectomy in stage IA lung cancer patients with T1a tumors,” Ann Thorac Surg, 93 (2), 381387 –387388 (2012). https://doi.org/10.1016/j.athoracsur.2011.10.079 Google Scholar

[4] 

Travis WD, Brambilla E, Noguchi M, “International Association for the Study of Lung Cancer/American Thoracic Society/European Respiratory Society: international multidisciplinary classification of lung adenocarcinoma: executive summary,” Proc Am Thorac Soc, 8 (5), 381 –385 (2011). https://doi.org/10.1513/pats.201107-042ST Google Scholar

[5] 

He J, Hu Y, Hu M, Li B., “Development of PD-1/PD-L1 Pathway in Tumor Immune Microenvironment and Treatment for Non-Small Cell Lung Cancer,” Sci Rep, 5 13110 (2015). https://doi.org/10.1038/srep13110 Google Scholar

[6] 

Sun Z, Ng K.H, & Sarji S.A., “Is utilisation of computed tomography justified in clinical practice? Part IV: applications of paediatric computed tomography,” Singapore medical journal, 51 (6), 457 –463 (2010). Google Scholar

[7] 

Alahmari SS, Cherezov D, Goldgof D, Hall L, Gillies RJ, Schabath MB, “Delta Radiomics Improves Pulmonary Nodule Malignancy Prediction in Lung Cancer Screening,” IEEE Access, 6 77796 –77806 (2018). https://doi.org/10.1109/Access.6287639 Google Scholar

[8] 

Choi W, Nadeem S, Alam SR, Deasy JO, Tannenbaum A, Lu W., “Reproducible and Interpretable Spiculation Quantification for Lung Cancer Screening,” Comput Methods Programs Biomed, 2020 105839 Google Scholar

[9] 

Balagurunathan Y, Schabath MB, Wang H, Liu Y, Gillies RJ., “Quantitative Imaging features Improve Discrimination of Malignancy in Pulmonary nodules,” Sci Rep, 9 (1), 8528 (2019). https://doi.org/10.1038/s41598-019-44562-z Google Scholar

[10] 

Xu Y, Lu L, E LN, “Application of Radiomics in Predicting the Malignancy of Pulmonary Nodules in Different Sizes,” AJR Am J Roentgenol, 213 (6), 1213 –1220 (2019). https://doi.org/10.2214/AJR.19.21490 Google Scholar

[11] 

Yang X, He J, Wang J, “CT-based radiomics signature for differentiating solitary granulomatous nodules from solid lung adenocarcinoma,” Lung Cancer, 125 109 –114 (2018). https://doi.org/10.1016/j.lungcan.2018.09.013 Google Scholar

[12] 

Ortiz-Ramon R, Larroza A, Arana E, Moratal D., “A radiomics evaluation of 2D and 3D MRI texture features to classify brain metastases from lung cancer and melanoma,” Annu Int Conf IEEE Eng Med Biol Soc, 2017 493 –496 (2017). Google Scholar

[13] 

Kalpathy-Cramer J, Mamomov A, Zhao B, “Radiomics of Lung Nodules: A Multi-Institutional Study of Robustness and Agreement of Quantitative Imaging Features,” Tomography, 2 (4), 430 –437 (2016). https://doi.org/10.18383/j.tom.2016.00235 Google Scholar

[14] 

Zhao W, Zhang W, Sun Y, “Convolution kernel and iterative reconstruction affect the diagnostic performance of radiomics and deep learning in lung adenocarcinoma pathological subtypes,” Thorac Cancer, 10 (10), 1893 –1903 (2019). https://doi.org/10.1111/tca.v10.10 Google Scholar

[15] 

Aerts HJ, Velazquez ER, Leijenaar RT, “Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach,” Nat Commun, 5 4006 (2014). https://doi.org/10.1038/ncomms5006 Google Scholar

[16] 

Constanzo J, Wei L, Tseng HH, El Naqa I., “Radiomics in precision medicine for lung cancer,” Transl Lung Cancer Res, 6 (6), 635 –647 (2017). https://doi.org/10.21037/tlcr Google Scholar

[17] 

Kalra M, Wang G, Orton CG., “Radiomics in lung cancer: Its time is here,” Med Phys, 45 (3), 997 –1000 (2018). https://doi.org/10.1002/mp.2018.45.issue-3 Google Scholar

[18] 

Shen C, Liu Z, Guan M, “2D and 3D CT Radiomics Features Prognostic Performance Comparison in Non-Small Cell Lung Cancer,” Transl Oncol, 10 (6), 886 –894 (2017). https://doi.org/10.1016/j.tranon.2017.08.007 Google Scholar

[19] 

Wilson R, Devaraj A., “Radiomics of pulmonary nodules and lung cancer,” Transl Lung Cancer Res, 6 (1), 86 –91 (2017). https://doi.org/10.21037/tlcr Google Scholar

[20] 

Gillies RJK, Paul E, Hricak H., “Radiomics: Images Are More than Pictures, They Are Data,” Radiology, 278 (2), 563 –77 (2016). https://doi.org/10.1148/radiol.2015151169 Google Scholar

[21] 

Nardone V, Boldrini L, Grassi R, Franceschini D, & Desideri I., “Radiomics in the setting of neoadjuvant radiotherapy: a new approach for tailored treatment,” Cancers, 13 (14), 3590 (2021). https://doi.org/10.3390/cancers13143590 Google Scholar

[22] 

Dos Santos D, Dietzel M, Baessler B., “A decade of radiomics research: are images really data or just patterns in the noise?,” Eur Radiol, 31 (1), 1 –4 (2021). https://doi.org/10.1007/s00330-020-07108-w Google Scholar

[23] 

Bae JM, Jeong JY, Lee HY, “Pathologic stratification of operable lung adenocarcinoma using radiomics features extracted from dual energy CT images,” Oncotarget, 8 (1), 523 –535 (2017). https://doi.org/10.18632/oncotarget.v8i1 Google Scholar

[24] 

Thawani R, McLane M, Beig N, “Radiomics and radiogenomics in lung cancer: A review for the clinician,” Lung Cancer, 115 34 –41 (2018). https://doi.org/10.1016/j.lungcan.2017.10.015 Google Scholar

[25] 

Yang F, Simpson G, Young L, Ford J, Dogan N, Wang L., “Impact of contouring variability on oncological PET radiomics features in the lung,” Sci Rep, 10 (1), 369 (2020). https://doi.org/10.1038/s41598-019-57171-7 Google Scholar

[26] 

Ren H, Zhou L, Liu G, “An unsupervised semi-automated pulmonary nodule segmentation method based on enhanced region growing,” Quant Imaging Med Surg, 10 (1), 233 –242 (2020). https://doi.org/10.21037/qims Google Scholar

[27] 

Griethuysen JJM, Fedorov A, Parmar C, “Computational Radiomics System to Decode the Radiographic Phenotype,” Cancer Res, 77 (21), e104 –e107 (2017). https://doi.org/10.1158/0008-5472.CAN-17-0339 Google Scholar

[28] 

Svetnik V, Liaw A, Tong C, Culberson JC, Sheridan RP, Feuston BP., “Random forest: a classification and regression tool for compound classification and QSAR modeling,” J Chem Inf Comput Sci, 43 (6), 1947 –1958 (2003). https://doi.org/10.1021/ci034160g Google Scholar
© (2024) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
He Ren, Zhengguang Xiao, Yijie Li, Mengting Tu, Qi Sun, Ping Lu, Yejun Cao, Hanqing Chen, and Ping Li "The construction of the model of pulmonary adenocarcinoma classification based on radiomics and random forest", Proc. SPIE 13270, International Conference on Future of Medicine and Biological Information Engineering (MBIE 2024), 132700C (11 September 2024); https://doi.org/10.1117/12.3048034
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Radiomics

Data modeling

Performance modeling

Computed tomography

Random forests

Artificial intelligence

Education and training

Back to Top