Predicting High-Grade Patterns in Stage I Solid Lung Adenocarcinoma: A Study of 371 Patients Using Refined Radiomics and Deep Learning-Guided CatBoost Classifier

Technol Cancer Res Treat. 2024 Jan-Dec:23:15330338241308610. doi: 10.1177/15330338241308610.

Abstract

Introduction: This study aimed to devise a diagnostic algorithm, termed the Refined Radiomics and Deep Learning Features-Guided CatBoost Classifier (RRDLC-Classifier), and evaluate its efficacy in predicting pathological high-grade patterns in patients diagnosed with clinical stage I solid lung adenocarcinoma (LADC).

Methods: In this retrospective study, a total of 371 patients diagnosed with clinical stage I solid LADC were randomly categorized into training and validation sets in a 7:3 ratio. Uni- and multivariate logistic regression analyses were performed to examine the imaging findings that can be used to predict pathological high-grade patterns meticulously. Employing redundancy and the least absolute shrinkage and selection operator regression, a radiomics model was developed. Subsequently, radiomics refinement and deep learning features were employed using a machine learning algorithm to construct the RRDLC-Classifier, which aims to predict high-grade patterns in clinical stage I solid LADC. Evaluation metrics, such as receiver operating characteristic curves, areas under the curve (AUCs), accuracy, sensitivity, and specificity, were computed for assessment.

Results: The RRDLC-Classifier attained the highest AUC of 0.838 (95% confidence interval [CI]: 0.766-0.911) in predicting high-grade patterns in clinical stage I solid LADC, followed by radiomics with an AUC of 0.779 (95% CI: 0.675-0.883), and imaging findings with an AUC of 0.6 (95% CI: 0.472-0.726).

Conclusions: This study introduces the RRDLC-Classifier, a novel diagnostic algorithm that amalgamates refined radiomics and deep learning features to predict high-grade patterns in clinical stage I solid LADC. This algorithm may exhibit excellent diagnostic performance, which can facilitate its application in precision medicine.

Keywords: catboost classifier; clinical stage i solid; deep learning; high-grade patterns; lung adenocarcinomas; radiomics.

MeSH terms

  • Adenocarcinoma of Lung* / diagnostic imaging
  • Adenocarcinoma of Lung* / pathology
  • Adult
  • Aged
  • Algorithms
  • Deep Learning*
  • Female
  • Humans
  • Lung Neoplasms* / diagnostic imaging
  • Lung Neoplasms* / pathology
  • Male
  • Middle Aged
  • Neoplasm Grading
  • Neoplasm Staging*
  • Prognosis
  • ROC Curve
  • Radiomics
  • Retrospective Studies
  • Tomography, X-Ray Computed / methods