Using a convolutional neural network for classification of squamous and non-squamous non-small cell lung cancer based on diagnostic histopathology HES images

Sci Rep. 2021 Dec 13;11(1):23912. doi: 10.1038/s41598-021-03206-x.

Abstract

Histological stratification in metastatic non-small cell lung cancer (NSCLC) is essential to properly guide therapy. Morphological evaluation remains the basis for subtyping and is completed by additional immunohistochemistry labelling to confirm the diagnosis, which delays molecular analysis and utilises precious sample. Therefore, we tested the capacity of convolutional neural networks (CNNs) to classify NSCLC based on pathologic HES diagnostic biopsies. The model was estimated with a learning cohort of 132 NSCLC patients and validated on an external validation cohort of 65 NSCLC patients. Based on image patches, a CNN using InceptionV3 architecture was trained and optimized to classify NSCLC between squamous and non-squamous subtypes. Accuracies of 0.99, 0.87, 0.85, 0.85 was reached in the training, validation and test sets and in the external validation cohort. At the patient level, the CNN model showed a capacity to predict the tumour histology with accuracy of 0.73 and 0.78 in the learning and external validation cohorts respectively. Selecting tumour area using virtual tissue micro-array improved prediction, with accuracy of 0.82 in the external validation cohort. This study underlines the capacity of CNN to predict NSCLC subtype with good accuracy and to be applied to small pathologic samples without annotation.

MeSH terms

  • Carcinoma, Non-Small-Cell Lung / classification
  • Carcinoma, Non-Small-Cell Lung / pathology*
  • Carcinoma, Squamous Cell / classification
  • Carcinoma, Squamous Cell / pathology*
  • Humans
  • Image Interpretation, Computer-Assisted / methods*
  • Image Interpretation, Computer-Assisted / standards
  • Immunohistochemistry / methods
  • Machine Learning / standards*
  • Sensitivity and Specificity
  • Software / standards