Foundational Segmentation Models and Clinical Data Mining Enable Accurate Computer Vision for Lung Cancer

Nathaniel C Swinburne; Christopher B Jackson; Andrew M Pagano; Joseph N Stember; Javin Schefflein; Brett Marinelli; Prashanth Kumar Panyam; Arthur Autz; Mohapar S Chopra; Andrei I Holodny; Michelle S Ginsberg

doi:10.1007/s10278-024-01304-6

Foundational Segmentation Models and Clinical Data Mining Enable Accurate Computer Vision for Lung Cancer

J Imaging Inform Med. 2024 Oct 22. doi: 10.1007/s10278-024-01304-6. Online ahead of print.

Affiliations

¹ Department of Radiology, Memorial Sloan Kettering Cancer Center, 1275 York Ave, New York, NY, 10065, USA. [email protected].
² Department of Radiation Oncology, Memorial Sloan Kettering Cancer Center, 1275 York Ave, New York, NY, 10065, USA.
³ Department of Radiology, Memorial Sloan Kettering Cancer Center, 1275 York Ave, New York, NY, 10065, USA.

PMID: 39438365
DOI: 10.1007/s10278-024-01304-6

Abstract

This study aims to assess the effectiveness of integrating Segment Anything Model (SAM) and its variant MedSAM into the automated mining, object detection, and segmentation (MODS) methodology for developing robust lung cancer detection and segmentation models without post hoc labeling of training images. In a retrospective analysis, 10,000 chest computed tomography scans from patients with lung cancer were mined. Line measurement annotations were converted to bounding boxes, excluding boxes < 1 cm or > 7 cm. The You Only Look Once object detection architecture was used for teacher-student learning to label unannotated lesions on the training images. Subsequently, a final tumor detection model was trained and employed with SAM and MedSAM for tumor segmentation. Model performance was assessed on a manually annotated test dataset, with additional evaluations conducted on an external lung cancer dataset before and after detection model fine-tuning. Bootstrap resampling was used to calculate 95% confidence intervals. Data mining yielded 10,789 line annotations, resulting in 5403 training boxes. The baseline detection model achieved an internal F1 score of 0.847, improving to 0.860 after self-labeling. Tumor segmentation using the final detection model attained internal Dice similarity coefficients (DSCs) of 0.842 (SAM) and 0.822 (MedSAM). After fine-tuning, external validation showed an F1 of 0.832 and DSCs of 0.802 (SAM) and 0.804 (MedSAM). Integrating foundational segmentation models into the MODS framework results in high-performing lung cancer detection and segmentation models using only mined clinical data. Both SAM and MedSAM hold promise as foundational segmentation models for radiology images.

Keywords: Artificial intelligence; Computed tomography; Data mining; Lung neoplasms.

Grants and funding

Cancer Center Support Grant P30 CA008748/CA/NCI NIH HHS/United States