Data Augmentation and Transfer Learning to Improve Generalizability of an Automated Prostate Segmentation Model

Thomas H Sanford; Ling Zhang; Stephanie A Harmon; Jonathan Sackett; Dong Yang; Holger Roth; Ziyue Xu; Deepak Kesani; Sherif Mehralivand; Ronaldo H Baroni; Tristan Barrett; Rossano Girometti; Aytekin Oto; Andrei S Purysko; Sheng Xu; Peter A Pinto; Daguang Xu; Bradford J Wood; Peter L Choyke; Baris Turkbey

doi:10.2214/AJR.19.22347

Data Augmentation and Transfer Learning to Improve Generalizability of an Automated Prostate Segmentation Model

AJR Am J Roentgenol. 2020 Dec;215(6):1403-1410. doi: 10.2214/AJR.19.22347. Epub 2020 Oct 14.

Authors

Thomas H Sanford¹, Ling Zhang², Stephanie A Harmon^{1

3}, Jonathan Sackett¹, Dong Yang², Holger Roth², Ziyue Xu², Deepak Kesani¹, Sherif Mehralivand¹, Ronaldo H Baroni⁴, Tristan Barrett⁵, Rossano Girometti⁶, Aytekin Oto⁷, Andrei S Purysko⁸, Sheng Xu¹, Peter A Pinto¹, Daguang Xu², Bradford J Wood¹, Peter L Choyke¹, Baris Turkbey¹

Affiliations

¹ Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bldg 10, Rm B3B85, Bethesda MD 20892.
² NVIDIA Corporation, Bethesda, MD.
³ Clinical Research Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD.
⁴ Diagnostic Imaging Department, Albert Einstein Hospital, Sao Paulo, Brazil.
⁵ University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom.
⁶ Department of Radiology, University of Udine, Udine, Italy.
⁷ Department of Radiology, University of Chicago, Chicago, IL.
⁸ Department of Radiology, Cleveland Clinic, Cleveland, OH.

Abstract

OBJECTIVE. Deep learning applications in radiology often suffer from overfitting, limiting generalization to external centers. The objective of this study was to develop a high-quality prostate segmentation model capable of maintaining a high degree of performance across multiple independent datasets using transfer learning and data augmentation. MATERIALS AND METHODS. A retrospective cohort of 648 patients who underwent prostate MRI between February 2015 and November 2018 at a single center was used for training and validation. A deep learning approach combining 2D and 3D architecture was used for training, which incorporated transfer learning. A data augmentation strategy was used that was specific to the deformations, intensity, and alterations in image quality seen on radiology images. Five independent datasets, four of which were from outside centers, were used for testing, which was conducted with and without fine-tuning of the original model. The Dice similarity coefficient was used to evaluate model performance. RESULTS. When prostate segmentation models utilizing transfer learning were applied to the internal validation cohort, the mean Dice similarity coefficient was 93.1 for whole prostate and 89.0 for transition zone segmentations. When the models were applied to multiple test set cohorts, the improvement in performance achieved using data augmentation alone was 2.2% for the whole prostate models and 3.0% for the transition zone segmentation models. However, the best test-set results were obtained with models fine-tuned on test center data with mean Dice similarity coefficients of 91.5 for whole prostate segmentation and 89.7 for transition zone segmentation. CONCLUSION. Transfer learning allowed for the development of a high-performing prostate segmentation model, and data augmentation and fine-tuning approaches improved performance of a prostate segmentation model when applied to datasets from external centers.

Keywords: artificial intelligence; prostate MRI; segmentation.

Publication types

Research Support, N.I.H., Extramural
Research Support, N.I.H., Intramural

MeSH terms

Datasets as Topic
Deep Learning
Humans
Magnetic Resonance Imaging*
Male
Middle Aged
Pattern Recognition, Automated*
Prostatic Neoplasms / diagnostic imaging*
Retrospective Studies

Abstract

Publication types

MeSH terms

Grants and funding