Comparative assessment of segmentation algorithms for tumor delineation on a test-retest [(11)C]choline dataset

Med Phys. 2012 Dec;39(12):7571-9. doi: 10.1118/1.4761952.

Abstract

Purpose: Many methods have been proposed for tumor segmentation from positron emission tomography images. Because of the increasingly important role that [(11)C]choline is playing in oncology and because no study has compared segmentation methods on this tracer, the authors assessed several segmentation algorithms on a [(11)C]choline test-retest dataset.

Methods: Fixed and adaptive threshold-based methods, fuzzy C-means (FCM), Canny's edge detection method, the watershed transform, and the fuzzy locally adaptive Bayesian algorithm (FLAB) were used. Test-retest [(11)C]choline scans of nine patients with breast cancer were considered and the percent test-retest variability %VAR(TEST-RETEST) of tumor volume (TV) was employed to assess the results. The same methods were then applied to two denoised datasets generated by applying either a Gaussian filter or the wavelet transform.

Results: The (semi)automated methods FCM, FLAB, and Canny emerged as the best ones in terms of TV reproducibility. For these methods, the %root mean square error %RMSE of %VAR(TEST-RETEST), defined as %RMSE= variance+mean(2), was in the range 10%-21.2%, depending on the dataset and algorithm. Threshold-based methods gave TV estimates which were extremely variable, particularly on the unsmoothed data; their performance improved on the denoised datasets, whereas smoothing did not have a remarkable impact on the (semi)automated methods. TV variability was comparable to that of SUV(MAX) and SUV(MEAN) (range 14.7%-21.9% for %RMSE of %VAR(TEST-RETEST), after the exclusion of one outlier, 40%-43% when the outlier was included).

Conclusions: The TV variability obtained with the best methods was similar to the one reported for TV in previous [(18)F]FDG and [(18)F]FLT studies and to the one of SUV(MAX)∕SUV(MEAN) on the authors' [(11)C]choline dataset. The good reproducibility of [(11)C]choline TV warrants further studies to test whether TV could predict early response to treatment and survival, as for [(18)F]FDG, to complement∕substitute the use of SUV(MAX) and SUV(MEAN).

Publication types

  • Comparative Study
  • Evaluation Study

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Breast Neoplasms / diagnostic imaging*
  • Carbon Radioisotopes
  • Choline*
  • Female
  • Humans
  • Image Enhancement / methods
  • Image Interpretation, Computer-Assisted / methods*
  • Pattern Recognition, Automated / methods*
  • Positron-Emission Tomography / methods*
  • Radiopharmaceuticals
  • Reproducibility of Results
  • Sensitivity and Specificity

Substances

  • Carbon Radioisotopes
  • Radiopharmaceuticals
  • Choline