Survival analysis of patients with high-grade gliomas based on data mining of imaging variables

AJNR Am J Neuroradiol. 2012 Jun;33(6):1065-71. doi: 10.3174/ajnr.A2939. Epub 2012 Feb 9.

Abstract

Background and purpose: The prediction of prognosis in HGGs is poor in the majority of patients. Our aim was to test whether multivariate prediction models constructed by machine-learning methods provide a more accurate predictor of prognosis in HGGs than histopathologic classification. The prediction of survival was based on DTI and rCBV measurements as an adjunct to conventional imaging.

Materials and methods: The relationship of survival to 55 variables, including clinical parameters (age, sex), categoric or continuous tumor descriptors (eg, tumor location, extent of resection, multifocality, edema), and imaging characteristics in ROIs, was analyzed in a multivariate fashion by using data-mining techniques. A variable selection method was applied to identify the overall most important variables. The analysis was performed on 74 HGGs (18 anaplastic gliomas WHO grades III/IV and 56 GBMs or gliosarcomas WHO grades IV/IV).

Results: Five variables were identified as the most significant, including the extent of resection, mass effect, volume of enhancing tumor, maximum B0 intensity, and mean trace intensity in the nonenhancing/edematous region. These variables were used to construct a prediction model based on a J48 classification tree. The average classification accuracy, assessed by cross-validation, was 85.1%. Kaplan-Meier survival curves showed that the constructed prediction model classified malignant gliomas in a manner that better correlates with clinical outcome than standard histopathology.

Conclusions: Prediction models based on data-mining algorithms can provide a more accurate predictor of prognosis in malignant gliomas than histopathologic classification alone.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Adult
  • Aged
  • Aged, 80 and over
  • Algorithms*
  • Artificial Intelligence
  • Brain Neoplasms / mortality*
  • Brain Neoplasms / pathology
  • Data Mining*
  • Databases, Factual
  • Decision Support Systems, Clinical*
  • Female
  • Glioma / mortality*
  • Glioma / pathology
  • Humans
  • Image Interpretation, Computer-Assisted / methods
  • Magnetic Resonance Imaging / methods*
  • Male
  • Middle Aged
  • Pattern Recognition, Automated / methods
  • Pennsylvania / epidemiology
  • Prevalence
  • Proportional Hazards Models
  • Reproducibility of Results
  • Risk Factors
  • Sensitivity and Specificity
  • Survival Analysis
  • Survival Rate