Feature Engineering for Interpretable Machine Learning for Quality Assurance in Radiation Oncology

Stud Health Technol Inform. 2022 Jun 6:290:460-464. doi: 10.3233/SHTI220118.

Abstract

Chart checking is a time intensive process with high cognitive workload for physicists. Previous studies have partially automated and standardized chart checking, but limited studies implement data-driven approaches to reduce cognitive workload for quality assurance processes. This study aims to evaluate feature selection methods to improve the interpretability and transparency of machine learning models in predicting the degree of difficulty for a pretreatment physics chart check. We compare chi-square, mutual information, feature importance thresholding, and greedy feature selection for four different classifiers. Random forest has the highest performance with SMOTE oversampling using mutual information for feature selection (accuracy 84.0%, AUC 87.0%, precision 80.0%, recall 80.0%). This study demonstrates that feature selection methods can improve model interpretability and transparency.

Keywords: Machine learning; quality assurance; radiation oncology.

MeSH terms

  • Engineering
  • Machine Learning
  • Radiation Oncology*