Although allogeneic hematopoietic stem cell transplantation (allo-HSCT) is a curative therapy for high-risk acute leukemia (AL), some patients still relapse. Since patients simultaneously have many prognostic factors, difficulties are associated with the construction of a patient-based prediction algorithm of relapse. The alternating decision tree (ADTree) is a successful classification method that combines decision trees with the predictive accuracy of boosting. It is a component of machine learning (ML) and has the capacity to simultaneously analyze multiple factors. Using ADTree, we attempted to construct a prediction model of leukemia relapse within 1 year of transplantation. With the model of training data (n = 148), prediction accuracy, the AUC of ROC, and the κ-statistic value were 78.4%, 0.746, and 0.508, respectively. The false positive rate (FPR) of the relapse prediction was as low as 0.134. In an evaluation of the model with validation data (n = 69), prediction accuracy, AUC, and FPR of the relapse prediction were similar at 71.0%, 0.667, and 0.216, respectively. These results suggest that the model is generalized and highly accurate. Furthermore, the output of ADTree may visualize the branch point of treatment. For example, the selection of donor types resulted in different relapse predictions. Therefore, clinicians may change treatment options by referring to the model, thereby improving outcomes. The present results indicate that ML, such as ADTree, will contribute to the decision-making process in the diversified allo-HSCT field and be useful for preventing the relapse of leukemia.
Keywords: acute leukemia; allogeneic hematopoietic stem cell transplantation; machine learning; patient-based prediction; relapse posttransplantation.
© 2019 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.