The purpose of this study is to demonstrate that a Bayesian network (BN) approach can explore hierarchical biophysical relationships that influence tumor response and predict tumor local control (LC) in non-small-cell lung cancer (NSCLC) patients before and during radiotherapy from a large-scale dataset. Our BN building approach has two steps. First, relevant biophysical predictors influencing LC before and during the treatment are selected through an extended Markov blanket (eMB) method. From this eMB process, the most robust BN structure for LC prediction was found via a wrapper-based approach. Sixty-eight patients with complete feature information were used to identify a full BN model for LC prediction before and during the treatment. Fifty more recent patients with some missing information were reserved for independent testing of the developed pre- and during-therapy BNs. A nested cross-validation (N-CV) was developed to evaluate the performance of the two-step BN approach. An ensemble BN model is generated from the N-CV sampling process to assess its similarity with the corresponding full BN model, and thus evaluate the sensitivity of our BN approach. Our results show that the proposed BN development approach is a stable and robust approach to identify hierarchical relationships among biophysical features for LC prediction. Furthermore, BN predictions can be improved by incorporating during treatment information.
Keywords: Bayesian networks; large-scale model building; local control prediction; non–small-cell lung cancer; pan-Omics.