Parameter estimation for mechanistic models of infectious disease can be computationally intensive. Nsoesie et al. (2011) introduced an approach for inference on infectious disease data based on the idea of supervised learning. Their method involves simulating epidemics from various infectious disease models, and using classifiers built from the epidemic curve data to predict which model were most likely to have generated observed epidemic curves. They showed that the classification approach could fairly identify underlying characteristics of the disease system, without fitting various transmission models via, say, Bayesian Markov chain Monte Carlo. We extend this work to the case where the underlying infectious disease model is inherently spatial. Our goal is to compare the use of global epidemic curves for building the classifier, with the use of spatially stratified epidemic curves. We demonstrate these methods on simulated data and apply the method to analyze a tomato spotted wilt virus epidemic dataset.
Keywords: Random forests; Spatial epidemic; Spatial stratification; Supervised learning.
Copyright © 2014 Elsevier Ltd. All rights reserved.