Supervised learning and prediction of spatial epidemics

Spat Spatiotemporal Epidemiol. 2014 Oct:11:59-77. doi: 10.1016/j.sste.2014.08.003. Epub 2014 Sep 16.

Abstract

Parameter estimation for mechanistic models of infectious disease can be computationally intensive. Nsoesie et al. (2011) introduced an approach for inference on infectious disease data based on the idea of supervised learning. Their method involves simulating epidemics from various infectious disease models, and using classifiers built from the epidemic curve data to predict which model were most likely to have generated observed epidemic curves. They showed that the classification approach could fairly identify underlying characteristics of the disease system, without fitting various transmission models via, say, Bayesian Markov chain Monte Carlo. We extend this work to the case where the underlying infectious disease model is inherently spatial. Our goal is to compare the use of global epidemic curves for building the classifier, with the use of spatially stratified epidemic curves. We demonstrate these methods on simulated data and apply the method to analyze a tomato spotted wilt virus epidemic dataset.

Keywords: Random forests; Spatial epidemic; Spatial stratification; Supervised learning.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem*
  • Communicable Diseases / epidemiology*
  • Computer Simulation
  • Epidemics / statistics & numerical data*
  • Humans
  • Markov Chains
  • Models, Theoretical*
  • Monte Carlo Method
  • Regression Analysis
  • Reproducibility of Results
  • Spatial Analysis*
  • Tospovirus