Confound modelling in UK Biobank brain imaging

Neuroimage. 2021 Jan 1:224:117002. doi: 10.1016/j.neuroimage.2020.117002. Epub 2020 Jun 2.

Abstract

Dealing with confounds is an essential step in large cohort studies to address problems such as unexplained variance and spurious correlations. UK Biobank is a powerful resource for studying associations between imaging and non-imaging measures such as lifestyle factors and health outcomes, in part because of the large subject numbers. However, the resulting high statistical power also raises the sensitivity to confound effects, which therefore have to be carefully considered. In this work we describe a set of possible confounds (including non-linear effects and interactions that researchers may wish to consider for their studies using such data). We include descriptions of how we can estimate the confounds, and study the extent to which each of these confounds affects the data, and the spurious correlations that may arise if they are not controlled. Finally, we discuss several issues that future studies should consider when dealing with confounds.

Keywords: Big data imaging; Confounds; Data modelling; Epidemiological studies; Image analysis; Machine learning; Multi-modal data integration; Statistica l modelling.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biological Specimen Banks*
  • Brain*
  • Electronic Data Processing
  • Head
  • Humans
  • Neuroimaging* / methods
  • Time Factors
  • United Kingdom