A reformulation of pLSA for uncertainty estimation and hypothesis testing in bio-imaging

Bioinformatics. 2020 Jul 1;36(13):4080-4087. doi: 10.1093/bioinformatics/btaa270.

Abstract

Motivation: Probabilistic latent semantic analysis (pLSA) is commonly applied to describe mass spectra (MS) images. However, the method does not provide certain outputs necessary for the quantitative scientific interpretation of data. In particular, it lacks assessment of statistical uncertainty and the ability to perform hypothesis testing. We show how linear Poisson modelling advances pLSA, giving covariances on model parameters and supporting χ2 testing for the presence/absence of MS signal components. As an example, this is useful for the identification of pathology in MALDI biological samples. We also show potential wider applicability, beyond MS, using magnetic resonance imaging (MRI) data from colorectal xenograft models.

Results: Simulations and MALDI spectra of a stroke-damaged rat brain show MS signals from pathological tissue can be quantified. MRI diffusion data of control and radiotherapy-treated tumours further show high sensitivity hypothesis testing for treatment effects. Successful χ2 and degrees-of-freedom are computed, allowing null-hypothesis thresholding at high levels of confidence.

Availability and implementation: Open-source image analysis software available from TINA Vision, www.tina-vision.net.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Diffusion
  • Image Processing, Computer-Assisted*
  • Latent Class Analysis
  • Rats
  • Software*
  • Uncertainty