Computational surprisal analysis speeds-up genomic characterization of cancer processes

PLoS One. 2014 Nov 18;9(11):e108549. doi: 10.1371/journal.pone.0108549. eCollection 2014.

Abstract

Surprisal analysis is increasingly being applied for the examination of transcription levels in cellular processes, towards revealing inner network structures and predicting response. But to achieve its full potential, surprisal analysis should be integrated into a wider range computational tool. The purposes of this paper are to combine surprisal analysis with other important computation procedures, such as easy manipulation of the analysis results--e.g. to choose desirable result sub-sets for further inspection--, retrieval and comparison with relevant datasets from public databases, and flexible graphical displays for heuristic thinking. The whole set of computation procedures integrated into a single practical tool is what we call Computational Surprisal Analysis. This combined kind of analysis should facilitate significantly quantitative understanding of different cellular processes for researchers, including applications in proteomics and metabolomics. Beyond that, our vision is that Computational Surprisal Analysis has the potential to reach the status of a routine method of analysis for practitioners. The resolving power of Computational Surprisal Analysis is here demonstrated by its application to a variety of cellular cancer process transcription datasets, ours and from the literature. The results provide a compact biological picture of the thermodynamic significance of the leading gene expression phenotypes in every stage of the disease. For each transcript we characterize both its inherent steady state weight, its correlation with the other transcripts and its variation due to the disease. We present a dedicated website to facilitate the analysis for researchers and practitioners.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Carcinogenesis / genetics*
  • Gene Expression Regulation, Neoplastic
  • Genome, Human*
  • Humans
  • Transcription, Genetic

Grants and funding

This work was supported by an EMBO postdoctoral fellowship to N.K.B. and European Commission FP7 Future and Emerging Technologies–Open Project BAMBI 618024 (to FR and RDL). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.