Application of information theoretical approaches to assess diversity and similarity in single-cell transcriptomics

Comput Struct Biotechnol J. 2020 May 21:18:1830-1837. doi: 10.1016/j.csbj.2020.05.005. eCollection 2020.

Abstract

Single-cell transcriptomics offers a powerful way to reveal the heterogeneity of individual cells. To date, many information theoretical approaches have been proposed to assess diversity and similarity, and characterize the latent heterogeneity in transcriptome data. Diversity implies gene expression variations and can facilitate the identification of signature genes; while, similarity unravels co-expression patterns for cell type clustering. In this review, we summarized 16 measures of information theory used for evaluating diversity and similarity in single-cell transcriptomic data, provide references and shed light on selected theoretical properties when there is a need to select proper measurements in general cases. We further provide an R package assembling discussed approaches to improve the researchers own single-cell transcriptome study. At last, we prospected further applications of diversity and similarity measures in support of depicting heterogeneity in single-cell multi-omics data.

Keywords: Diversity and similarity assessment; Renyi’s divergence; Renyi’s entropy; Single cell sequencing.

Publication types

  • Review