geneBasis: an iterative approach for unsupervised selection of targeted gene panels from scRNA-seq

Genome Biol. 2021 Dec 6;22(1):333. doi: 10.1186/s13059-021-02548-z.

Abstract

scRNA-seq datasets are increasingly used to identify gene panels that can be probed using alternative technologies, such as spatial transcriptomics, where choosing the best subset of genes is vital. Existing methods are limited by a reliance on pre-existing cell type labels or by difficulties in identifying markers of rare cells. We introduce an iterative approach, geneBasis, for selecting an optimal gene panel, where each newly added gene captures the maximum distance between the true manifold and the manifold constructed using the currently selected gene panel. Our approach outperforms existing strategies and can resolve cell types and subtle cell state differences.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Exome Sequencing
  • Gene Expression Profiling
  • Humans
  • RNA-Seq*
  • Sequence Analysis, RNA / methods*
  • Single-Cell Analysis / methods*
  • Transcriptome