SERGIO: A Single-Cell Expression Simulator Guided by Gene Regulatory Networks

Cell Syst. 2020 Sep 23;11(3):252-271.e11. doi: 10.1016/j.cels.2020.08.003. Epub 2020 Aug 31.

Abstract

A common approach to benchmarking of single-cell transcriptomics tools is to generate synthetic datasets that statistically resemble experimental data. However, most existing single-cell simulators do not incorporate transcription factor-gene regulatory interactions that underlie expression dynamics. Here, we present SERGIO, a simulator of single-cell gene expression data that models the stochastic nature of transcription as well as regulation of genes by multiple transcription factors according to a user-provided gene regulatory network. SERGIO can simulate any number of cell types in steady state or cells differentiating to multiple fates. We show that datasets generated by SERGIO are statistically comparable to experimental data generated by Illumina HiSeq2000, Drop-seq, Illumina 10X chromium, and Smart-seq. We use SERGIO to benchmark several single-cell analysis tools, including GRN inference methods, and identify Tcf7, Gata3, and Bcl11b as key drivers of T cell differentiation by performing in silico knockout experiments. SERGIO is freely available for download here: https://github.com/PayamDiba/SERGIO.

Keywords: RNA velocity; benchmarking single-cell analysis tools; differentiation trajectories; gene regulatory networks; simulations; single-cell RNA-seq.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Gene Regulatory Networks / genetics*
  • Humans
  • Single-Cell Analysis / methods*