Transcription factors (TFs) define cellular identity either by activating target cell program or by silencing donor program as demonstrated by intensive cell reprogramming studies. Here, we propose an extended minimum set cover model with stable selection (3Scover) to systematically identify silencing TFs, named safeguard TFs, from omics data. First, a cell type-TF specificity network is constructed to systematically link cell types with their specifically expressed TFs. Then we search the minimum TF set to cover this network with "many but one specificity" characteristic and integrate many subsampling models for a stable solution. 3Scover identified 30 safeguard TFs in human and mouse. These safeguard TFs are significantly enriched in the experimentally discovered reprogramming panel with their protein-protein interactors. In addition, they tend to interact closely with chromatin regulators, negatively regulate transcription, and function earlier in development. Collectively, 3Scover allows us to probe master TFs and combinatorial regulation in controlling cell identity.
Keywords: Bioinformatics; Systems Biology; Transcriptomics.
Copyright © 2020 The Author(s). Published by Elsevier Inc. All rights reserved.