CRISPR-GEM: A Novel Machine Learning Model for CRISPR Genetic Target Discovery and Evaluation

ACS Synth Biol. 2024 Oct 18;13(10):3413-3429. doi: 10.1021/acssynbio.4c00473. Epub 2024 Oct 7.

Abstract

CRISPR gene editing strategies are shaping cell therapies through precise and tunable control over gene expression. However, limitations in safely delivering high quantities of CRISPR machinery demand careful target gene selection to achieve reliable therapeutic effects. Informed target gene selection requires a thorough understanding of the involvement of target genes in gene regulatory networks (GRNs) and thus their impact on cell phenotype. Effective decoding of these complex networks has been achieved using machine learning models, but current techniques are limited to single cell types and focus mainly on transcription factors, limiting their applicability to CRISPR strategies. To address this, we present CRISPR-GEM, a multilayer perceptron (MLP) based synthetic GRN constructed to accurately predict the downstream effects of CRISPR gene editing. First, input and output nodes are identified as differentially expressed genes between defined experimental and target cell/tissue types, respectively. Then, MLP training learns regulatory relationships in a black-box approach allowing accurate prediction of output gene expression using only input gene expression. Finally, CRISPR-mimetic perturbations are made to each input gene individually, and the resulting model predictions are compared to those for the target group to score and assess each input gene as a CRISPR candidate. The top scoring genes provided by CRISPR-GEM therefore best modulate experimental group GRNs to motivate transcriptomic shifts toward a target group phenotype. This machine learning model is the first of its kind for predicting optimal CRISPR target genes and serves as a powerful tool for enhanced CRISPR strategies across a range of cell therapies.

Keywords: CRISPR gene editing; MSC chondrogenesis; gene regulatory network; machine learning; osteoarthritis; regulatory T cell.

MeSH terms

  • CRISPR-Cas Systems* / genetics
  • Clustered Regularly Interspaced Short Palindromic Repeats / genetics
  • Gene Editing* / methods
  • Gene Regulatory Networks*
  • Humans
  • Machine Learning*