GOLDBAR: A Framework for Combinatorial Biological Design

ACS Synth Biol. 2024 Sep 20;13(9):2899-2911. doi: 10.1021/acssynbio.4c00296. Epub 2024 Aug 20.

Abstract

With the rise of new DNA part libraries and technologies for assembling DNA, synthetic biologists are increasingly constructing and screening combinatorial libraries to optimize their biological designs. As combinatorial libraries are used to generate data on design performance, new rules for composing biological designs will emerge. Most formal frameworks for combinatorial design, however, do not yet support formal comparison of design composition, which is needed to facilitate automated analysis and machine learning in massive biological design spaces. To address this need, we introduce a combinatorial design framework called GOLDBAR. Compared with existing frameworks, GOLDBAR enables synthetic biologists to intersect and merge the rules for entire classes of biological designs to extract common design motifs and infer new ones. Here, we demonstrate the application of GOLDBAR to refine and validate design spaces for TetR-homologue transcriptional logic circuits, verify the assembly of a partial nif gene cluster, and infer novel gene clusters for the biosynthesis of rebeccamycin. We also discuss how GOLDBAR could be used to facilitate grammar-based machine learning in synthetic biology.
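
To make the idea of intersecting design rules concrete, the following is a minimal sketch and not GOLDBAR's own syntax or implementation. It assumes that each design space can be modeled as a small deterministic finite automaton over part types (the keywords mention regular grammars), and it uses the standard product construction to intersect two such automata. The two example spaces, the state names, and the helper functions `intersect` and `enumerate_designs` are hypothetical and chosen only for illustration.

```python
# Illustrative only: design spaces modeled as DFAs over part types.
# A DFA is a tuple (start_state, accepting_states, transitions), where
# transitions maps (state, part_type) -> next_state.

# Hypothetical space A: a promoter, one or more CDSs, then a terminator.
SPACE_A = (
    "s0",
    {"s3"},
    {
        ("s0", "promoter"): "s1",
        ("s1", "cds"): "s2",
        ("s2", "cds"): "s2",
        ("s2", "terminator"): "s3",
    },
)

# Hypothetical space B: a promoter, at most two parts that are each
# an RBS or a CDS, then a terminator.
SPACE_B = (
    "t0",
    {"t4"},
    {
        ("t0", "promoter"): "t1",
        ("t1", "rbs"): "t2", ("t1", "cds"): "t2",
        ("t2", "rbs"): "t3", ("t2", "cds"): "t3",
        ("t1", "terminator"): "t4",
        ("t2", "terminator"): "t4",
        ("t3", "terminator"): "t4",
    },
)


def intersect(a, b):
    """Product construction: accepts exactly the designs in both spaces."""
    (start_a, accept_a, trans_a), (start_b, accept_b, trans_b) = a, b
    trans = {}
    for (p, part_a), p_next in trans_a.items():
        for (q, part_b), q_next in trans_b.items():
            if part_a == part_b:
                trans[((p, q), part_a)] = (p_next, q_next)
    accepting = {(p, q) for p in accept_a for q in accept_b}
    return (start_a, start_b), accepting, trans


def enumerate_designs(dfa, max_len):
    """List every accepted design with at most max_len parts."""
    start, accepting, trans = dfa
    designs = []
    frontier = [(start, ())]
    for _ in range(max_len + 1):
        next_frontier = []
        for state, path in frontier:
            if state in accepting:
                designs.append(path)
            for (src, part), dst in trans.items():
                if src == state:
                    next_frontier.append((dst, path + (part,)))
        frontier = next_frontier
    return sorted(designs)


common = enumerate_designs(intersect(SPACE_A, SPACE_B), max_len=4)
for design in common:
    print(" -> ".join(design))
```

Under these assumptions, the designs shared by both spaces are those with a promoter, one or two CDSs, and a terminator. Only intersection is sketched here; the merge operation mentioned in the abstract is not modeled.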

Keywords: biological design; combinatorial engineering; design automation; genetic design; machine learning; regular grammar.

MeSH terms

  • DNA / chemistry
  • DNA / genetics
  • Gene Library
  • Gene Regulatory Networks
  • Machine Learning
  • Multigene Family
  • Synthetic Biology* / methods

Substances

  • DNA