Site-specific genome integration in alphaproteobacteria mediated by TG1 integrase

Appl Microbiol Biotechnol. 2012 Jan;93(1):295-304. doi: 10.1007/s00253-011-3545-3. Epub 2011 Sep 2.

Abstract

Serine-type phage integrases catalyze site-specific recombination between the attachment sites of the phage and host bacterial genomes (attP and attB, respectively), which have relatively short but distinct sequences, without requiring host auxiliary factors. We previously established in vivo and in vitro site-specific recombination systems based on the serine-type integrase of actinophage TG1 and determined that the minimal attP(TG1) and attB(TG1) sites required for the in vitro TG1 integrase reaction are 43 and 39 bp, respectively. Here, we searched DNA databases with the FASTA program, using the authentic attB(TG1) sequence of Streptomyces avermitilis as the query. Candidate attB(TG1) sequences were found in the genomes of bacterial strains belonging to the class Alphaproteobacteria as well as the class Actinobacteria. The sequences with a high similarity score and high sequence identity (an arbitrary cutoff of more than 80% identity) were located within a conserved region of dapC or related genes encoding aminotransferases, and the in vitro TG1 integrase assay confirmed that they are recognized as cognate substrates for the attP(TG1) site. Furthermore, the candidate attB(TG1) site of Rhodospirillum rubrum was shown to serve as a native (endogenous) attachment site for the in vivo TG1-based integration system. These features distinguish TG1 integrase from other serine-type phage integrases and make it an advantageous tool for genome technology in various industrially important bacteria belonging to the class Alphaproteobacteria.
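
The identity screen described above can be illustrated with a minimal sketch. It is not the FASTA program used in the study, which performs gapped, scored alignment. Instead it slides a short query over both strands of a target without gaps and reports the best percent identity, then applies the "more than 80%" cutoff. The function names are hypothetical, and the sequences in the usage example are toy placeholders, not the real attB(TG1) site.

```python
# Ungapped sliding-window identity screen (illustrative only; NOT the FASTA algorithm).

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    table = str.maketrans("ACGTacgt", "TGCAtgca")
    return seq.translate(table)[::-1]


def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions between two equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def best_ungapped_hit(query: str, target: str):
    """Scan both strands of target; return (identity, start, strand) of the best window.

    start is 0-based on the scanned strand; (0.0, -1, "+") if target is too short.
    """
    best = (0.0, -1, "+")
    n = len(query)
    for strand, seq in (("+", target), ("-", reverse_complement(target))):
        for i in range(len(seq) - n + 1):
            ident = percent_identity(query, seq[i:i + n])
            if ident > best[0]:
                best = (ident, i, strand)
    return best


def passes_cutoff(query: str, target: str, cutoff: float = 80.0) -> bool:
    """True if the best window is strictly above the cutoff ("more than 80%")."""
    return best_ungapped_hit(query, target)[0] > cutoff


if __name__ == "__main__":
    # Toy placeholder sequences, not real attachment sites.
    toy_query = "ACGTACGTAC"
    toy_target = "TTTACGTACGTAAGGG"
    identity, start, strand = best_ungapped_hit(toy_query, toy_target)
    print(f"best identity {identity:.1f}% at {start} on strand {strand}")
    print("candidate" if passes_cutoff(toy_query, toy_target) else "rejected")
```

In practice the query would be the 39-bp minimal attB(TG1) site and the targets would be genome records. A sequence passing this kind of filter is only a candidate, since the abstract reports that recognition was confirmed by the in vitro integrase assay.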

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Actinobacteria / genetics*
  • Actinobacteria / metabolism
  • Alphaproteobacteria / genetics*
  • Alphaproteobacteria / metabolism
  • Attachment Sites, Microbiological
  • Bacteriophages / enzymology*
  • Base Sequence
  • Computational Biology
  • DNA, Bacterial / metabolism*
  • Genome, Bacterial
  • Integrases / metabolism*
  • Molecular Sequence Data
  • Recombination, Genetic*

Substances

  • DNA, Bacterial
  • Integrases