Global analysis of Drosophila Cys₂-His₂ zinc finger proteins reveals a multitude of novel recognition motifs and binding determinants

Genome Res. 2013 Jun;23(6):928-40. doi: 10.1101/gr.151472.112. Epub 2013 Mar 7.

Abstract

Cys₂-His₂ zinc finger proteins (ZFPs) are the largest group of transcription factors in higher metazoans. A complete characterization of these ZFPs and their associated target sequences is pivotal to fully annotate transcriptional regulatory networks in metazoan genomes. As a first step in this process, we have characterized the DNA-binding specificities of 129 zinc finger sets from Drosophila using a bacterial one-hybrid system. This data set contains the DNA-binding specificities for at least one encoded ZFP from 70 unique genes and 23 alternate splice isoforms, representing the largest set of characterized ZFPs from any organism described to date. These recognition motifs can be used to predict genomic binding sites for these factors within the fruit fly genome. Subsets of fingers from these ZFPs were characterized to define their orientation and register on their recognition sequences, thereby allowing us to define the recognition diversity within this finger set. We find that the characterized fingers can specify 47 of the 64 possible DNA triplets. To confirm the utility of our finger recognition models, we employed subsets of Drosophila fingers in combination with an existing archive of artificial zinc finger modules to create ZFPs with novel DNA-binding specificity. These hybrids of natural and artificial fingers can be used to create functional zinc finger nucleases for editing vertebrate genomes.
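The abstract notes that recognition motifs can be used to predict genomic binding sites. As an illustration only, the following minimal Python sketch shows one common way this is done: a motif is converted to a log-odds position weight matrix, and every window on both strands of a sequence is scored against it. This is not the authors' pipeline. The function names, the toy three-position count matrix, the pseudocount, the uniform background, and the score threshold are all assumptions made for the example.

```python
import math

BASES = "ACGT"


def log_odds_matrix(counts, pseudocount=0.5, background=0.25):
    """Turn per-position base counts into log2-odds scores.

    The pseudocount and the uniform background are illustrative assumptions.
    """
    matrix = []
    for column in counts:
        total = sum(column[b] for b in BASES) + 4 * pseudocount
        matrix.append(
            {b: math.log2(((column[b] + pseudocount) / total) / background)
             for b in BASES}
        )
    return matrix


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def scan(sequence, matrix, threshold):
    """Score every window on both strands; keep those at or above threshold.

    Returns (start, strand, score) tuples, with start given as a 0-based
    coordinate on the forward strand.
    """
    width = len(matrix)
    hits = []
    for strand, seq in (("+", sequence), ("-", reverse_complement(sequence))):
        for i in range(len(seq) - width + 1):
            window = seq[i:i + width]
            if any(ch not in BASES for ch in window):
                continue  # skip windows containing N or other symbols
            score = sum(matrix[j][window[j]] for j in range(width))
            if score >= threshold:
                start = i if strand == "+" else len(sequence) - width - i
                hits.append((start, strand, round(score, 2)))
    return hits


# Toy motif (hypothetical counts, not data from the study): a single
# finger-sized triplet that strongly prefers G-C-G.
toy_counts = [
    {"A": 0, "C": 0, "G": 10, "T": 0},
    {"A": 0, "C": 10, "G": 0, "T": 0},
    {"A": 0, "C": 0, "G": 10, "T": 0},
]
toy_matrix = log_odds_matrix(toy_counts)

for hit in scan("TTGCGAA", toy_matrix, threshold=5.0):
    print(hit)
```

In practice the threshold would be chosen per motif, for example from the score distribution on background sequence, rather than fixed by hand as it is here.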

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Alternative Splicing
  • Animals
  • Base Sequence
  • Binding Sites*
  • Cluster Analysis
  • Computational Biology / methods
  • Drosophila / genetics*
  • Drosophila Proteins / chemistry
  • Drosophila Proteins / classification
  • Drosophila Proteins / genetics*
  • Models, Molecular
  • Nucleotide Motifs*
  • Phylogeny
  • Position-Specific Scoring Matrices
  • Protein Binding
  • Protein Conformation
  • Zinc Fingers / genetics*

Substances

  • Drosophila Proteins

Associated data

  • GEO/GSE42709