Selection of target sites for mobile DNA integration in the human genome

PLoS Comput Biol. 2006 Nov 24;2(11):e157. doi: 10.1371/journal.pcbi.0020157.

Abstract

DNA sequences from retroviruses, retrotransposons, DNA transposons, and parvoviruses can all become integrated into the human genome. Accumulation of such sequences accounts for at least 40% of our genome today. These integrating elements are also of interest as gene-delivery vectors for human gene therapy. Here we present a comprehensive bioinformatic analysis of integration targeting by HIV, MLV, ASLV, SFV, L1, SB, and AAV. We used a mathematical method which allowed annotation of each base pair in the human genome for its likelihood of hosting an integration event by each type of element, taking advantage of more than 200 types of genomic annotation. This bioinformatic resource documents a wealth of new associations between genomic features and integration targeting. The study also revealed that the length of genomic intervals analyzed strongly affected the conclusions drawn--thus, answering the question "What genomic features affect integration?" requires carefully specifying the length scale of interest.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Binding Sites
  • Cluster Analysis
  • Computational Biology / methods*
  • CpG Islands
  • DNA / chemistry*
  • DNA Transposable Elements
  • Gene Transfer Techniques
  • Genome, Human*
  • Genomics / methods
  • Humans
  • Models, Genetic
  • Software
  • Transcription, Genetic
  • Virus Integration*
  • Viruses / metabolism*

Substances

  • DNA Transposable Elements
  • DNA