Short-insert libraries as a method of problem solving in genome sequencing

Genome Res. 1998 May;8(5):562-6. doi: 10.1101/gr.8.5.562.

Abstract

As the Human Genome Project moves into its sequencing phase, a serious problem has arisen. The same problem has been increasingly vexing in the closing phase of the Caenorhabditis elegans project. The difficulty lies in sequencing efficiently through certain regions in which the templates (DNA substrates for the sequencing process) form complex folded secondary structures that are inaccessible to the enzymes. The solution, however, is simply to break them up. Specifically, the offending fragments are sonicated heavily and recloned, as much smaller fragments, into a pUC vector. The sequences obtained from the resulting library can subsequently be assembled, free from the effects of secondary structure, to produce high-quality, complete sequence. Because of the success and simplicity of this procedure, we have begun to use it for the sequencing of all regions in which standard primer walking has been at all difficult.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • BRCA2 Protein
  • Base Sequence
  • Caenorhabditis elegans / genetics
  • Chromosomes, Human, Pair 13 / genetics
  • Gene Library*
  • Genes, Helminth
  • Genes, Neoplasm
  • Genome, Human
  • Humans
  • Molecular Sequence Data
  • Neoplasm Proteins / genetics
  • Problem Solving*
  • Sequence Analysis, DNA / methods*
  • Transcription Factors / genetics

Substances

  • BRCA2 Protein
  • Neoplasm Proteins
  • Transcription Factors

Associated data

  • GENBANK/Z81558
  • GENBANK/Z92540
  • GENBANK/Z93392