Sequence assembly

Comput Biol Chem. 2009 Apr;33(2):121-36. doi: 10.1016/j.compbiolchem.2008.11.003. Epub 2008 Dec 6.

Abstract

Despite the rapidly increasing number of sequenced and re-sequenced genomes, many issues regarding the computational assembly of large-scale sequencing data have remain unresolved. Computational assembly is crucial in large genome projects as well for the evolving high-throughput technologies and plays an important role in processing the information generated by these methods. Here, we provide a comprehensive overview of the current publicly available sequence assembly programs. We describe the basic principles of computational assembly along with the main concerns, such as repetitive sequences in genomic DNA, highly expressed genes and alternative transcripts in EST sequences. We summarize existing comparisons of different assemblers and provide a detailed descriptions and directions for download of assembly programs at: http://genome.ku.dk/resources/assembly/methods.html.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Computational Biology / methods
  • DNA / chemistry
  • Expressed Sequence Tags
  • Genome
  • Genomics / methods*
  • Polymorphism, Single Nucleotide
  • Repetitive Sequences, Nucleic Acid

Substances

  • DNA