inGAP: an integrated next-generation genome analysis pipeline

Bioinformatics. 2010 Jan 1;26(1):127-9. doi: 10.1093/bioinformatics/btp615. Epub 2009 Oct 30.

Abstract

Summary: We develop a novel mining pipeline, Integrative Next-generation Genome Analysis Pipeline (inGAP), guided by a Bayesian principle to detect single nucleotide polymorphisms (SNPs), insertion/deletions (indels) by comparing high-throughput pyrosequencing reads with a reference genome of related organisms. inGAP can be applied to the mapping of both Roche/454 and Illumina reads with no restriction of read length. Experiments on simulated and experimental data show that this pipeline can achieve overall 97% accuracy in SNP detection and 94% in the finding of indels. All the detected SNPs/indels can be further evaluated by a graphical editor in our pipeline. inGAP also provides functions of multiple genomes comparison and assistance of bacterial genome assembly.

Availability: inGAP is available at http://sites.google.com/site/nextgengenomics/ingap

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Base Sequence
  • Chromosome Mapping / methods*
  • DNA Mutational Analysis / methods*
  • Genome / genetics*
  • Molecular Sequence Data
  • Pancreatitis-Associated Proteins
  • Polymorphism, Single Nucleotide / genetics*
  • Sequence Analysis, DNA / methods*
  • Software*
  • Systems Integration
  • User-Computer Interface