Whole-genome sequencing and comprehensive variant analysis of a Japanese individual using massively parallel sequencing

Nat Genet. 2010 Nov;42(11):931-6. doi: 10.1038/ng.691. Epub 2010 Oct 24.

Abstract

We report the analysis of a Japanese male using high-throughput sequencing to × 40 coverage. More than 99% of the sequence reads were mapped to the reference human genome. Using a Bayesian decision method, we identified 3,132,608 single nucleotide variations (SNVs). Comparison with six previously reported genomes revealed an excess of singleton nonsense and nonsynonymous SNVs, as well as singleton SNVs in conserved non-coding regions. We also identified 5,319 deletions smaller than 10 kb with high accuracy, in addition to copy number variations and rearrangements. De novo assembly of the unmapped sequence reads generated around 3 Mb of novel sequence, which showed high similarity to non-reference human genomes and the human herpesvirus 4 genome. Our analysis suggests that considerable variation remains undiscovered in the human genome and that whole-genome sequencing is an invaluable tool for obtaining a complete understanding of human genetic variation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Asian People / genetics
  • Base Sequence
  • Chromosome Mapping
  • Genetic Diseases, Inborn / genetics
  • Genetic Variation
  • Genome, Human*
  • Genome-Wide Association Study
  • Humans
  • Male
  • Polymorphism, Single Nucleotide
  • Translocation, Genetic