Parallel Analysis of 124 Universal SNPs for Human Identification by Targeted Semiconductor Sequencing

Sci Rep. 2015 Dec 22:5:18683. doi: 10.1038/srep18683.

Abstract

SNPs, abundant in human genome with lower mutation rate, are attractive to genetic application like forensic, anthropological and evolutionary studies. Universal SNPs showing little allelic frequency variation among populations while remaining highly informative for human identification were obtained from previous studies. However, genotyping tools target only dozens of markers simultaneously, limiting their applications. Here, 124 SNPs were simultaneous tested using Ampliseq technology with Ion Torrent PGM platform. Concordance study was performed with 2 reference samples of 9947A and 9948 between NGS and Sanger sequencing. Full concordance were obtained except genotype of rs576261 with 9947A. Parameter of FMAR (%) was introduced for NGS data analysis for the first time, evaluating allelic performance, sensitivity testing and mixture testing. FMAR values for accurate heterozygotes should be range from 50% to 60%, for homozygotes or Y-SNP should be above 90%. SNPs of rs7520386, rs4530059, rs214955, rs1523537, rs2342747, rs576261 and rs12997453 were recognized as poorly performing loci, either with allelic imbalance or with lower coverage. Sensitivity testing demonstrated that with DNA range from 10 ng-0.5 ng, all correct genotypes were obtained. For mixture testing, a clear linear correlation (R(2) = 0.9429) between the excepted FMAR and observed FMAR values of mixtures was observed.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Asian People / genetics
  • Base Sequence
  • DNA / genetics
  • Ethnicity / genetics
  • Forensic Anthropology / methods*
  • Gene Frequency / genetics
  • Heterozygote
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Male
  • Molecular Sequence Data
  • Polymorphism, Single Nucleotide / genetics*
  • Semiconductors*
  • Sequence Analysis, DNA / methods*
  • Software

Substances

  • DNA