US Population Data for 94 Identity-Informative SNP Loci

Genes (Basel). 2023 May 12;14(5):1071. doi: 10.3390/genes14051071.

Abstract

The US National Institute of Standards and Technology (NIST) analyzed a set of 1036 samples representing four major US population groups (African American, Asian American, Caucasian, and Hispanic) with 94 single nucleotide polymorphisms (SNPs) used for individual identification (iiSNPs). The compact size of iiSNP amplicons compared to short tandem repeat (STR) markers increases the likelihood of successful amplification with degraded DNA samples. Allele frequencies and relevant forensic statistics were calculated for each population group as well as the aggregate population sample. Examination of sequence data in the regions flanking the targeted SNPs identified additional variants, which can be combined with the target SNPs to form microhaplotypes (multiple phased SNPs within a short-read sequence). Comparison of iiSNP performance with and without flanking SNP variation identified four amplicons containing microhaplotypes with observed heterozygosity increases of greater than 15% over the targeted SNP alone. For this set of 1036 samples, comparison of average match probabilities from iiSNPs with the 20 CODIS core STR markers yielded an estimate of 1.7 × 10-38 for iiSNPs (assuming independence between all 94 SNPs), which was four orders of magnitude lower (more discriminating) than STRs where internal sequence variation was considered, and 10 orders of magnitude lower than STRs using established capillary electrophoresis length-based genotypes.

Keywords: human identification; microhaplotype; next generation sequencing; single nucleotide polymorphism.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Gene Frequency / genetics
  • Genotype
  • Heterozygote
  • High-Throughput Nucleotide Sequencing*
  • Polymorphism, Single Nucleotide* / genetics

Grants and funding

NIST received funding to support this work through an interagency agreement with the Federal Bureau of Investigation: NIST IAA # DJF-19-1200-R000221.