Open source software EuroForMix can be used to analyse complex SNP mixtures

Forensic Sci Int Genet. 2017 Nov:31:105-110. doi: 10.1016/j.fsigen.2017.08.001. Epub 2017 Aug 18.

Abstract

A series of two- and three-person mixtures of varying dilutions were prepared and analysed with Life Technologies' HID-Ion AmpliSeq™ Identity Panel v2.2 using the Ion PGM™ massively parallel sequencing (MPS) system. From this panel we used 134 autosomal SNPs. Using the reference samples of three donors, we evaluated the strength of evidence with likelihood ratio (LR) calculations using the open-source quantitative EuroForMix program and compared the results with a previous study using a qualitative software (LRmix). SNP analysis is a special case of STRs, restricted to a maximum of two alleles per locus. We showed that simple two-person mixtures can be readily analysed with both LRmix and Euroformix, but the performance of three- or more person mixtures is generally inefficient with LRmix. Taking account of the "peak height" information, by substituting 'sequence read' coverage values from the MPS data for each SNP allele, greatly improves the discrimination between true and non-contributors. The higher the mixture proportion (Mx) of the person of interest is, the higher the LR. Simulation experiments (up to six contributors) showed that the strength of the evidence is dependent upon Mx, but relatively insensitive to the number of contributors. If a higher number of loci were multiplexed, the analysis of mixtures would be much improved, because the extra information would enable lower Mx values to be evaluated. In summary, incorporating the 'sequence read' (coverage) into the quantitative model shows a significant benefit over the qualitative approach. Calculations are quite fast (six seconds for three contributors).

Keywords: Complex mixture; EuroForMix; Likelihood ratio; SNP; Sequence read.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA Fingerprinting*
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Likelihood Functions*
  • Polymorphism, Single Nucleotide*
  • Sequence Analysis, DNA
  • Software*