DOMINO: development of informative molecular markers for phylogenetic and genome-wide population genetic studies in non-model organisms

Bioinformatics. 2016 Dec 15;32(24):3753-3759. doi: 10.1093/bioinformatics/btw534. Epub 2016 Aug 16.

Abstract

Motivation: The development of molecular markers is one of the most important challenges in phylogenetic and genome wide population genetics studies, especially in studies with non-model organisms. A highly promising approach for obtaining suitable markers is the utilization of genomic partitioning strategies for the simultaneous discovery and genotyping of a large number of markers. Unfortunately, not all markers obtained from these strategies provide enough information for solving multiple evolutionary questions at a reasonable taxonomic resolution.

Results: We have developed Development Of Molecular markers In Non-model Organisms (DOMINO), a bioinformatics tool for informative marker development from both next generation sequencing (NGS) data and pre-computed sequence alignments. The application implements popular NGS tools with new utilities in a highly versatile pipeline specifically designed to discover or select personalized markers at different levels of taxonomic resolution. These markers can be directly used to study the taxa surveyed for their design, utilized for further downstream PCR amplification in a broader set taxonomic scope, or exploited as suitable templates to bait design for target DNA enrichment techniques. We conducted an exhaustive evaluation of the performance of DOMINO via computer simulations and illustrate its utility to find informative markers in an empirical dataset.

Availability and implementation: DOMINO is freely available from www.ub.edu/softevol/domino CONTACT: [email protected] or [email protected] information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Computational Biology / methods*
  • Genetic Markers*
  • Genetics, Population / methods*
  • Genomics / methods
  • Genotype
  • High-Throughput Nucleotide Sequencing
  • Phylogeny*
  • User-Computer Interface

Substances

  • Genetic Markers