PopNet: A Markov Clustering Approach to Study Population Genetic Structure

Mol Biol Evol. 2017 Jul 1;34(7):1799-1811. doi: 10.1093/molbev/msx110.

Abstract

With the advent of low cost, high-throughput genome sequencing technology, population genomic data sets are being generated for hundreds of species of pathogenic, industrial, and agricultural importance. The challenge is how best to analyze and visually display these complex data sets to yield intuitive representations capable of capturing complex evolutionary relationships. Here we present PopNet, a novel computational method that identifies regions of shared ancestry in the chromosomes of related strains through clustering patterns of genetic variation. These relationships are subsequently visualized within a network by a novel implementation of chromosome painting. We apply PopNet to three diverse populations that feature differential rates of recombination and demonstrate its ability to capture evolutionary relationships as well as associate traits to specific loci. Compared with existing tools, PopNet provides substantial advances by both removing the need to predefine a single reference genome that can bias interpretation of population structure, as well as its ability to visualize multiple evolutionary relationships, such as recombination events and shared ancestry, across hundreds of strains.

Keywords: chromosome painting; network visualization; population genomics; single nucleotide polymorphisms; software.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, N.I.H., Intramural

MeSH terms

  • Algorithms
  • Base Sequence
  • Chromosome Mapping / methods
  • Cluster Analysis
  • Genetic Variation / genetics
  • Genetics, Population / methods*
  • Genome / genetics
  • Genomics / methods*
  • Linkage Disequilibrium / genetics
  • Markov Chains
  • Metagenomics / methods
  • Polymorphism, Single Nucleotide / genetics
  • Recombination, Genetic / genetics
  • Sequence Analysis, DNA / methods*

Grants and funding