gpart: human genome partitioning and visualization of high-density SNP data by identifying haplotype blocks

Bioinformatics. 2019 Nov 1;35(21):4419-4421. doi: 10.1093/bioinformatics/btz308.

Abstract

Summary: For the analysis of high-throughput genomic data produced by next-generation sequencing (NGS) technologies, researchers need to identify linkage disequilibrium (LD) structure in the genome. In this work, we developed an R package gpart which provides clustering algorithms to define LD blocks or analysis units consisting of SNPs. The visualization tool in gpart can display the LD structure and gene positions for up to 20 000 SNPs in one image. The gpart functions facilitate construction of LD blocks and SNP partitions for vast amounts of genome sequencing data within reasonable time and memory limits in personal computing environments.

Availability and implementation: The R package is available at https://bioconductor.org/packages/gpart.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Genome, Human*
  • Haplotypes
  • Humans
  • Linkage Disequilibrium
  • Polymorphism, Single Nucleotide*
  • Software