plyranges: a grammar of genomic data transformation

Genome Biol. 2019 Jan 4;20(1):4. doi: 10.1186/s13059-018-1597-8.

Abstract

Bioconductor is a widely used R-based platform for genomics, but its host of complex genomic data structures places a cognitive burden on the user. For most tasks, the GRanges object would suffice, but there are gaps in the API that prevent its general use. By recognizing that the GRanges class follows "tidy" data principles, we create a grammar of genomic data transformation, defining verbs for performing actions on and between genomic interval data and providing a way of performing common data analysis tasks through a coherent interface to existing Bioconductor infrastructure. We implement this grammar as a Bioconductor/R package called plyranges.
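
To make the idea of "verbs" on tidy interval data concrete, here is a minimal, language-agnostic sketch in Python. It is not the plyranges API and does not use Bioconductor. The function names (filter_ranges, mutate, join_overlap), the column names, and the toy data are all hypothetical and chosen for illustration. It assumes each range is one row with 1-based, inclusive coordinates plus metadata columns.

```python
# Illustrative only: a toy "grammar" over tidy genomic ranges.
# Every name here is an assumption; none of it is taken from plyranges.

def filter_ranges(ranges, predicate):
    """Keep the rows (ranges) for which predicate(row) is true."""
    return [r for r in ranges if predicate(r)]


def mutate(ranges, **columns):
    """Return new rows with extra columns computed from each row."""
    return [{**r, **{name: fn(r) for name, fn in columns.items()}}
            for r in ranges]


def overlaps(a, b):
    """True if two ranges share a sequence and at least one position."""
    return (a["seqname"] == b["seqname"]
            and a["start"] <= b["end"]
            and b["start"] <= a["end"])


def join_overlap(left, right):
    """Inner join: one output row per overlapping (left, right) pair.

    Coordinates come from the left range; metadata columns from the
    right range are appended (assumes metadata names do not clash).
    """
    coords = ("seqname", "start", "end")
    out = []
    for a in left:
        for b in right:
            if overlaps(a, b):
                extra = {k: v for k, v in b.items() if k not in coords}
                out.append({**a, **extra})
    return out


# Toy data: one row per range, coordinates plus metadata.
peaks = [
    {"seqname": "chr1", "start": 100, "end": 200, "score": 5},
    {"seqname": "chr1", "start": 500, "end": 600, "score": 1},
    {"seqname": "chr2", "start": 100, "end": 200, "score": 9},
]
genes = [
    {"seqname": "chr1", "start": 150, "end": 400, "gene": "A"},
    {"seqname": "chr2", "start": 300, "end": 900, "gene": "B"},
]

# Verbs compose: act on one set of ranges, then between two sets.
strong = filter_ranges(peaks, lambda r: r["score"] > 2)
with_width = mutate(strong, width=lambda r: r["end"] - r["start"] + 1)
annotated = join_overlap(with_width, genes)
```

The point of the sketch is the shape of the interface: each verb takes ranges and returns ranges, so single-table actions (filter, mutate) and two-table actions (overlap joins) chain into one pipeline. The nested loop in join_overlap is quadratic and only suitable for a toy example.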

Keywords: Bioconductor; Data analysis; Genomes; Grammar.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Genomics / methods*
  • Software*
  • Terminology as Topic