*-DCC: A platform to collect, annotate, and explore a large variety of sequencing experiments

Gigascience. 2020 Mar 1;9(3):giaa024. doi: 10.1093/gigascience/giaa024.

Abstract

Background: Over the past few years the variety of experimental designs and protocols for sequencing experiments increased greatly. To ensure the wide usability of the produced data beyond an individual project, rich and systematic annotation of the underlying experiments is crucial.

Findings: We first developed an annotation structure that captures the overall experimental design as well as the relevant details of the steps from the biological sample to the library preparation, the sequencing procedure, and the sequencing and processed files. Through various design features, such as controlled vocabularies and different field requirements, we ensured a high annotation quality, comparability, and ease of annotation. The structure can be easily adapted to a large variety of species. We then implemented the annotation strategy in a user-hosted web platform with data import, query, and export functionality.

Conclusions: We present here an annotation structure and user-hosted platform for sequencing experiment data, suitable for lab-internal documentation, collaborations, and large-scale annotation efforts.

Keywords: databases; sample annotation; sequencing data annotation; sequencing experiments.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Molecular Sequence Annotation / methods*
  • Molecular Sequence Annotation / standards
  • Sequence Analysis / methods*
  • Sequence Analysis / standards
  • Software*