PHYMYCO-DB: a curated database for analyses of fungal diversity and evolution

Stéphane Mahé; Marie Duhamel; Thomas Le Calvez; Laetitia Guillot; Ludmila Sarbu; Anthony Bretaudeau; Olivier Collin; Alexis Dufresne; E Toby Kiers; Philippe Vandenkoornhuyse

doi:10.1371/journal.pone.0043117

PHYMYCO-DB: a curated database for analyses of fungal diversity and evolution

PLoS One. 2012;7(9):e43117. doi: 10.1371/journal.pone.0043117. Epub 2012 Sep 13.

Authors

Stéphane Mahé¹, Marie Duhamel, Thomas Le Calvez, Laetitia Guillot, Ludmila Sarbu, Anthony Bretaudeau, Olivier Collin, Alexis Dufresne, E Toby Kiers, Philippe Vandenkoornhuyse

Affiliation

¹ Université de Rennes I, CNRS, UMR 6553 ECOBIO, Campus de Beaulieu, Rennes, France.

Abstract

Background: In environmental sequencing studies, fungi can be identified based on nucleic acid sequences, using either highly variable sequences as species barcodes or conserved sequences containing a high-quality phylogenetic signal. For the latter, identification relies on phylogenetic analyses and the adoption of the phylogenetic species concept. Such analysis requires that the reference sequences are well identified and deposited in public-access databases. However, many entries in the public sequence databases are problematic in terms of quality and reliability and these data require screening to ensure correct phylogenetic interpretation.

Methods and principal findings: To facilitate phylogenetic inferences and phylogenetic assignment, we introduce a fungal sequence database. The database PHYMYCO-DB comprises fungal sequences from GenBank that have been filtered to satisfy stringent sequence quality criteria. For the first release, two widely used molecular taxonomic markers were chosen: the nuclear SSU rRNA and EF1-α gene sequences. Following the automatic extraction and filtration, a manual curation is performed to remove problematic sequences while preserving relevant sequences useful for phylogenetic studies. As a result of curation, ~20% of the automatically filtered sequences have been removed from the database. To demonstrate how PHYMYCO-DB can be employed, we test a set of environmental Chytridiomycota sequences obtained from deep sea samples.

Conclusion: PHYMYCO-DB offers the tools necessary to: (i) extract high quality fungal sequences for each of the 5 fungal phyla, at all taxonomic levels, (ii) extract already performed alignments, to act as 'reference alignments', (iii) launch alignments of personal sequences along with stored data. A total of 9120 SSU rRNA and 672 EF1-α high-quality fungal sequences are now available. The PHYMYCO-DB is accessible through the URL http://phymycodb.genouest.org/.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Base Sequence
Databases, Nucleic Acid*
Evolution, Molecular*
Fungi / classification
Fungi / genetics*
Genetic Variation*
Internet
Molecular Sequence Data
Phylogeny
RNA, Ribosomal / genetics
Sequence Alignment

Substances

RNA, Ribosomal

Grants and funding

This work was supported by a grant from the “Total Corporate Foundation for biodiversity and the sea” and a grant from the French National Agency for Research within the Systerra call (ANR-10-STRA-002). MD has a Ph.D. grant from the French ministry of research. ETK is supported by an Nederlandse Organisatie voor Wetenschappelijk Onderzoek (NWO) “vidi” and “meervoud” grant. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.