A practical implementation of large transcriptomic data analysis to resolve cryptic species diversity problems in microbial eukaryotes

BMC Evol Biol. 2018 Nov 16;18(1):170. doi: 10.1186/s12862-018-1283-1.

Abstract

Background: Transcriptome sequencing has become a method of choice for evolutionary studies in microbial eukaryotes due to low cost and minimal sample requirements. Transcriptome data has been extensively used in phylogenomic studies to infer ancient evolutionary histories. However, its utility in studying cryptic species diversity is not well explored. An empirical investigation was conducted to test the applicability of transcriptome data in resolving two major types of discordances at lower taxonomic levels. These include cases where species have the same morphology but different genetics (cryptic species) and species of different morphologies but have the same genetics. We built a species comparison bioinformatic pipeline that takes into account the nature of transcriptome data in amoeboid microbes exemplifying such discordances.

Result: Our analyses of known or suspected cryptic species yielded consistent results regardless of the methods of culturing, RNA collection or sequencing. Over 95% of the single copy genes analyzed in samples of the same species sequenced using different methods and cryptic species had intra- and interspecific divergences below 2%. Only a minority of groups (2.91-4.87%) had high distances exceeding 2% in these taxa, which was likely caused by low data quality. This pattern was also observed in suspected genetically similar species with different morphologies. Transcriptome data consistently delineated all taxa above species level, including cryptically diverse species. Using our approach we were able to resolve cryptic species problems, uncover misidentification and discover new species. We also identified several potential barcode markers with varying evolutionary rates that can be used in lineages with different evolutionary histories.

Conclusion: Our findings demonstrate that transcriptome data is appropriate for understanding cryptic species diversity in microbial eukaryotes.

Keywords: Amoebozoa; Bioinformatics; Cryptic species; DNA barcode; Phylogenomics; RNA-seq.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Base Sequence
  • Biological Evolution
  • DNA Barcoding, Taxonomic
  • Data Analysis*
  • Eukaryota / genetics*
  • Gene Expression Profiling*
  • Genetic Markers
  • Genetic Variation*
  • Genome
  • Phylogeny
  • Species Specificity
  • Transcriptome / genetics

Substances

  • Genetic Markers