Efficient Detection of Novel Nuclear Markers for Brassicaceae by Transcriptome Sequencing

PLoS One. 2015 Jun 10;10(6):e0128181. doi: 10.1371/journal.pone.0128181. eCollection 2015.

Abstract

The lack of DNA sequence information for most non-model organisms impairs the design of primers that are universally applicable for the study of molecular polymorphisms in nuclear markers. Next-generation sequencing (NGS) techniques nowadays provide a powerful approach to overcome this limitation. We present a flexible and inexpensive method to identify large numbers of nuclear primer pairs that amplify in most Brassicaceae species. We first obtained and mapped NGS transcriptome sequencing reads from two of the distantly related Brassicaceae species, Cardamine hirsuta and Arabis alpina, onto the Arabidopsis thaliana reference genome, and then identified short conserved sequence motifs among the three species bioinformatically. From these, primer pairs to amplify coding regions (nuclear protein coding loci, NPCL) and exon-primed intron-crossing sequences (EPIC) were developed. We identified 2,334 universally applicable primer pairs, targeting 1,164 genes, which provide a large pool of markers as readily usable genomic resource that will help addressing novel questions in the Brassicaceae family. Testing a subset of the newly designed nuclear primer pairs revealed that a great majority yielded a single amplicon in all of the 30 investigated Brassicaceae taxa. Sequence analysis and phylogenetic reconstruction with a subset of these markers on different levels of phylogenetic divergence in the mustard family were compared with previous studies. The results corroborate the usefulness of the newly developed primer pairs, e.g., for phylogenetic analyses or population genetic studies. Thus, our method provides a cost-effective approach for designing nuclear loci across a broad range of taxa and is compatible with current NGS technologies.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Brassicaceae / classification
  • Brassicaceae / genetics*
  • Cell Nucleus / genetics
  • Computational Biology / methods
  • Gene Expression Profiling / economics
  • Gene Expression Profiling / methods*
  • Genetic Loci
  • Genetic Markers / genetics*
  • Phylogeny
  • Plant Proteins / genetics
  • Sequence Analysis, RNA / economics
  • Sequence Analysis, RNA / methods*

Substances

  • Genetic Markers
  • Plant Proteins

Grants and funding

The study was funded by the Swiss National Science Foundation (Sinergia project CRSI33_127155), ProDoc PDFMP3_137084 and University Research Priority Program in Systems Biology/Functional Genomics. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.