Using Next Generation RAD Sequencing to Isolate Multispecies Microsatellites for Pilosocereus (Cactaceae)

PLoS One. 2015 Nov 11;10(11):e0142602. doi: 10.1371/journal.pone.0142602. eCollection 2015.

Abstract

Microsatellite markers (also known as SSRs, Simple Sequence Repeats) are widely used in plant science and are among the most informative molecular markers for population genetic investigations, but the development of such markers presents substantial challenges. In this report, we discuss how next generation sequencing can replace the cloning, Sanger sequencing, identification of polymorphic loci, and testing cross-amplification that were previously required to develop microsatellites. We report the development of a large set of microsatellite markers for five species of the Neotropical cactus genus Pilosocereus using a restriction-site-associated DNA sequencing (RAD-seq) on a Roche 454 platform. We identified an average of 165 microsatellites per individual, with the absolute numbers across individuals proportional to the sequence reads obtained per individual. Frequency distribution of the repeat units was similar in the five species, with shorter motifs such as di- and trinucleotide being the most abundant repeats. In addition, we provide 72 microsatellites that could be potentially amplified in the sampled species and 22 polymorphic microsatellites validated in two populations of the species Pilosocereus machrisii. Although low coverage sequencing among individuals was observed for most of the loci, which we suggest to be more related to the nature of the microsatellite markers and the possible bias inserted by the restriction enzymes than to the genome size, our work demonstrates that an NGS approach is an efficient method to isolate multispecies microsatellites even in non-model organisms.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cactaceae / genetics*
  • Computational Biology
  • Cost-Benefit Analysis
  • DNA Primers
  • DNA Restriction Enzymes / genetics
  • DNA, Plant / genetics
  • Genetic Markers / genetics
  • Genetic Variation
  • Genotype
  • High-Throughput Nucleotide Sequencing / methods*
  • Linkage Disequilibrium
  • Microsatellite Repeats / genetics*
  • Molecular Sequence Data
  • Polymerase Chain Reaction
  • Polymorphism, Genetic
  • Sequence Analysis, DNA
  • Species Specificity

Substances

  • DNA Primers
  • DNA, Plant
  • Genetic Markers
  • DNA Restriction Enzymes

Associated data

  • GENBANK/KR478539
  • GENBANK/KR478540
  • GENBANK/KR478541
  • GENBANK/KR478542
  • GENBANK/KR478543
  • GENBANK/KR478544
  • GENBANK/KR478545
  • GENBANK/KR478546
  • GENBANK/KR478547
  • GENBANK/KR478548
  • GENBANK/KR478549
  • GENBANK/KR478550
  • GENBANK/KR478551
  • GENBANK/KR478552
  • GENBANK/KR478553
  • GENBANK/KR478554
  • GENBANK/KR478555
  • GENBANK/KR478556
  • GENBANK/KR478557
  • GENBANK/KR478558
  • GENBANK/KR478559
  • GENBANK/KR478560
  • GENBANK/KR478561
  • GENBANK/KR478562
  • GENBANK/KR478563
  • GENBANK/KR478564
  • GENBANK/KR478565
  • GENBANK/KR478566
  • GENBANK/KR478567
  • GENBANK/KR478568
  • GENBANK/KR478569
  • GENBANK/KR478570
  • GENBANK/KR478571
  • GENBANK/KR478572
  • GENBANK/KR478573
  • GENBANK/KR478574
  • GENBANK/KR478575
  • GENBANK/KR478576
  • GENBANK/KR478577
  • GENBANK/KR478578
  • GENBANK/KR478579
  • GENBANK/KR478580
  • GENBANK/KR478581
  • GENBANK/KR478582
  • GENBANK/KR478583
  • GENBANK/KR478584
  • GENBANK/KR478585
  • GENBANK/KR478586
  • GENBANK/KR478587
  • GENBANK/KR478588
  • GENBANK/KR478589
  • GENBANK/KR478590
  • GENBANK/KR478591
  • GENBANK/KR478592
  • GENBANK/KR478593
  • GENBANK/KR478594
  • GENBANK/KR478595
  • GENBANK/KR478596
  • GENBANK/KR478597
  • GENBANK/KR478598
  • GENBANK/KR478599
  • GENBANK/KR478600
  • GENBANK/KR478601
  • GENBANK/KR478602
  • GENBANK/KR478603
  • GENBANK/KR478604
  • GENBANK/KR478605
  • GENBANK/KR478606
  • GENBANK/KR478607
  • GENBANK/KR478608
  • GENBANK/KR478609
  • GENBANK/KR478610
  • GENBANK/KR478611
  • GENBANK/KR478612
  • GENBANK/KR478613
  • GENBANK/KR478614
  • GENBANK/KR478615
  • GENBANK/KR478616
  • GENBANK/KR478617
  • GENBANK/KR478618
  • GENBANK/KR478619
  • GENBANK/KR478620
  • GENBANK/KR478621
  • GENBANK/KR478622
  • GENBANK/KR478623
  • GENBANK/KR478624
  • GENBANK/KR478625
  • GENBANK/KR478626
  • GENBANK/KR478627
  • GENBANK/KR478628
  • GENBANK/KR478629
  • GENBANK/KR478630
  • GENBANK/KR478631
  • GENBANK/KR478632

Grants and funding

This work was supported through grants from the São Paulo Research Foundation (FAPESP) (2012/22857-8 to IASB and 2012/22477-0 to EMM) and CAPES (BEX: 5639/13-1 to IASB). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.