Genomic DNA enrichment using sequence capture microarrays: a novel approach to discover sequence nucleotide polymorphisms (SNP) in Brassica napus L

PLoS One. 2013 Dec 3;8(12):e81992. doi: 10.1371/journal.pone.0081992. eCollection 2013.

Abstract

Targeted genomic selection methodologies, or sequence capture, allow for DNA enrichment and large-scale resequencing and characterization of natural genetic variation in species with complex genomes, such as rapeseed canola (Brassica napus L., AACC, 2n=38). The main goal of this project was to combine sequence capture with next generation sequencing (NGS) to discover single nucleotide polymorphisms (SNPs) in specific areas of the B. napus genome historically associated (via quantitative trait loci -QTL- analysis) to traits of agronomical and nutritional importance. A 2.1 million feature sequence capture platform was designed to interrogate DNA sequence variation across 47 specific genomic regions, representing 51.2 Mb of the Brassica A and C genomes, in ten diverse rapeseed genotypes. All ten genotypes were sequenced using the 454 Life Sciences chemistry and to assess the effect of increased sequence depth, two genotypes were also sequenced using Illumina HiSeq chemistry. As a result, 589,367 potentially useful SNPs were identified. Analysis of sequence coverage indicated a four-fold increased representation of target regions, with 57% of the filtered SNPs falling within these regions. Sixty percent of discovered SNPs corresponded to transitions while 40% were transversions. Interestingly, fifty eight percent of the SNPs were found in genic regions while 42% were found in intergenic regions. Further, a high percentage of genic SNPs was found in exons (65% and 64% for the A and C genomes, respectively). Two different genotyping assays were used to validate the discovered SNPs. Validation rates ranged from 61.5% to 84% of tested SNPs, underpinning the effectiveness of this SNP discovery approach. Most importantly, the discovered SNPs were associated with agronomically important regions of the B. napus genome generating a novel data resource for research and breeding this crop species.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Brassica napus / genetics*
  • DNA, Plant / genetics*
  • Genome, Plant / genetics
  • Genomics / methods*
  • Genotype
  • High-Throughput Nucleotide Sequencing / methods*
  • Introns / genetics
  • Oligonucleotide Array Sequence Analysis*
  • Polymorphism, Single Nucleotide*
  • Quantitative Trait Loci / genetics
  • Reproducibility of Results
  • Sequence Analysis, DNA / methods*

Substances

  • DNA, Plant

Grants and funding

This research was funded by Comisión Nacional de Investigación Científica y Tecnológica de Chile (CONICYT), FONDECYT/Regular 1100732 and CONICYT REGIONAL/GORE LA ARAUCANIA/CGNA/R10C1001. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.