De novo assembly of soybean wild relatives for pan-genome analysis of diversity and agronomic traits

Nat Biotechnol. 2014 Oct;32(10):1045-52. doi: 10.1038/nbt.2979. Epub 2014 Sep 14.

Abstract

Wild relatives of crops are an important source of genetic diversity for agriculture, but their gene repertoire remains largely unexplored. We report the establishment and analysis of a pan-genome of Glycine soja, the wild relative of cultivated soybean Glycine max, by sequencing and de novo assembly of seven phylogenetically and geographically representative accessions. Intergenomic comparisons identified lineage-specific genes and genes with copy number variation or large-effect mutations, some of which show evidence of positive selection and may contribute to variation of agronomic traits such as biotic resistance, seed composition, flowering and maturity time, organ size and final biomass. Approximately 80% of the pan-genome was present in all seven accessions (core), whereas the rest was dispensable and exhibited greater variation than the core genome, perhaps reflecting a role in adaptation to diverse environments. This work will facilitate the harnessing of untapped genetic diversity from wild soybean for enhancement of elite cultivars.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Agriculture
  • Amino Acid Sequence
  • Biomass
  • DNA, Plant / analysis
  • DNA, Plant / genetics
  • Disease Resistance / genetics
  • Genome, Plant / genetics*
  • Genomics / methods*
  • Glycine max / classification
  • Glycine max / genetics*
  • Glycine max / physiology*
  • Molecular Sequence Data
  • Phylogeny
  • Polymorphism, Single Nucleotide / genetics*
  • Seeds / genetics
  • Sequence Alignment
  • Sequence Analysis, DNA

Substances

  • DNA, Plant