Genome assembly with in vitro proximity ligation data and whole-genome triplication in lettuce

Nat Commun. 2017 Apr 12:8:14953. doi: 10.1038/ncomms14953.

Abstract

Lettuce (Lactuca sativa) is a major crop and a member of the large, highly successful Compositae family of flowering plants. Here we present a reference assembly for the species and family. This was generated using whole-genome shotgun Illumina reads plus in vitro proximity ligation data to create large superscaffolds; it was validated genetically and superscaffolds were oriented in genetic bins ordered along nine chromosomal pseudomolecules. We identify several genomic features that may have contributed to the success of the family, including genes encoding Cycloidea-like transcription factors, kinases, enzymes involved in rubber biosynthesis and disease resistance proteins that are expanded in the genome. We characterize 21 novel microRNAs, one of which may trigger phasiRNAs from numerous kinase transcripts. We provide evidence for a whole-genome triplication event specific but basal to the Compositae. We detect 26% of the genome in triplicated regions containing 30% of all genes that are enriched for regulatory sequences and depleted for genes involved in defence.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Asteraceae / classification
  • Asteraceae / genetics
  • Chromosome Mapping
  • Chromosomes, Plant / genetics
  • Gene Expression Profiling
  • Gene Expression Regulation, Plant
  • Genes, Plant / genetics
  • Genome, Plant / genetics*
  • Genome-Wide Association Study
  • Genomics / methods*
  • Lactuca / genetics*
  • Molecular Sequence Annotation
  • Phylogeny
  • Triploidy*
  • Whole Genome Sequencing