Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1) provides insights into genome evolution

Nat Biotechnol. 2015 May;33(5):524-30. doi: 10.1038/nbt.3208. Epub 2015 Apr 20.

Abstract

Gossypium hirsutum has proven difficult to sequence owing to its complex allotetraploid (AtDt) genome. Here we produce a draft genome using 181-fold paired-end sequences assisted by fivefold BAC-to-BAC sequences and a high-resolution genetic map. In our assembly 88.5% of the 2,173-Mb scaffolds, which cover 89.6%∼96.7% of the AtDt genome, are anchored and oriented to 26 pseudochromosomes. Comparison of this G. hirsutum AtDt genome with the already sequenced diploid Gossypium arboreum (AA) and Gossypium raimondii (DD) genomes revealed conserved gene order. Repeated sequences account for 67.2% of the AtDt genome, and transposable elements (TEs) originating from Dt seem more active than from At. Reduction in the AtDt genome size occurred after allopolyploidization. The A or At genome may have undergone positive selection for fiber traits. Concerted evolution of different regulatory mechanisms for Cellulose synthase (CesA) and 1-Aminocyclopropane-1-carboxylic acid oxidase1 and 3 (ACO1,3) may be important for enhanced fiber production in G. hirsutum.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Oxidoreductases / genetics
  • Base Sequence
  • Chromosome Mapping
  • Cotton Fiber
  • DNA Transposable Elements / genetics
  • Evolution, Molecular*
  • Genome, Plant*
  • Glucosyltransferases / genetics
  • Gossypium / genetics*
  • Phylogeny
  • Polyploidy
  • Sequence Analysis, DNA*

Substances

  • DNA Transposable Elements
  • Amino Acid Oxidoreductases
  • 1-aminocyclopropane-1-carboxylic acid oxidase
  • Glucosyltransferases
  • cellulose synthase

Associated data

  • BioProject/PRJNA259930
  • SRA/SRA180756