Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement

Nat Biotechnol. 2015 May;33(5):531-7. doi: 10.1038/nbt.3207. Epub 2015 Apr 20.

Abstract

Upland cotton is a model for polyploid crop domestication and transgenic improvement. Here we sequenced the allotetraploid Gossypium hirsutum L. acc. TM-1 genome by integrating whole-genome shotgun reads, bacterial artificial chromosome (BAC)-end sequences and genotype-by-sequencing genetic maps. We assembled and annotated 32,032 A-subgenome genes and 34,402 D-subgenome genes. Structural rearrangements, gene loss, disrupted genes and sequence divergence were more common in the A subgenome than in the D subgenome, suggesting asymmetric evolution. However, no genome-wide expression dominance was found between the subgenomes. Genomic signatures of selection and domestication are associated with positively selected genes (PSGs) for fiber improvement in the A subgenome and for stress tolerance in the D subgenome. This draft genome sequence provides a resource for engineering superior cotton lines.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Chromosome Mapping
  • Cotton Fiber*
  • Genome, Plant*
  • Gossypium / genetics*
  • High-Throughput Nucleotide Sequencing
  • Plant Proteins / biosynthesis
  • Plant Proteins / genetics*
  • Sequence Analysis, DNA
  • Tetraploidy

Substances

  • Plant Proteins

Associated data

  • SRA/PRJNA248163