Comparative chloroplast genomes of camellia species

PLoS One. 2013 Aug 23;8(8):e73053. doi: 10.1371/journal.pone.0073053. eCollection 2013.

Abstract

Background: Camellia, comprising more than 200 species, is a valuable economic commodity due to its enormously popular commercial products: tea leaves, flowers, and high-quality edible oils. It is the largest and most important genus in the family Theaceae. However, phylogenetic resolution of the species has proven to be difficult. Consequently, the interspecies relationships of the genus Camellia are still hotly debated. Phylogenomics is an attractive avenue that can be used to reconstruct the tree of life, especially at low taxonomic levels.

Methodology/principal findings: Seven complete chloroplast (cp) genomes were sequenced from six species representing different subdivisions of the genus Camellia using Illumina sequencing technology. Four junctions between the single-copy segments and the inverted repeats were confirmed and genome assemblies were validated by PCR-based product sequencing using 123 pairs of primers covering preliminary cp genome assemblies. The length of the Camellia cp genome was found to be about 157kb, which contained 123 unique genes and 23 were duplicated in the IR regions. We determined that the complete Camellia cp genome was relatively well conserved, but contained enough genetic differences to provide useful phylogenetic information. Phylogenetic relationships were analyzed using seven complete cp genomes of six Camellia species. We also identified rapidly evolving regions of the cp genome that have the potential to be used for further species identification and phylogenetic resolution.

Conclusions/significance: In this study, we wanted to determine if analyzing completely sequenced cp genomes could help settle these controversies of interspecies relationships in Camellia. The results demonstrate that cp genome data are beneficial in resolving species definition because they indicate that organelle-based "barcodes", can be established for a species and then used to unmask interspecies phylogenetic relationships. It reveals that phylogenomics based on cp genomes is an effective approach for achieving phylogenetic resolution between Camellia species.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Camellia / classification
  • Camellia / genetics*
  • Chloroplasts / genetics*
  • Computational Biology
  • Gene Order
  • Genetic Variation
  • Genome, Chloroplast*
  • Genomics
  • Molecular Sequence Data
  • Phylogeny
  • Repetitive Sequences, Nucleic Acid
  • Sequence Analysis, DNA

Associated data

  • GENBANK/KF156833
  • GENBANK/KF156834
  • GENBANK/KF156835
  • GENBANK/KF156836
  • GENBANK/KF156837
  • GENBANK/KF156838
  • GENBANK/KF156839

Grants and funding

This work was funded by the National Natural Science Foundation of China (30870169, 31161140350), the National 863 Project of China (2012AA021801) and the Chinese Academy of Sciences through a Large-Scale Scientific Facilities Research Project (2009-LSFGBOWS-01). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.