De novo transcriptome assembly in chili pepper (Capsicum frutescens) to identify genes involved in the biosynthesis of capsaicinoids

PLoS One. 2013;8(1):e48156. doi: 10.1371/journal.pone.0048156. Epub 2013 Jan 22.

Abstract

The capsaicinoids are a group of compounds produced by chili pepper fruits and are used widely in many fields, especially in medical purposes. The capsaicinoid biosynthetic pathway has not yet been established clearly. To understand more knowledge in biosynthesis of capsaicinoids, we applied RNA-seq for the mixture of placenta and pericarp of pungent pepper (Capsicum frutescens L.). We have assessed the effect of various assembly parameters using different assembly software, and obtained one of the best strategies for de novo assembly of transcriptome data. We obtained a total 54,045 high-quality unigenes (transcripts) using Trinity software. About 92.65% of unigenes showed similarity to the public protein sequences, genome of potato and tomato and pepper (C. annuum) ESTs databases. Our results predicted 3 new structural genes (DHAD, TD, PAT), which filled gaps of the capsaicinoid biosynthetic pathway predicted by Mazourek, and revealed new candidate genes involved in capsaicinoid biosynthesis based on KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis. A significant number of SSR (Simple Sequence Repeat) and SNP (Single Nucleotide Polymorphism) markers were predicted in C. frutescens and C. annuum sequences, which will be helpful in the identification of polymorphisms within chili pepper populations. These data will provide new insights to the pathway of capsaicinoid biosynthesis and subsequent research of chili peppers. In addition, our strategy of de novo transcriptome assembly is applicable to a wide range of similar studies.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Capsaicin / metabolism*
  • Capsicum / enzymology
  • Capsicum / genetics*
  • Capsicum / metabolism*
  • Gene Expression Profiling*
  • Genes, Plant / genetics*
  • Genetic Markers / genetics
  • Hydro-Lyases / genetics
  • Hydro-Lyases / metabolism
  • Molecular Sequence Annotation
  • Sequence Analysis, RNA
  • Transaminases / genetics
  • Transaminases / metabolism

Substances

  • Genetic Markers
  • Transaminases
  • prephenate aminotransferase
  • Hydro-Lyases
  • dihydroxyacid dehydratase
  • Capsaicin

Grants and funding

This work was supported by the Natural Science Foundation of Guangdong Province (NO. 10151064201000056) and the Principal Foundation of South China Agricultural University. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.