Global transcriptome analysis and identification of a CONSTANS-like gene family in the orchid Erycina pusilla

Planta. 2013 Jun;237(6):1425-41. doi: 10.1007/s00425-013-1850-z. Epub 2013 Feb 16.

Abstract

The high chromosome numbers, polyploid genomes, and long juvenile phases of most ornamental orchid species render functional genomics difficult and limit the discovery of genes influencing horticultural traits. The orchid Erycina pusilla has a low chromosome number (2n = 12) and flowers in vitro within 1 year, making it a standout candidate for use as a model orchid. However, transcriptomic and genomic information from E. pusilla remains limited. In this study, next-generation sequencing (NGS) technology was used to identify 90,668 unigenes by de novo assembly. These unigenes were annotated functionally and analyzed with regard to their gene ontology (GO), clusters of orthologous groups (COG), and KEGG pathways. To validate the discovery methods, a homolog of CONSTANS (CO), one of the key genes in the flowering pathway, was further analyzed. The Arabidopsis CO-Like (COL) amino acid sequences were used to screen for homologs in the E. pusilla transcriptome database. Specific primers to the homologous unigenes were then used to isolate BAC clones, which were sequenced to identify 12 E. pusilla CO-like (EpCOL) full-length genes. Based on sequence homology, domain structure, and phylogenetic analysis, these EpCOL genes were divided into four groups. Four EpCOLs fused with GFP were localized in the nucleus. Some EpCOL genes were regulated by light. These results demonstrate that nascent E. pusilla resources (transcriptome and BAC library) can be used to investigate the E. pusilla photoperiod-dependent flowering genes. In future, this strategy can be applied to other biological processes, marketable traits, and molecular breeding in this model orchid.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis Proteins / genetics*
  • Circadian Rhythm / genetics
  • DNA-Binding Proteins / genetics*
  • Gene Expression Profiling*
  • Gene Expression Regulation, Plant*
  • Gene Ontology
  • Genes, Plant / genetics*
  • Green Fluorescent Proteins / metabolism
  • Molecular Sequence Annotation
  • Multigene Family*
  • Nucleotide Motifs / genetics
  • Orchidaceae / genetics*
  • Phylogeny
  • Plant Proteins / genetics
  • Plant Proteins / metabolism
  • Protein Transport
  • Sequence Analysis, DNA
  • Subcellular Fractions / metabolism
  • Transcription Factors / genetics*
  • Transcriptome / genetics

Substances

  • Arabidopsis Proteins
  • CONSTANS protein, Arabidopsis
  • DNA-Binding Proteins
  • Plant Proteins
  • Transcription Factors
  • Green Fluorescent Proteins