De novo assembly, functional annotation, and marker development of Asian pear (Pyrus pyrifolia) fruit transcriptome through massively parallel sequencing

Genet Mol Res. 2015 Dec 28;14(4):18344-55. doi: 10.4238/2015.December.23.22.

Abstract

This study investigated the Asian pear transcriptome using the RNA-Seq normalized fruit cDNA library to create a transcriptomic resource for unigene and marker discovery. Following the removal of lowquality reads, 127,085,054 trimmed reads were assembled de novo to yield 37,649 non-redundant unigenes with an average length of 599 bp. Alternative splicing events were detected in 4121 contigs. A total of 30,560 single nucleotide polymorphisms (SNPs) and 7443 simple sequence repeat (SSR) makers were obtained. Approximately 21,449 (56.9%) unigenes were categorized into three gene ontology groups; 3682 (9.8%) were classified into 25 cluster of orthologous groups; and 10,451 (27.8%) were assigned to six Kyoto Encyclopedia of Genes and Genomes pathways. Differentially expressed genes were investigated using the reads per kilobase of the exon model per million reads methodology. A total of 546 unigenes showed significant differences in expression levels at different fruit developmental stages. Gene ontology categories associated with various aspects, including carbohydrate metabolic processes, transmembrane transport, and signal transduction, were enriched with genes with divergent expressions. These Pyrus pyrifolia transcriptome data provide a rich resource for the discovery and identification of new genes. Furthermore, the numerous putative SSRs and SNPs detected in this study will be important resources for the future development of a linkage map or of marker-assisted breeding programs for the Asian pear.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cluster Analysis
  • Computational Biology / methods
  • Databases, Genetic
  • Fruit / genetics
  • Gene Expression Profiling*
  • Gene Expression Regulation, Plant
  • Gene Ontology
  • Genetic Markers*
  • High-Throughput Nucleotide Sequencing
  • Microsatellite Repeats
  • Molecular Sequence Annotation
  • Polymorphism, Single Nucleotide
  • Pyrus / genetics*
  • Transcriptome*

Substances

  • Genetic Markers