Determination of tag density required for digital transcriptome analysis: application to an androgen-sensitive prostate cancer model

Proc Natl Acad Sci U S A. 2008 Dec 23;105(51):20179-84. doi: 10.1073/pnas.0807121105. Epub 2008 Dec 16.

Abstract

High-throughput sequencing has rapidly gained popularity for transcriptome analysis in mammalian cells because of its ability to generate digital and quantitative information on annotated genes and to detect transcripts and mRNA isoforms. Here, we described a double-random priming method for deep sequencing to profile double poly(A)-selected RNA from LNCaP cells before and after androgen stimulation. From approximately 20 million sequence tags, we uncovered 71% of annotated genes and identified hormone-regulated gene expression events that are highly correlated with quantitative real time PCR measurement. A fraction of the sequence tags were mapped to constitutive and alternative splicing events to detect known and new mRNA isoforms expressed in the cell. Finally, curve fitting was used to estimate the number of tags necessary to reach a "saturating" discovery rate among individual applications. This study provides a general guide for analysis of gene expression and alternative splicing by deep sequencing.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing
  • Androgens / pharmacology
  • Animals
  • Automation / methods*
  • Cell Line, Tumor
  • Expressed Sequence Tags*
  • Gene Expression Profiling / methods*
  • Male
  • Polymerase Chain Reaction
  • Prostatic Neoplasms / genetics*
  • RNA, Messenger / analysis

Substances

  • Androgens
  • RNA, Messenger