DFI: gene feature discovery in RNA-seq experiments from multiple sources

BMC Genomics. 2012;13 Suppl 8(Suppl 8):S11. doi: 10.1186/1471-2164-13-S8-S11. Epub 2012 Dec 17.

Abstract

Background: Differential expression detection for RNA-seq experiments is often biased by normalization algorithms due to their sensitivity to parametric assumptions on the gene count distributions, extreme values of gene expression, gene length and total number of sequence reads.

Results: To overcome limitations of current methodologies, we developed Differential Feature Index (DFI), a non-parametric method for characterizing distinctive gene features across any number of diverse RNA-seq experiments without inter-sample normalization. Validated with qRT-PCR datasets, DFI accurately detected differentially expressed genes regardless of expression levels and consistent with tissue selective expression. Accuracy of DFI was very similar to the currently accepted methods: EdgeR, DESeq and Cuffdiff.

Conclusions: In this study, we demonstrated that DFI can efficiently handle multiple groups of data simultaneously, and identify differential gene features for RNA-Seq experiments from different laboratories, tissue types, and cell origins, and is robust to extreme values of gene expression, size of the datasets and gene length.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Brain / metabolism
  • Cluster Analysis
  • Databases, Factual
  • Humans
  • Leukemia / genetics
  • Leukemia / metabolism
  • Liver / metabolism
  • Liver Neoplasms / genetics
  • Liver Neoplasms / metabolism
  • RNA / genetics
  • RNA / metabolism*
  • Sequence Alignment
  • Sequence Analysis, RNA*

Substances

  • RNA