Exaggerated false positives by popular differential expression methods when analyzing human population samples

Genome Biol. 2022 Mar 15;23(1):79. doi: 10.1186/s13059-022-02648-4.

Abstract

When identifying differentially expressed genes between two conditions using human population RNA-seq samples, we found a phenomenon by permutation analysis: two popular bioinformatics methods, DESeq2 and edgeR, have unexpectedly high false discovery rates. Expanding the analysis to limma-voom, NOISeq, dearseq, and Wilcoxon rank-sum test, we found that FDR control is often failed except for the Wilcoxon rank-sum test. Particularly, the actual FDRs of DESeq2 and edgeR sometimes exceed 20% when the target FDR is 5%. Based on these results, for population-level RNA-seq studies with large sample sizes, we recommend the Wilcoxon rank-sum test.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology* / methods
  • Gene Expression Profiling* / methods
  • Humans
  • RNA-Seq
  • Sample Size
  • Sequence Analysis, RNA / methods