A prioritization analysis of disease association by data-mining of functional annotation of human genes

Genomics. 2012 Jan;99(1):1-9. doi: 10.1016/j.ygeno.2011.10.002. Epub 2011 Oct 14.

Abstract

Complex diseases result from contributions of multiple genes that act in concert through pathways. Here we present a method to prioritize novel candidates of disease-susceptibility genes depending on the biological similarities to the known disease-related genes. The extent of disease-susceptibility of a gene is prioritized by analyzing seven features of human genes captured in H-InvDB. Taking rheumatoid arthritis (RA) and prostate cancer (PC) as two examples, we evaluated the efficiency of our method. Highly scored genes obtained included TNFSF12 and OSM as candidate disease genes for RA and PC, respectively. Subsequent characterization of these genes based upon an extensive literature survey reinforced the validity of these highly scored genes as possible disease-susceptibility genes. Our approach, Prioritization ANalysis of Disease Association (PANDA), is an efficient and cost-effective method to narrow down a large set of genes into smaller subsets that are most likely to be involved in the disease pathogenesis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arthritis, Rheumatoid / genetics
  • Cost-Benefit Analysis
  • Cytokine TWEAK
  • Data Mining
  • Genetic Association Studies / economics
  • Genetic Association Studies / methods*
  • Genetic Predisposition to Disease / genetics*
  • Genomics / methods*
  • Humans
  • Male
  • Oncostatin M / genetics
  • Prostatic Neoplasms / genetics
  • Tumor Necrosis Factors / genetics

Substances

  • Cytokine TWEAK
  • OSM protein, human
  • TNFSF12 protein, human
  • Tumor Necrosis Factors
  • Oncostatin M