Novel integration of hospital electronic medical records and gene expression measurements to identify genetic markers of maturation

Pac Symp Biocomput. 2008:243-54.

Abstract

Traditionally, the elucidation of genes involved in maturation and aging has been studied in a temporal fashion by examining gene expression at different time points in an organism's life as well as by knocking out, knocking in, and mutating genes thought to be involved. Here, we propose an in silico method to combine clinical electronic medical record (EMR) data and gene expression measurements in the context of disease to identify genes that may be involved in the process of human maturation and aging. First we show that absolute lymphocyte count may serve as a biomarker for maturation by using statistical methods to compare trends among different clinical laboratory tests in response to an increase in age. We then propose using the rate of decay for absolute lymphocyte count across 12 diseases as a proxy for differences in aging. We correlate the differing rates with gene expression across the same diseases to find maturation/aging related genes. Among the 53 genes with strongest correlations between expression profile and change in rate of decay, we found genes previously implicated in the process of aging, including MGMT (DNA repair), TERF2 (telomere stability), POLD1 (DNA replication and repair), and POLG (mtDNA replication).

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Aging / blood
  • Aging / genetics*
  • Analysis of Variance
  • Child
  • Child, Preschool
  • Computational Biology
  • Gene Expression Profiling / statistics & numerical data*
  • Genetic Markers*
  • Hospital Records*
  • Humans
  • Infant
  • Infant, Newborn
  • Lymphocyte Count
  • Medical Records Systems, Computerized*

Substances

  • Genetic Markers