Applying family analyses to electronic health records to facilitate genetic research

Bioinformatics. 2018 Feb 15;34(4):635-642. doi: 10.1093/bioinformatics/btx569.

Abstract

Motivation: Pedigree analysis is a longstanding and powerful approach to gain insight into the underlying genetic factors in human health, but identifying, recruiting and genotyping families can be difficult, time consuming and costly. Development of high throughput methods to identify families and foster downstream analyses are necessary.

Results: This paper describes simple methods that allowed us to identify 173 368 family pedigrees with high probability using basic demographic data available in most electronic health records (EHRs). We further developed and validate a novel statistical method that uses EHR data to identify families more likely to have a major genetic component to their diseases risk. Lastly, we showed that incorporating EHR-linked family data into genetic association testing may provide added power for genetic mapping without additional recruitment or genotyping. The totality of these results suggests that EHR-linked families can enable classical genetic analyses in a high-throughput manner.

Availability and implementation: Pseudocode is provided as supplementary information.

Contact: [email protected].

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Chromosome Mapping
  • Databases, Factual
  • Electronic Health Records*
  • Female
  • Genetic Association Studies
  • Genetic Diseases, Inborn
  • Genetic Research*
  • Genome, Human*
  • Humans
  • Male
  • Middle Aged
  • Pedigree*