HEALER: homomorphic computation of ExAct Logistic rEgRession for secure rare disease variants analysis in GWAS

Bioinformatics. 2016 Jan 15;32(2):211-8. doi: 10.1093/bioinformatics/btv563. Epub 2015 Oct 6.

Abstract

Motivation: Genome-wide association studies (GWAS) have been widely used in discovering the association between genotypes and phenotypes. Human genome data contain valuable but highly sensitive information. Unprotected disclosure of such information might put individual's privacy at risk. It is important to protect human genome data. Exact logistic regression is a bias-reduction method based on a penalized likelihood to discover rare variants that are associated with disease susceptibility. We propose the HEALER framework to facilitate secure rare variants analysis with a small sample size.

Results: We target at the algorithm design aiming at reducing the computational and storage costs to learn a homomorphic exact logistic regression model (i.e. evaluate P-values of coefficients), where the circuit depth is proportional to the logarithmic scale of data size. We evaluate the algorithm performance using rare Kawasaki Disease datasets.

Availability and implementation: Download HEALER at http://research.ucsd-dbmi.org/HEALER/ CONTACT: [email protected]

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Genetic Privacy*
  • Genetic Variation*
  • Genome, Human
  • Genome-Wide Association Study*
  • Humans
  • Mucocutaneous Lymph Node Syndrome / genetics
  • Rare Diseases / genetics*