Detection of DNA copy number alterations using penalized least squares regression

Bioinformatics. 2005 Oct 15;21(20):3811-7. doi: 10.1093/bioinformatics/bti646. Epub 2005 Aug 30.

Abstract

Motivation: Genomic DNA copy number alterations are characteristic of many human diseases including cancer. Various techniques and platforms have been proposed to allow researchers to partition the whole genome into segments where copy numbers change between contiguous segments, and subsequently to quantify DNA copy number alterations. In this paper, we incorporate the spatial dependence of DNA copy number data into a regression model and formalize the detection of DNA copy number alterations as a penalized least squares regression problem. In addition, we use a stationary bootstrap approach to estimate the statistical significance and false discovery rate.

Results: The proposed method is studied by simulations and illustrated by an application to an extensively analyzed dataset in the literature. The results show that the proposed method can correctly detect the numbers and locations of the true breakpoints while appropriately controlling the false positives.

Availability: http://bioinformatics.med.yale.edu/DNACopyNumber

Contact: [email protected]

Supplementary information: http://bioinformatics.med.yale.edu/DNACopyNumber.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms*
  • Base Sequence
  • Chromosome Mapping / methods*
  • Gene Dosage / genetics*
  • Genetic Variation / genetics
  • Least-Squares Analysis
  • Models, Genetic*
  • Models, Statistical
  • Molecular Sequence Data
  • Regression Analysis
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*