Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data

Weil R Lai; Mark D Johnson; Raju Kucherlapati; Peter J Park

doi:10.1093/bioinformatics/bti611

Bioinformatics. 2005 Oct 1;21(19):3763-70. doi: 10.1093/bioinformatics/bti611. Epub 2005 Aug 4.

Authors

Weil R Lai¹, Mark D Johnson, Raju Kucherlapati, Peter J Park

Affiliation

¹ Harvard-Partners Center for Genetics and Genomics, 77 Avenue Louis Pasteur, Boston, MA 02115, USA.

Abstract

Motivation: Array Comparative Genomic Hybridization (CGH) can reveal chromosomal aberrations in genomic DNA. These amplifications and deletions at the DNA level are important in the pathogenesis of cancer and other diseases. Although a large number of approaches have been proposed for analyzing large array CGH datasets, the relative merits of these methods in practice are not clear.

Results: We compare 11 algorithms for analyzing array CGH data. These include both segment detection methods and smoothing methods, based on diverse techniques such as mixture models, Hidden Markov Models, maximum likelihood, regression, wavelets and genetic algorithms. Using simulated data, we compute Receiver Operating Characteristic (ROC) curves to quantify sensitivity and specificity at various signal-to-noise ratios and for abnormalities of different sizes. We also characterize the algorithms' performance on chromosomal regions of interest in a real dataset obtained from patients with Glioblastoma Multiforme. Although comparisons of this type are difficult because the parameter choices for each method may be sub-optimal, they nevertheless reveal general characteristics that are helpful to the biological investigator.
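
To make the ROC-based evaluation concrete, the following is a minimal sketch of how such a comparison could be set up. It is not the authors' simulation or any of the 11 algorithms. The function names, the noise level, the single centered gain, and the use of a plain moving average as a stand-in smoothing method are all illustrative assumptions.

```python
import numpy as np

def simulate_profile(n_probes=100, width=20, snr=2.0, noise_sd=0.25, seed=0):
    """Simulate log-ratios for one chromosome with a single centered gain.

    Assumption: SNR is defined here as (aberration height) / (noise sd).
    Returns (log_ratios, truth), where truth marks the aberrant probes.
    """
    rng = np.random.default_rng(seed)
    truth = np.zeros(n_probes, dtype=bool)
    start = (n_probes - width) // 2
    truth[start:start + width] = True
    signal = np.where(truth, snr * noise_sd, 0.0)
    return signal + rng.normal(0.0, noise_sd, n_probes), truth

def moving_average(x, window=5):
    """Hypothetical stand-in for a smoothing method (zero-padded at the ends)."""
    kernel = np.ones(window) / window
    return np.convolve(x, kernel, mode="same")

def roc_curve(scores, truth):
    """Sweep a threshold from high to low; return (fpr, tpr) starting at (0, 0)."""
    thresholds = np.unique(scores)[::-1]
    tpr = np.array([(scores[truth] >= t).mean() for t in thresholds])
    fpr = np.array([(scores[~truth] >= t).mean() for t in thresholds])
    return np.concatenate(([0.0], fpr)), np.concatenate(([0.0], tpr))

def auc(fpr, tpr):
    """Area under the ROC curve by the trapezoidal rule."""
    return float(np.sum(np.diff(fpr) * (tpr[1:] + tpr[:-1]) / 2.0))

if __name__ == "__main__":
    # Compare raw and smoothed scores across SNR levels and aberration widths.
    for snr in (1.0, 2.0, 4.0):
        for width in (5, 20, 40):
            x, truth = simulate_profile(width=width, snr=snr)
            raw = auc(*roc_curve(x, truth))
            smooth = auc(*roc_curve(moving_average(x), truth))
            print(f"snr={snr} width={width} raw={raw:.3f} smoothed={smooth:.3f}")
```

A real comparison would average over many simulated profiles per setting and would replace `moving_average` with each method under test; a single seed, as used here, only illustrates the mechanics.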

Publication types

Comparative Study
Evaluation Study
Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Algorithms*
Chromosome Mapping / methods*
Gene Amplification / genetics*
Gene Deletion*
Nucleic Acid Hybridization / methods*
Oligonucleotide Array Sequence Analysis / methods*
Reproducibility of Results
Sensitivity and Specificity
Software
Software Validation
