Extracting information from cDNA arrays

Chaos. 2001 Mar;11(1):98-107. doi: 10.1063/1.1336843.

Abstract

High-density DNA arrays allow measurements of gene expression levels (messenger RNA abundance) for thousands of genes simultaneously. We analyze arrays with spotted cDNA used in monitoring of expression profiles. A dilution series of a mouse liver probe is deployed to quantify the reproducibility of expression measurements. Saturation effects limit the accessible signal range at high intensities. Additive noise and outshining from neighboring spots dominate at low intensities. For repeated measurements on the same filter and filter-to-filter comparisons correlation coefficients of 0.98 are found. Next we consider the clustering of gene expression time series from stimulated human fibroblasts which aims at finding co-regulated genes. We analyze how preprocessing, the distance measure, and the clustering algorithm affect the resulting clusters. Finally we discuss algorithms for the identification of transcription factor binding sites from clusters of co-regulated genes. (c) 2001 American Institute of Physics.