Identifying nonspecific SAGE tags by context of gene expression

Methods Mol Biol. 2008:387:199-204. doi: 10.1007/978-1-59745-454-4_15.

Abstract

Many serial analysis of gene expression (SAGE) tags can be matched to multiple genes, leading to difficulty in SAGE data interpretation and analysis. As only a subset of genes in the human genome are transcribed in a certain type of tissue/cell, we used microarray expression data from different tissue types to define contexts of gene expression and to annotate SAGE tags collected from the same or similar tissue sources. To predict the original transcript contributing a nonspecific SAGE tag collected from a particular tissue, we ranked the corresponding genes by their expression levels determined by microarray. We developed a tissue-specific SAGE tag annotation database based on microarray data collected from 73 normal human tissues and 18 cancer tissues and cell lines. The database can be queried online at: http://www.basic.northwestern.edu/SAGE/. The accuracy of this database was confirmed by experimental data.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Nucleic Acid
  • Expressed Sequence Tags*
  • Gene Expression Profiling / methods*
  • Gene Expression Profiling / statistics & numerical data
  • Genome, Human
  • Humans
  • Neoplasms / genetics
  • Oligonucleotide Array Sequence Analysis
  • Tissue Distribution