Proteins of Unknown Biochemical Function: A Persistent Problem and a Roadmap to Help Overcome It

Plant Physiol. 2015 Nov;169(3):1436-42. doi: 10.1104/pp.15.00959. Epub 2015 Aug 12.

Abstract

The number of sequenced genomes is rapidly increasing, but functional annotation of the genes in these genomes lags far behind. Even in Arabidopsis (Arabidopsis thaliana), only approximately 40% of enzyme- and transporter-encoding genes have credible functional annotations, and this number is even lower in nonmodel plants. Functional characterization of unknown genes is a challenge, but various databases (e.g. for protein localization and coexpression) can be mined to provide clues. If homologous microbial genes exist-and about one-half the genes encoding unknown enzymes and transporters in Arabidopsis have microbial homologs-cross-kingdom comparative genomics can powerfully complement plant-based data. Multiple lines of evidence can strengthen predictions and warrant experimental characterization. In some cases, relatively quick tests in genetically tractable microbes can determine whether a prediction merits biochemical validation, which is costly and demands specialized skills.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Computational Biology / methods
  • Databases, Genetic
  • Gene Expression Regulation, Plant / physiology*
  • Genome, Plant
  • Genomics / methods
  • Metabolic Networks and Pathways / genetics
  • Plant Proteins / chemistry
  • Plant Proteins / genetics
  • Plant Proteins / metabolism*
  • Plants / genetics
  • Plants / metabolism*
  • Protein Transport
  • Systems Biology / methods
  • Transcriptome

Substances

  • Plant Proteins