PAS domains are omnipresent building blocks of multidomain proteins in all domains of life. Bacteria possess a variety of PAS domains in intracellular proteins and the related Cache domains in periplasmic or extracellular proteins. PAS and Cache domains are predominant in sensory systems, often carry cofactors or bind ligands, and serve as dimerization domains in protein association. To aid our understanding of the wide distribution of these domains, we analyzed the proteome of the opportunistic human pathogen Pseudomonas aeruginosa PAO1 in silico. The ability of this bacterium to survive under different environmental conditions, to switch between planktonic and sessile/biofilm lifestyle, or to evade stresses, notably involves c-di-GMP regulatory proteins or depends on sensory pathways involving multidomain proteins that possess PAS or Cache domains. Maximum likelihood phylogeny was used to group PAS and Cache domains on the basis of amino acid sequence. Conservation of cofactor- or ligand-coordinating amino acids aided by structure-based comparison was used to inform function. The resulting classification presented here includes PAS domains that are candidate binders of carboxylic acids, amino acids, fatty acids, flavin adenine dinucleotide (FAD), 4-hydroxycinnamic acid, and heme. These predictions are put in context to previously described phenotypic data, often generated from deletion mutants. The analysis predicts novel functions for sensory proteins and sheds light on functional diversification in a large set of proteins with similar architecture. IMPORTANCE To adjust to a variety of life conditions, bacteria typically use multidomain proteins, where the modular structure allows functional differentiation. Proteins responding to environmental cues and regulating physiological responses are found in chemotaxis pathways that respond to a wide range of stimuli to affect movement. Environmental cues also regulate intracellular levels of cyclic-di-GMP, a universal bacterial secondary messenger that is a key determinant of bacterial lifestyle and virulence. We study Pseudomonas aeruginosa, an organism known to colonize a broad range of environments that can switch lifestyle between the sessile biofilm and the planktonic swimming form. We have investigated the PAS and Cache domains, of which we identified 101 in 70 Pseudomonas aeruginosa PAO1 proteins, and have grouped these by phylogeny with domains of known structure. The resulting data set integrates sequence analysis and structure prediction to infer ligand or cofactor binding. With this data set, functional predictions for PAS and Cache domain-containing proteins are made.
Keywords: Cache domain; PAS domain; Pseudomonas; cofactors; phylogenetic analysis; phylogeny; sensory transduction processes.