Comprehensive classification of the plant non-specific lipid transfer protein superfamily towards its sequence-structure-function analysis

PeerJ. 2019 Aug 14:7:e7504. doi: 10.7717/peerj.7504. eCollection 2019.

Abstract

Background: Non-specific Lipid Transfer Proteins (nsLTPs) are widely distributed in the plant kingdom and constitute a superfamily of related proteins. Several hundreds of different nsLTP sequences-and counting-have been characterized so far, but their biological functions remain unclear. It has been clear for years that they present a certain interest for agronomic and nutritional issues. Deciphering their functions means collecting and analyzing a variety of data from gene sequence to protein structure, from cellular localization to the physiological role. As a huge and growing number of new protein sequences are available nowadays, extracting meaningful knowledge from sequence-structure-function relationships calls for the development of new tools and approaches. As nsLTPs show high evolutionary divergence, but a conserved common right handed superhelix structural fold, and as they are involved in a large number of key roles in plant development and defense, they are a stimulating case study for validating such an approach.

Methods: In this study, we comprehensively investigated 797 nsLTP protein sequences, including a phylogenetic analysis on canonical protein sequences, three-dimensional structure modeling and functional annotation using several well-established bioinformatics programs. Additionally, two integrative methodologies using original tools were developed. The first was a new method for the detection of (i) conserved amino acid residues involved in structure stabilization and (ii) residues potentially involved in ligand interaction. The second was a structure-function classification based on the evolutionary trace display method using a new tree visualization interface. We also present a new tool for visualizing phylogenetic trees.

Results: Following this new protocol, an updated classification of the nsLTP superfamily was established and a new functional hypothesis for key residues is suggested. Lastly, this work allows a better representation of the diversity of plant nsLTPs in terms of sequence, structure and function.

Keywords: Functional annotation; Homology modeling; Molecular modeling; Multigenic family; Phylogeny; Plant; Structure–function relationships; nsLTP.

Grants and funding

This work was supported by the French National Research Agency (ANR Genoplante) (grant ANR- 08-GENO118) and (ANR-10-BINF-03-04). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.