Simultaneous Assignment and Structure Determination of Proteins From Sparsely Labeled NMR Datasets

Front Mol Biosci. 2021 Nov 24;8:774394. doi: 10.3389/fmolb.2021.774394. eCollection 2021.

Abstract

Sparsely labeled NMR samples provide opportunities to study larger biomolecular assemblies than is traditionally done by NMR. This requires new computational tools that can handle the sparsity and ambiguity in the NMR datasets. The MELD (modeling employing limited data) Bayesian approach was assessed to be the best performing in predicting structures from sparsely labeled NMR data in the 13th edition of the Critical Assessment of Structure Prediction (CASP) event, and limitations of the methodology were also noted. In this report, we evaluate the nature and difficulty of modeling unassigned sparsely labeled NMR datasets and report on an improved methodological pipeline leading to higher-accuracy predictions. We benchmark our methodology against the NMR datasets provided by CASP 13.

Keywords: MELD; REMD; molecular dynamics; protein structure determination; sparse NMR.