Comparing distance metrics for rotation using the k-nearest neighbors algorithm for entropy estimation

J Comput Chem. 2014 Feb 15;35(5):377-85. doi: 10.1002/jcc.23504. Epub 2013 Dec 5.

Abstract

Distance metrics facilitate a number of methods for statistical analysis. For statistical mechanical applications, it is useful to be able to compute the distance between two different orientations of a molecule. However, a number of distance metrics for rotation have been employed, and in this study, we consider different distance metrics and their utility in entropy estimation using the k-nearest neighbors (KNN) algorithm. This approach shows a number of advantages over entropy estimation using a histogram method, and the different approaches are assessed using uniform randomly generated data, biased randomly generated data, and data from a molecular dynamics (MD) simulation of bulk water. The results identify quaternion metrics as superior to a metric based on the Euler angles. However, it is demonstrated that samples from MD simulation must be independent for effective use of the KNN algorithm and this finding impacts any application to time series data.

Keywords: distance metric; entropy; k-nearest neighbors; molecular dynamics; solvation; statistical mechanics.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Entropy*
  • Molecular Dynamics Simulation
  • Rotation*
  • Water / chemistry

Substances

  • Water