A topological classifier to characterize brain states: When shape matters more than variance

PLoS One. 2023 Oct 2;18(10):e0292049. doi: 10.1371/journal.pone.0292049. eCollection 2023.

Abstract

Despite the remarkable accuracies attained by machine learning classifiers to separate complex datasets in a supervised fashion, most of their operation falls short to provide an informed intuition about the structure of data, and, what is more important, about the phenomena being characterized by the given datasets. By contrast, topological data analysis (TDA) is devoted to study the shape of data clouds by means of persistence descriptors and provides a quantitative characterization of specific topological features of the dataset under scrutiny. Here we introduce a novel TDA-based classifier that works on the principle of assessing quantifiable changes on topological metrics caused by the addition of new input to a subset of data. We used this classifier with a high-dimensional electro-encephalographic (EEG) dataset recorded from eleven participants during a previous decision-making experiment in which three motivational states were induced through a manipulation of social pressure. We calculated silhouettes from persistence diagrams associated with each motivated state with a ready-made band-pass filtered version of these signals, and classified unlabeled signals according to their impact on each reference silhouette. Our results show that in addition to providing accuracies within the range of those of a nearest neighbour classifier, the TDA classifier provides formal intuition of the structure of the dataset as well as an estimate of its intrinsic dimension. Towards this end, we incorporated variance-based dimensionality reduction methods to our dataset and found that in most cases the accuracy of our TDA classifier remains essentially invariant beyond a certain dimension.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Brain*
  • Cluster Analysis
  • Humans
  • Machine Learning*

Grants and funding

This work was supported by the CERCA Programme of the Catalan Government (I. Cos); by the European Union’s Horizon 2020 Framework Programme for Research and Innovation under the Specific Grant Agreements No 945539, Human Brain Project SGA3 (I. Cos), and by MCIN/AEI/10.13039/501100011033 under grant PRE2020-094372 (A. Ferrà) and projects PID2019-105093GB-I00 (I. Cos, A. Ferrà) and PID2020-117971GB-C22 (C. Casacuberta).