Input representations and classification strategies for automated human gait analysis

Djordje Slijepcevic; Matthias Zeppelzauer; Caterine Schwab; Anna-Maria Raberger; Christian Breiteneder; Brian Horsak

doi:10.1016/j.gaitpost.2019.10.021

Gait Posture. 2020 Feb:76:198-203. doi: 10.1016/j.gaitpost.2019.10.021. Epub 2019 Nov 9.

Authors

Djordje Slijepcevic¹, Matthias Zeppelzauer², Caterine Schwab³, Anna-Maria Raberger³, Christian Breiteneder⁴, Brian Horsak³

Affiliations

¹ St. Pölten University of Applied Sciences, Institute for Creative Media Technologies, St. Pölten, Austria.
² St. Pölten University of Applied Sciences, Institute for Creative Media Technologies, St. Pölten, Austria.
³ St. Pölten University of Applied Sciences, Institute of Health Sciences, St. Pölten, Austria.
⁴ TU Wien, Institute of Visual Computing and Human-Centered Technology, Vienna, Austria.

PMID: 31862670
DOI: 10.1016/j.gaitpost.2019.10.021

Abstract

Background: Quantitative gait analysis produces a vast amount of data, which can be difficult to analyze. Automated gait classification based on machine learning techniques bears the potential to support clinicians in comprehending these complex data. Even though these techniques are already frequently used in the scientific community, there is no clear consensus on how the data need to be preprocessed and arranged to ensure optimal classification accuracy.

Research question: Is there a data aggregation and preprocessing workflow that maximizes classification accuracy?

Methods: Based on our previous work on automated classification of ground reaction force (GRF) data, a sequential setup was followed: first, two aggregation methods, early fusion and late fusion, were compared; second, based on the best aggregation method identified, the expressiveness of different combinations of signal representations was investigated. The employed dataset included data from 910 subjects, with four gait disorder classes and one healthy control group. The machine learning pipeline comprised principal component analysis (PCA), z-standardization and a support vector machine (SVM).
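As a rough illustration of the pipeline named in the Methods, the sketch below chains PCA, z-standardization and an SVM using scikit-learn. It is not the authors' implementation: the step order simply follows the order in which the abstract lists the components, and the number of principal components, the RBF kernel, the helper name build_pipeline and the synthetic data are all assumptions made for the example.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def build_pipeline(n_components=5):
    """Hypothetical helper: PCA, then z-standardization, then an SVM.

    n_components and the RBF kernel are assumptions; the abstract
    does not state either.
    """
    return Pipeline([
        ("pca", PCA(n_components=n_components)),
        ("zscore", StandardScaler()),
        ("svm", SVC(kernel="rbf")),
    ])


# Synthetic stand-in for GRF waveforms: 60 trials, 101 samples each,
# three made-up classes shifted apart so that they are separable.
rng = np.random.default_rng(0)
y = np.repeat(np.arange(3), 20)
X = rng.normal(size=(60, 101)) + 2.0 * y[:, None]

model = build_pipeline().fit(X, y)
predictions = model.predict(X)  # one predicted class label per trial
```

Each row of X stands for a single trial, so the pipeline returns one prediction per trial; how those per-trial predictions are combined per subject is the aggregation question the study addresses.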

Results: The late fusion aggregation, i.e., utilizing majority voting on the classifier's predictions, performed best. In addition, the use of derived signal representations (relative changes and signal differences) seems to be advantageous as well.
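Late fusion by majority voting, as described in the Results, can be sketched as follows: each trial of a subject is classified separately, and the most frequent predicted label becomes the subject-level decision. The function names, the class labels and the example data are hypothetical, and the abstract does not say how ties are resolved, so this sketch simply takes whichever label Counter reports first.

```python
from collections import Counter


def majority_vote(trial_predictions):
    """Return the most frequent label among per-trial predictions."""
    if not trial_predictions:
        raise ValueError("at least one trial prediction is required")
    # Tie handling is an assumption: the abstract does not specify it.
    return Counter(trial_predictions).most_common(1)[0][0]


def late_fusion(predictions_by_subject):
    """Map each subject to the majority label over that subject's trials."""
    return {
        subject: majority_vote(labels)
        for subject, labels in predictions_by_subject.items()
    }


# Hypothetical per-trial classifier outputs for two subjects.
example = {
    "subject_01": ["class_A", "class_A", "healthy", "class_A"],
    "subject_02": ["healthy", "class_B", "healthy"],
}
fused = late_fusion(example)
```

Early fusion, by contrast, would combine a subject's trials into a single input before classification, so the classifier would be called once per subject rather than once per trial.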

Significance: Our results indicate that great caution is needed when data preprocessing and aggregation methods are selected, as these can affect classification accuracy. These results are intended to serve future studies as a guideline for choosing data aggregation and preprocessing techniques.

Keywords: Gait classification; Gait disorders; Ground reaction force; Machine learning; Support vector machine.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Gait / physiology*
Gait Analysis / methods*
Gait Disorders, Neurologic / diagnosis*
Gait Disorders, Neurologic / physiopathology
Humans
Principal Component Analysis
Support Vector Machine*
Young Adult