Identification of the most important features of knee osteoarthritis structural progressors using machine learning methods

Ther Adv Musculoskelet Dis. 2020 Aug 13:12:1759720X20933468. doi: 10.1177/1759720X20933468. eCollection 2020.

Abstract

Objectives: The aim was to identify the most important features of structural knee osteoarthritis (OA) progressors and classification using machine learning methods.

Methods: Participants, features and outcomes were from the Osteoarthritis Initiative. Features were from baseline (1107), including articular knee tissues (135) assessed by quantitative magnetic resonance imaging (MRI). OA progressors were ascertained by four outcomes: cartilage volume loss in medial plateau at 48 and 96 months (Prop_CV_48M, 96M), Kellgren-Lawrence (KL) grade ⩾ 2 and medial joint space narrowing (JSN) ⩾ 1 at 48 months. Six feature selection models were used to identify the common features in each outcome. Six classification methods were applied to measure the accuracy of the selected features in classifying the subjects into progressors and non-progressors. Classification of the best features was done using an automatic machine learning interface and the area under the curve (AUC). To prioritize the top five features, sparse partial least square (sPLS) method was used.

Results: For the classification of the best common features in each outcome, Multi-Layer Perceptron (MLP) achieved the highest AUC in Prop_CV_96M, KL and JSN (0.80, 0.88, 0.95), and Gradient Boosting Machine for Prop_CV_48M (0.70). sPLS showed the baseline top five features to predict knee OA progressors are the joint space width, mean cartilage thickness of the medial tibial plateau and sub-regions and JSN.

Conclusion: In this comprehensive study using a large number of features (n = 1107) and MRI outcomes in addition to radiological outcomes, we identified the best features and classification methods for knee OA structural progressors. Data revealed baseline X-ray and MRI-based features could predict early OA knee progressors and that MLP is the best classification method.

Keywords: classification; feature selection; joint space narrowing; machine learning; magnetic resonance imaging; osteoarthritis; prediction.