Background: Patellofemoral osteoarthritis (PFOA) has a high prevalence and is assessed on axial radiography of the patellofemoral joint (PFJ). A deep learning (DL)-based approach could help radiologists automatically diagnose and grade PFOA via interpreting axial radiographs.
Purpose: To develop and assess the performance of a DL-based approach for diagnosing and grading PFOA on axial radiographs.
Material and methods: A total of 1280 (dataset 1) axial radiographs were retrospectively collected and utilized to develop the high-resolution network (HRNet)-based classification models. The ground truth was the interpretation from two experienced radiologists in consensus according to the K-L grading system. A binary-class model was trained to diagnose the presence (K-L 2∼4) or absence (K-L 0∼1) of PFOA. A multi-class model was used to grade the stage of PFOA, i.e. from K-L 0 to K-L 4. Model performances were evaluated using the receiver operating characteristics (ROC), confusion matrix, and the corresponding evaluation metrics (positive predictive value [PPV], negative predictive value [NPV], F1 score, sensitivity, specificity, accuracy) of the internal test set (n = 129) from dataset 1 and an external validation set (dataset 2, n = 187).
Results: For the binary-class model, the area under the curve (AUC) was 0.91 in the internal test set and 0.90 in the external validation set. For grading PFOA, moderate to severe stage of PFOA exhibited a good performance in these two datasets (AUC = 0.91-0.98, PPV = 0.69-0.90, NPV = 0.92-0.99, F1 score = 0.72-0.87, sensitivity = 0.75-0.87, specificity = 0.90-0.99, accuracy = 0.87-0.98).
Conclusion: The HRNet-based approach performed well in diagnosing and grading radiographic PFOA, especially for the moderate to severe cases.
Keywords: Osteoarthritis; deep learning; patellofemoral joint; radiography.