Background: Psoriatic arthritis (PSA) is an inflammatory joint disease associated with psoriasis (PSO) that can be easily missed. Existing PSA screening tools ignore objective serologic indicators. The aim of this study was to develop a disease screening model and the Psoriatic Arthritis Inflammation Index (PSAII) based on serologic data to enhance the efficiency of PSA screening.
Method: A total of 719 PSO and PSA patients from the National Health and Nutrition Examination Survey (NHANES) (as training set and test set) and 135 PSO and PSA patients who were seen at The First Affiliated Hospital of Zhejiang Chinese Medical University (as external validation set) were selected, 31 indicators for these patients were collected as potential input features for the model. Least Absolute Shrinkage and Selection Operator (LASSO) was used to identify PSA-related features. Five models of logistic regression (LR), random forest, k-nearest neighbor, gradient augmentation and neural network were developed in the training set using quintuple cross validation. And we developed PSAII based on the results of LASSO regression and weights of logistic model parameters. All performance metrics are derived on the test set and the external validation set.
Results: Five variables were selected to build models, including age, lymphocyte percentage, neutrophil count, eosinophilic count, and C-reactive protein. In all established models, the LR model performed the best, with an Area Under Curve (AUC) of 0.87 (95% confidence interval (CI): 0.83-0.90) on the test set; on the external validation set the AUC was 0.82 (95%CI: 0.74-0.90). The PSAII formula was PSAII = percentage of lymphocytes × C-reactive protein/(neutrophil count × eosinophilic count × 10). The AUC of PSAII in the test is 0.93 (95%CI: 0.88-0.97), and the cutoff value is 18. The AUC of the external validation set is 0.81 (95%CI: 0.72-0.89).
Conclusions: This study developed and validated five models to assist screening for PSA by analyzing serum data from NHANES and Chinese populations. The LR model demonstrated the best performance. We created PSAII for PSA screening. However, the high false positive rate of PSAII makes it necessary to combine it with other PSA screening tools when applied.
Keywords: PSAII; machine learning; psoriasis; psoriatic arthritis; screening.
Copyright © 2024 Lin, Pan, Shi, Wu, Dou, Lin and Cao.