A Scoring System Developed by a Machine Learning Algorithm to Better Predict Adnexal Torsion

J Minim Invasive Gynecol. 2023 Jun;30(6):486-493. doi: 10.1016/j.jmig.2023.02.008. Epub 2023 Feb 10.

Abstract

Study objective: To establish a clinically relevant prediction score for the diagnosis of adnexal torsion (AT) in women who were operated on for suspected AT.

Design: A retrospective cohort study conducted between 2014 and 2021.

Setting: A large tertiary teaching medical center.

Patients: Women who underwent urgent laparoscopy for suspected AT.

Interventions: Analyses included univariate and multivariate models combined with the machine learning (ML) Random Forest model, which included all information available about the women and reported the accuracy of the model and the importance of each variable. Based on this model, we created a predictive score and evaluated its accuracy by receiver operating characteristic (ROC) curve.

Measurements and main results: A total of 503 women were included in our study, 244 (49%) of whom were diagnosed with AT during the surgery, and 44 (8.8%) cases of necrotic ovary were found. Based on the Random Forrest and multivariate models, the most important preoperative clinical predictive variables for AT were vomiting, left-side complaints, and concurrent pregnancy; cervical tenderness and urinary symptoms decreased the likelihood of surgically confirmed AT. The most important sonographic findings that predicted increased risk of surgically confirmed AT were ovarian edema and decreased vascular flow; in contrast, hemorrhagic corpus luteum decreased the likelihood of surgically confirmed AT. The accuracy of the Random Forest model was 71% for the training set and 68% for the testing set, and the area under the curve for the multivariate model was 0.75 (95% confidence interval [CI] 0.69-0.80). Based on these models, we created a predictive score with a total score that ranges from 4 to 12. The area under the curve for this score was 0.72 (95% CI 0.67-0.76), and the best cutoff for the final score was >5, with a sensitivity, specificity, positive predictive value, and negative predictive value of 64%, 73%, 70%, and 67%, respectively.

Conclusion: Clinical characteristics and ultrasound findings may be incorporated into the emergency room workup of women with suspected AT. ML in this setting has no diagnostic/predictive advantage over the performance of logistic regression methods. Additional prospective studies are needed to confirm the accuracy of this model.

Keywords: Adnexal torsion; Laparoscopy; Prediction; Scoring; Ultrasound.

MeSH terms

  • Adnexal Diseases* / diagnostic imaging
  • Adnexal Diseases* / surgery
  • Algorithms
  • Female
  • Humans
  • Machine Learning
  • Ovarian Torsion
  • Pregnancy
  • Retrospective Studies