A Comparative Analysis of Two Machine Learning-Based Diagnostic Patterns with Thyroid Imaging Reporting and Data System for Thyroid Nodules: Diagnostic Performance and Unnecessary Biopsy Rate

Thyroid. 2021 Mar;31(3):470-481. doi: 10.1089/thy.2020.0305. Epub 2020 Sep 9.

Abstract

Background: The risk stratification system of the American College of Radiology Thyroid Imaging Reporting and Data System (ACR TI-RADS) for thyroid nodules is affected by low diagnostic specificity. Machine learning (ML) methods can optimize the diagnostic performance in medical image analysis. However, it is unknown which ML-based diagnostic pattern is more effective in improving diagnostic performance for thyroid nodules and reducing nodule biopsies. Therefore, we compared ML-assisted visual approaches and radiomics approaches with ACR TI-RADS in diagnostic performance and unnecessary fine-needle aspiration biopsy (FNAB) rate for thyroid nodules. Methods: This retrospective study evaluated a data set of ultrasound (US) and shear wave elastography (SWE) images in patients with biopsy-proven thyroid nodules (≥1 cm) from the Shanghai Tenth People's Hospital (743 nodules in 720 patients from September 2017 to January 2019) and an independent test data set from the Ma'anshan People's Hospital (106 nodules in 102 patients from February 2019 to April 2019). Six US features and five SWE parameters from the radiologists' interpretation were used for building the ML-assisted visual approaches. The radiomics features extracted from the US and SWE images were used with ML methods for developing the radiomics approaches. The diagnostic performance for differentiating thyroid nodules and the unnecessary FNAB rate of the ML-assisted visual approaches and the radiomics approaches were compared with ACR TI-RADS. Results: The ML-assisted US visual approach had the best diagnostic performance than the US radiomics approach and ACR TI-RADS (area under the curve [AUC]: 0.900 vs. 0.789 vs. 0.689 for the validation data set, 0.917 vs. 0.770 vs. 0.681 for the test data set). After adding SWE, the ML-assisted visual approach had a better diagnostic performance than US alone (AUC: 0.951 vs. 0.900 for the validation data set, 0.953 vs. 0.917 for the test data set). When applying the ML-assisted US+SWE visual approach, the unnecessary FNAB rate decreased from 30.0% to 4.5% in the validation data set and from 37.7% to 4.7% in the test data set in comparison to ACR TI-RADS. Conclusions: The ML-assisted dual modalities visual approach can assist radiologists to diagnose thyroid nodules more effectively and considerably reduce the unnecessary FNAB rate in the clinical management of thyroid nodules.

Keywords: artificial intelligence; diagnosis; machine learning; shear wave elastography; thyroid nodules; ultrasound.

Publication types

  • Comparative Study
  • Multicenter Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Adult
  • Aged
  • Aged, 80 and over
  • Biopsy, Fine-Needle
  • China
  • Clinical Decision-Making
  • Elasticity Imaging Techniques
  • Female
  • Humans
  • Image Interpretation, Computer-Assisted*
  • Machine Learning*
  • Male
  • Middle Aged
  • Predictive Value of Tests
  • Reproducibility of Results
  • Retrospective Studies
  • Thyroid Neoplasms / diagnostic imaging*
  • Thyroid Neoplasms / pathology
  • Thyroid Nodule / diagnostic imaging*
  • Thyroid Nodule / pathology
  • Tumor Burden
  • Ultrasonography*
  • Unnecessary Procedures*
  • Young Adult