Advancing thyroid care: An accurate trustworthy diagnostics system with interpretable AI and hybrid machine learning techniques

Ananda Sutradhar; Sharmin Akter; F M Javed Mehedi Shamrat; Pronab Ghosh; Xujuan Zhou; Mohd Yamani Idna Bin Idris; Kawsar Ahmed; Mohammad Ali Moni

doi:10.1016/j.heliyon.2024.e36556

Advancing thyroid care: An accurate trustworthy diagnostics system with interpretable AI and hybrid machine learning techniques

Heliyon. 2024 Aug 20;10(17):e36556. doi: 10.1016/j.heliyon.2024.e36556. eCollection 2024 Sep 15.

Authors

Ananda Sutradhar¹, Sharmin Akter¹, F M Javed Mehedi Shamrat², Pronab Ghosh³, Xujuan Zhou⁴, Mohd Yamani Idna Bin Idris², Kawsar Ahmed^{5

6

7}, Mohammad Ali Moni⁸

Affiliations

¹ Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh.
² Department of Computer System and Technology, Universiti Malaya, Kuala Lumpur, 50603, Malaysia.
³ Department of Computer Science, Lakehead University, 955 Oliver Rd, Thunder Bay, ON, P7B 5E1, Canada.
⁴ School of Business, University of Southern Queensland, Springfield, Australia.
⁵ Department of Electrical and Computer Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK, S7N 5A9, Canada.
⁶ Health Informatics Research Lab, Department of Computer Science and Engineering, Daffodil International University, Daffodil Smart City, Birulia, Dhaka, 1216, Bangladesh.
⁷ Group of Bio-photomatiχ, Department of Information and Communication Technology, Mawlana Bhashani Science and Technology University, Santosh, Tangail, 1902, Bangladesh.
⁸ School of Health and Rehabilitation Sciences, Faculty of Health and Behavioural Sciences, The University of Queensland, St Lucia, QLD, 4072, Australia.

Abstract

The worldwide prevalence of thyroid disease is on the rise, representing a chronic condition that significantly impacts global mortality rates. Machine learning (ML) approaches have demonstrated potential superiority in mitigating the occurrence of this disease by facilitating early detection and treatment. However, there is a growing demand among stakeholders and patients for reliable and credible explanations of the generated predictions in sensitive medical domains. Hence, we propose an interpretable thyroid classification model to illustrate outcome explanations and investigate the contribution of predictive features by utilizing explainable AI. Two real-time thyroid datasets underwent various preprocessing approaches, addressing data imbalance issues using the Synthetic Minority Over-sampling Technique with Edited Nearest Neighbors (SMOTE-ENN). Subsequently, two hybrid classifiers, namely RDKVT and RDKST, were introduced to train the processed and selected features from Univariate and Information Gain feature selection techniques. Following the training phase, the Shapley Additive Explanation (SHAP) was applied to identify the influential characteristics and corresponding values contributing to the outcomes. The conducted experiments ultimately concluded that the presented RDKST classifier achieved the highest performance, demonstrating an accuracy of 98.98 % when trained on Information Gain selected features. Notably, the features T3 (triiodothyronine), TT4 (total thyroxine), TSH (thyroid-stimulating hormone), FTI (free thyroxine index), and T3_measured significantly influenced the generated outcomes. By balancing classification accuracy and outcome explanation ability, this study aims to enhance the clinical decision-making process and improve patient care.

Keywords: And explainable AI; Ensemble methods; Machine learning; SMOTE-ENN; Thyroid disease.