Autism screening: an unsupervised machine learning approach

Fadi Thabtah; Robinson Spencer; Neda Abdelhamid; Firuz Kamalov; Carl Wentzel; Yongsheng Ye; Thanu Dayara

doi:10.1007/s13755-022-00191-x

Health Inf Sci Syst. 2022 Sep 8;10(1):26. doi: 10.1007/s13755-022-00191-x. eCollection 2022 Dec.

Authors

Fadi Thabtah¹, Robinson Spencer², Neda Abdelhamid³, Firuz Kamalov⁴, Carl Wentzel², Yongsheng Ye², Thanu Dayara²

Affiliations

¹ ASDTests, Auckland, New Zealand.
² Digital Technologies, Manukau Institute of Technology, Auckland, New Zealand.
³ Abu Dhabi School of Management, Abu Dhabi, UAE.
⁴ Canadian University Dubai, Dubai, UAE.

Abstract

Early screening for autism spectrum disorders (ASD) is a key area of research in healthcare. Currently, artificial intelligence (AI)-driven approaches are used to improve autism diagnosis through computer-aided diagnosis (CAD) systems. One issue with autism diagnosis and screening data is that predictions rely primarily on the scores produced by medical screening methods, which can be biased depending on how the scores are calculated. We attempt to reduce this bias by assessing the predictive performance of the screening process using a new model that combines a Self-Organizing Map (SOM) with classification algorithms. The SOM is applied before the diagnostic process to derive a new class label from clusters learnt from the independent features; these clusters relate to communication, repetitive traits, and social traits in the input dataset. The new clusters are then compared with the existing class labels in the dataset to refine the data and eliminate any inconsistencies. Lastly, the refined dataset is used to derive classification systems for autism diagnosis. The new model was evaluated on a real-life autism screening dataset of over 2000 instances of cases and controls. The results show that classification models derived from the refined dataset achieve significantly higher accuracy, precision, and recall than models derived from the original dataset.
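
The cluster-then-refine idea described above can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the one-dimensional two-unit SOM, the toy three-feature rows, the hyperparameters, and the helper names `train_som` and `refine_labels` are all assumptions made for illustration. In particular, the abstract does not say how inconsistencies are eliminated; this sketch simply drops instances whose original label disagrees with the majority label of their cluster.

```python
import math
import random
from collections import Counter


def best_matching_unit(weights, x):
    """Return the index of the SOM unit whose weight vector is closest to x."""
    return min(range(len(weights)), key=lambda j: math.dist(weights[j], x))


def train_som(data, n_units=2, epochs=50, lr0=0.5, seed=0):
    """Train a tiny 1-D SOM (a line of n_units) on unlabeled feature rows."""
    rng = random.Random(seed)
    # Assumption: initialise the units from randomly chosen data rows.
    weights = [list(x) for x in rng.sample(data, n_units)]
    sigma0 = max(n_units / 2.0, 1.0)
    for epoch in range(epochs):
        decay = 1.0 - epoch / epochs           # linear decay schedule
        lr = lr0 * decay                       # learning rate
        sigma = max(sigma0 * decay, 0.01)      # neighbourhood radius
        for x in rng.sample(data, len(data)):  # one shuffled pass over the data
            bmu = best_matching_unit(weights, x)
            for j, w in enumerate(weights):
                # Gaussian neighbourhood: units near the BMU move more.
                h = math.exp(-((j - bmu) ** 2) / (2 * sigma ** 2))
                for k in range(len(w)):
                    w[k] += lr * h * (x[k] - w[k])
    return weights


def refine_labels(data, labels, weights):
    """Keep only rows whose label matches the majority label of their cluster."""
    clusters = [best_matching_unit(weights, x) for x in data]
    majority = {}
    for c in set(clusters):
        votes = Counter(l for l, k in zip(labels, clusters) if k == c)
        majority[c] = votes.most_common(1)[0][0]
    kept = [i for i, (l, c) in enumerate(zip(labels, clusters))
            if l == majority[c]]
    return kept, clusters


# Hypothetical toy data: columns stand for communication, repetitive and
# social trait indicators; label 1 = case, 0 = control.
features = [[1, 1, 1] for _ in range(4)] + [[0, 0, 0] for _ in range(4)]
labels = [1, 1, 1, 1, 0, 0, 0, 0]
# One row with case-like traits but a control label (an "inconsistency").
features.append([1, 1, 1])
labels.append(0)

weights = train_som(features)
kept, clusters = refine_labels(features, labels, weights)
refined_features = [features[i] for i in kept]
refined_labels = [labels[i] for i in kept]
print(kept)
```

On this toy data the ninth row falls in the same cluster as the four cases, so its control label is outvoted and the row is dropped. The refined rows and labels could then be passed to any classification algorithm, a step this sketch leaves out.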

Keywords: Autism spectrum disorders; Classification; Clustering; Machine learning; Self-organising Map.

© The Author(s), under exclusive licence to Springer Nature Switzerland AG 2022, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.