Objective: Early diagnosis of laryngeal cancer (LC) is crucial, particularly in rural areas. Despite existing studies on deep learning models for LC identification, challenges remain in selecting suitable models for rural areas with shortages of laryngologists and limited computer resources. We present the intelligent laryngeal cancer detection system (ILCDS), a deep learning-based solution tailored for effective LC screening in resource-constrained rural areas.
Methods: We compiled a dataset comprised of 2023 laryngoscopic images and applied data augmentation techniques for dataset expansion. Subsequently, we utilized eight deep learning models-AlexNet, VGG, ResNet, DenseNet, MobileNet, ShuffleNet, Vision Transformer, and Swin Transformer-for LC identification. A comprehensive evaluation of their performances and efficiencies was conducted, and the most suitable model was selected to assemble the ILCDS.
Results: Regarding performance, all models attained an average accuracy exceeding 90 % on the test set. Particularly noteworthy are VGG, DenseNet, and MobileNet, which exceeded an accuracy of 95 %, with scores of 95.32 %, 95.75 %, and 95.99 %, respectively. Regarding efficiency, MobileNet excels owing to its compact size and fast inference speed, making it an ideal model for integration into ILCDS.
Conclusion: The ILCDS demonstrated promising accuracy in LC detection while maintaining modest computational resource requirements, indicating its potential to enhance LC screening accuracy and alleviate the workload on otolaryngologists in rural areas.
Keywords: Deep learning model; Laryngeal cancer; Laryngoscopy; MobileNet; Rural areas.
Copyright © 2024 Elsevier Inc. All rights reserved.