Boosting skin cancer diagnosis accuracy with ensemble approach

Sci Rep. 2025 Jan 8;15(1):1290. doi: 10.1038/s41598-024-84864-5.

Abstract

Skin cancer is common and deadly, hence a correct diagnosis at an early age is essential. Effective therapy depends on precise classification of the several skin cancer forms, each with special traits. Because dermoscopy and other sophisticated imaging methods produce detailed lesion images, early detection has been enhanced. It's still difficult to analyze the images to differentiate benign from malignant tumors, though. Better predictive modeling methods are needed since the diagnostic procedures used now frequently produce inaccurate and inconsistent results. In dermatology, Machine learning (ML) models are becoming essential for the automatic detection and classification of skin cancer lesions from image data. With the ensemble model, which mix several ML approaches to take use of their advantages and lessen their disadvantages, this work seeks to improve skin cancer predictions. We introduce a new method, the Max Voting method, for optimization of skin cancer classification. On the HAM10000 and ISIC 2018 datasets, we trained and assessed three distinct ML models: Random Forest (RF), Multi-layer Perceptron Neural Network (MLPN), and Support Vector Machine (SVM). Overall performance was increased by the combined predictions made with the Max Voting technique. Moreover, feature vectors that were optimally produced from image data by a Genetic Algorithm (GA) were given to the ML models. We demonstrate that the Max Voting method greatly improves predictive performance, reaching an accuracy of 94.70% and producing the best results for F1-measure, recall, and precision. The most dependable and robust approach turned out to be Max Voting, which combines the benefits of numerous pre-trained ML models to provide a new and efficient method for classifying skin cancer lesions.

Keywords: Ensemble model; HAM10000 dataset; ISIC 2018 dataset; Machine learning (ML); Max Voting method; Multi-class skin cancer classification; Multi-layer perceptron neural network (MLPN); Random forest (RF); Support vector machine (SVM).

MeSH terms

  • Algorithms
  • Dermoscopy / methods
  • Humans
  • Machine Learning*
  • Neural Networks, Computer
  • Skin Neoplasms* / diagnosis
  • Skin Neoplasms* / pathology
  • Support Vector Machine*