This study develops a method combining a convolutional neural network model, INSIGHT, with a self-attention model, WiseMSI, to predict microsatellite instability (MSI) based on the tiles in colorectal cancer patients from a multicenter Chinese cohort. After INSIGHT differentiates tumor tiles from normal tissue tiles in a whole slide image, features of tumor tiles are extracted with a ResNet model pre-trained on ImageNet. Attention-based pooling is adopted to aggregate tile-level features into slide-level representation. INSIGHT has an area under the curve (AUC) of 0.985 for tumor patch classification. The Spearman correlation coefficient of tumor cell fraction given by expert pathologist and INSIGHT is 0.7909. WiseMSI achieves a specificity of 94.7% (95% confidence interval [CI] 93.7%-95.7%), a sensitivity of 84.7% (95% CI 82.6%-86.9%), and an AUC of 0.954 (95% CI 0.948-0.960). Comparative analysis shows that this method has better performance than the other five classic deep learning methods.
Keywords: colorectal cancer; convoluted neural network; machine learning; microsatellite instability; self-attention; tumor purity; whole slide images.
Copyright © 2023 The Authors. Published by Elsevier Inc. All rights reserved.