ROBUST QUANTIFICATION OF PERCENT EMPHYSEMA ON CT VIA DOMAIN ATTENTION: THE MULTI-ETHNIC STUDY OF ATHEROSCLEROSIS (MESA) LUNG STUDY

Proc IEEE Int Symp Biomed Imaging. 2024 May:2024:10.1109/isbi56570.2024.10635299. doi: 10.1109/isbi56570.2024.10635299. Epub 2024 Aug 22.

Abstract

Robust quantification of pulmonary emphysema on computed tomography (CT) remains challenging for large-scale research studies that involve scans from different scanner types and for translation to clinical scans. Although the domain shifts in different CT scanners are subtle compared to shifts existing in other modalities (e.g., MRI) or cross-modality, emphysema is highly sensitive to it. Such subtle difference limits the application of general domain adaptation methods, such as image translation-based methods, as the contrast difference is too subtle to be distinguished. Existing studies have explored several directions to tackle this challenge, including density correction, noise filtering, regression, hidden Markov measure field (HMMF) model-based segmentation, and volume-adjusted lung density. Despite some promising results, previous studies either required a tedious workflow or eliminated opportunities for downstream emphysema subtyping, limiting efficient adaptation on a large-scale study. To alleviate this dilemma, we developed an end-to-end deep learning framework based on an existing HMMF segmentation framework. We first demonstrate that a regular UNet cannot replicate the existing HMMF results because of the lack of scanner priors. We then design a novel domain attention block, a simple yet efficient cross-modal block to fuse image visual features with quantitative scanner priors (a sequence), which significantly improves the results.

Keywords: deep learning; multi-modal learning; pulmonary emphysema; segmentation.