Screening for extranodal extension in HPV-associated oropharyngeal carcinoma: evaluation of a CT-based deep learning algorithm in patient data from a multicentre, randomised de-escalation trial

Benjamin H Kann; Jirapat Likitlersuang; Dennis Bontempi; Zezhong Ye; Sanjay Aneja; Richard Bakst; Hillary R Kelly; Amy F Juliano; Sam Payabvash; Jeffrey P Guenette; Ravindra Uppaluri; Danielle N Margalit; Jonathan D Schoenfeld; Roy B Tishler; Robert Haddad; Hugo J W L Aerts; Joaquin J Garcia; Yael Flamand; Rathan M Subramaniam; Barbara A Burtness; Robert L Ferris

doi:10.1016/S2589-7500(23)00046-8

Screening for extranodal extension in HPV-associated oropharyngeal carcinoma: evaluation of a CT-based deep learning algorithm in patient data from a multicentre, randomised de-escalation trial

Lancet Digit Health. 2023 Jun;5(6):e360-e369. doi: 10.1016/S2589-7500(23)00046-8. Epub 2023 Apr 21.

Authors

Benjamin H Kann¹, Jirapat Likitlersuang², Dennis Bontempi², Zezhong Ye², Sanjay Aneja³, Richard Bakst⁴, Hillary R Kelly⁵, Amy F Juliano⁵, Sam Payabvash⁶, Jeffrey P Guenette⁷, Ravindra Uppaluri⁷, Danielle N Margalit⁷, Jonathan D Schoenfeld⁷, Roy B Tishler⁷, Robert Haddad⁷, Hugo J W L Aerts⁸, Joaquin J Garcia⁹, Yael Flamand¹⁰, Rathan M Subramaniam¹¹, Barbara A Burtness¹², Robert L Ferris¹³

Affiliations

¹ Department of Radiation Oncology, Harvard Medical School, Boston, MA, USA; Mass General Brigham Artificial Intelligence in Medicine Program, Boston, MA, USA. Electronic address: [email protected].
² Dana-Farber Cancer Institute/Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Mass General Brigham Artificial Intelligence in Medicine Program, Boston, MA, USA.
³ Department of Therapeutic Radiology, New Haven, CT, USA.
⁴ Icahn School of Medicine at Mount Sinai, New York, NY, USA.
⁵ Mass Eye and Ear, Mass General Hospital, Boston, MA, USA.
⁶ Department of Radiology, New Haven, CT, USA.
⁷ Dana-Farber Cancer Institute/Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
⁸ Dana-Farber Cancer Institute/Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Mass General Brigham Artificial Intelligence in Medicine Program, Boston, MA, USA; Department of Radiology, Maastricht University, Maastricht, Netherlands.
⁹ Department of Pathology, Mayo Clinic, Rochester, MN, USA.
¹⁰ Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, ECOG-ACRIN Biostatistics Center, Boston, MA, USA.
¹¹ Department of Radiology and Nuclear Medicine, University of Notre Dame Australia, Sydney, NSW, Australia; Department of Radiology, Duke University, Durham, NC, USA.
¹² Yale School of Medicine, New Haven, CT, USA.
¹³ Department of Otolaryngology, University of Pittsburgh Cancer Institute, Pittsburgh, PA, USA.

Abstract

Background: Pretreatment identification of pathological extranodal extension (ENE) would guide therapy de-escalation strategies for in human papillomavirus (HPV)-associated oropharyngeal carcinoma but is diagnostically challenging. ECOG-ACRIN Cancer Research Group E3311 was a multicentre trial wherein patients with HPV-associated oropharyngeal carcinoma were treated surgically and assigned to a pathological risk-based adjuvant strategy of observation, radiation, or concurrent chemoradiation. Despite protocol exclusion of patients with overt radiographic ENE, more than 30% had pathological ENE and required postoperative chemoradiation. We aimed to evaluate a CT-based deep learning algorithm for prediction of ENE in E3311, a diagnostically challenging cohort wherein algorithm use would be impactful in guiding decision-making.

Methods: For this retrospective evaluation of deep learning algorithm performance, we obtained pretreatment CTs and corresponding surgical pathology reports from the multicentre, randomised de-escalation trial E3311. All enrolled patients on E3311 required pretreatment and diagnostic head and neck imaging; patients with radiographically overt ENE were excluded per study protocol. The lymph node with largest short-axis diameter and up to two additional nodes were segmented on each scan and annotated for ENE per pathology reports. Deep learning algorithm performance for ENE prediction was compared with four board-certified head and neck radiologists. The primary endpoint was the area under the curve (AUC) of the receiver operating characteristic.

Findings: From 178 collected scans, 313 nodes were annotated: 71 (23%) with ENE in general, 39 (13%) with ENE larger than 1 mm ENE. The deep learning algorithm AUC for ENE classification was 0·86 (95% CI 0·82-0·90), outperforming all readers (p<0·0001 for each). Among radiologists, there was high variability in specificity (43-86%) and sensitivity (45-96%) with poor inter-reader agreement (κ 0·32). Matching the algorithm specificity to that of the reader with highest AUC (R2, false positive rate 22%) yielded improved sensitivity to 75% (+ 13%). Setting the algorithm false positive rate to 30% yielded 90% sensitivity. The algorithm showed improved performance compared with radiologists for ENE larger than 1 mm (p<0·0001) and in nodes with short-axis diameter 1 cm or larger.

Interpretation: The deep learning algorithm outperformed experts in predicting pathological ENE on a challenging cohort of patients with HPV-associated oropharyngeal carcinoma from a randomised clinical trial. Deep learning algorithms should be evaluated prospectively as a treatment selection tool.

Funding: ECOG-ACRIN Cancer Research Group and the National Cancer Institute of the US National Institutes of Health.

Publication types

Randomized Controlled Trial
Multicenter Study
Research Support, Non-U.S. Gov't
Research Support, N.I.H., Extramural

MeSH terms

Algorithms
Carcinoma* / complications
Deep Learning*
Extranodal Extension
Human Papillomavirus Viruses
Humans
Oropharyngeal Neoplasms* / diagnostic imaging
Oropharyngeal Neoplasms* / pathology
Papillomavirus Infections* / complications
Papillomavirus Infections* / diagnostic imaging
Retrospective Studies
Tomography, X-Ray Computed

Abstract

Publication types

MeSH terms

Grants and funding