Natural language processing as an alternative to manual reporting of colonoscopy quality metrics

Gottumukkala S Raju; Phillip J Lum; Rebecca S Slack; Selvi Thirumurthi; Patrick M Lynch; Ethan Miller; Brian R Weston; Marta L Davila; Manoop S Bhutani; Mehnaz A Shafi; Robert S Bresalier; Alexander A Dekovich; Jeffrey H Lee; Sushovan Guha; Mala Pande; Boris Blechacz; Asif Rashid; Mark Routbort; Gladis Shuttlesworth; Lopa Mishra; John R Stroehlein; William A Ross

doi:10.1016/j.gie.2015.01.049

Natural language processing as an alternative to manual reporting of colonoscopy quality metrics

Gastrointest Endosc. 2015 Sep;82(3):512-9. doi: 10.1016/j.gie.2015.01.049. Epub 2015 Apr 22.

Authors

Gottumukkala S Raju¹, Phillip J Lum¹, Rebecca S Slack², Selvi Thirumurthi¹, Patrick M Lynch¹, Ethan Miller¹, Brian R Weston¹, Marta L Davila¹, Manoop S Bhutani¹, Mehnaz A Shafi¹, Robert S Bresalier¹, Alexander A Dekovich¹, Jeffrey H Lee¹, Sushovan Guha¹, Mala Pande¹, Boris Blechacz¹, Asif Rashid³, Mark Routbort⁴, Gladis Shuttlesworth¹, Lopa Mishra¹, John R Stroehlein¹, William A Ross¹

Affiliations

¹ Department of Gastroenterology, Hepatology and Nutrition, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA.
² Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA.
³ Department of Pathology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA.
⁴ Department of Hematopathology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA.

Abstract

Background and aims: The adenoma detection rate (ADR) is a quality metric tied to interval colon cancer occurrence. However, manual extraction of data to calculate and track the ADR in clinical practice is labor-intensive. To overcome this difficulty, we developed a natural language processing (NLP) method to identify adenomas and sessile serrated adenomas (SSAs) in patients undergoing their first screening colonoscopy. We compared the NLP-generated results with that of manual data extraction to test the accuracy of NLP and report on colonoscopy quality metrics using NLP.

Methods: Identification of screening colonoscopies using NLP was compared with that using the manual method for 12,748 patients who underwent colonoscopies from July 2010 to February 2013. Also, identification of adenomas and SSAs using NLP was compared with that using the manual method with 2259 matched patient records. Colonoscopy ADRs using these methods were generated for each physician.

Results: NLP correctly identified 91.3% of the screening examinations, whereas the manual method identified 87.8% of them. Both the manual method and NLP correctly identified examinations of patients with adenomas and SSAs in the matched records almost perfectly. Both NLP and the manual method produced comparable values for ADRs for each endoscopist and for the group as a whole.

Conclusions: NLP can correctly identify screening colonoscopies, accurately identify adenomas and SSAs in a pathology database, and provide real-time quality metrics for colonoscopy.

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Adenoma / diagnosis*
Colonic Neoplasms / diagnosis*
Colonoscopy / standards*
Documentation*
Early Detection of Cancer
Electronic Data Processing / methods*
Female
Humans
Male
Natural Language Processing*
Quality Indicators, Health Care*

Abstract

Publication types

MeSH terms

Grants and funding