Large multimodal model-based standardisation of pathology reports with confidence and its prognostic significance

Ethar Alzaid; Gabriele Pergola; Harriet Evans; David Snead; Fayyaz Minhas

doi:10.1002/2056-4538.70010

J Pathol Clin Res. 2024 Nov;10(6):e70010. doi: 10.1002/2056-4538.70010.

Authors

Ethar Alzaid¹, Gabriele Pergola¹, Harriet Evans²,³, David Snead¹,²,³, Fayyaz Minhas¹

Affiliations

¹ Department of Computer Science, University of Warwick, Coventry, UK.
² Histopathology Department, University Hospitals Coventry and Warwickshire NHS Trust, Coventry, UK.
³ Warwick Medical School, University of Warwick, Coventry, UK.

Abstract

Despite the existence of established standards and guidelines for pathology reporting, many pathology reports are still written in unstructured free text. Extracting information from these reports and formatting it according to a standard is crucial for consistent interpretation. Automated information extraction from unstructured pathology reports is challenging, as it requires accurate interpretation of medical terminology and context-dependent details. In this work, we present a practical approach for automatically extracting information from unstructured pathology reports or scanned paper reports using a large multimodal model. The framework uses context-aware prompting strategies to extract the values of individual fields, such as grade and size, from pathology reports. A unique feature of the proposed approach is that it assigns a confidence value indicating the correctness of the model's extraction for each field and generates a structured report, in line with national pathology guidelines, in both human-readable and machine-readable formats. We have analysed the extraction performance in terms of accuracy and kappa scores, as well as the quality of the confidence scores assigned by the model. We have also evaluated the prognostic value of the extracted fields and of feature embeddings of the raw text. Results showed that the model can extract information with an accuracy of up to 0.99 and a kappa score of up to 0.98. Our results indicate that confidence scores are an effective indicator of the correctness of the extracted information, achieving an area under the receiver operating characteristic curve of up to 0.93, thus enabling automatic flagging of extraction errors. Our analysis further reveals that, as expected, information extracted from pathology reports is highly prognostically relevant. The framework demo is available at: https://labieb.dcs.warwick.ac.uk/.
The code for the proposed approach, together with the information it extracted from pathology reports of colorectal cancer cases in The Cancer Genome Atlas, is available at: https://github.com/EtharZaid/Labieb.
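
The evaluation described in the abstract (accuracy, kappa, and the area under the receiver operating characteristic curve of the confidence scores) can be illustrated with a minimal sketch. This is not the authors' code: the function names, the toy labels, the confidence values, and the 0.7 flagging threshold are all assumptions made for illustration, and the kappa shown is the unweighted Cohen's kappa, which may differ from the variant used in the paper.

```python
from collections import Counter

def accuracy(truth, pred):
    """Fraction of fields where the extracted value matches the reference."""
    return sum(t == p for t, p in zip(truth, pred)) / len(truth)

def cohen_kappa(truth, pred):
    """Unweighted Cohen's kappa: agreement corrected for chance."""
    n = len(truth)
    p_observed = accuracy(truth, pred)
    truth_counts, pred_counts = Counter(truth), Counter(pred)
    # Chance agreement from the marginal label frequencies.
    p_expected = sum(truth_counts[k] * pred_counts[k] for k in truth_counts) / (n * n)
    if p_expected == 1:
        return 1.0  # both sides use a single identical label
    return (p_observed - p_expected) / (1 - p_expected)

def auroc(is_correct, confidence):
    """AUROC of confidence as a predictor of correctness.

    Computed as the probability that a correct extraction receives a
    higher confidence than an incorrect one (ties count as one half).
    """
    pos = [c for ok, c in zip(is_correct, confidence) if ok]
    neg = [c for ok, c in zip(is_correct, confidence) if not ok]
    if not pos or not neg:
        raise ValueError("AUROC needs both correct and incorrect extractions")
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical toy data for one field (tumour grade) across six reports.
truth = ["G2", "G2", "G3", "G1", "G2", "G3"]
pred = ["G2", "G2", "G3", "G2", "G2", "G1"]
confidence = [0.95, 0.90, 0.85, 0.40, 0.80, 0.60]

is_correct = [t == p for t, p in zip(truth, pred)]
acc = accuracy(truth, pred)
kappa = cohen_kappa(truth, pred)
auc = auroc(is_correct, confidence)

# Flag low-confidence extractions for manual review (threshold is an assumption).
THRESHOLD = 0.7
flagged = [i for i, c in enumerate(confidence) if c < THRESHOLD]

print(f"accuracy={acc:.3f} kappa={kappa:.3f} auroc={auc:.3f} flagged={flagged}")
```

In this toy case the two wrong extractions also carry the lowest confidence, so thresholding the confidence recovers exactly the reports that need review; on real data the separation would be imperfect, which is what an AUROC below 1 reflects.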

Keywords: GPT‐4; LLM; LMM; information extraction; large language models; large multimodal model; pathology reports; report standardisation.

MeSH terms

Humans
Medical Informatics*
Pathology* / standards