Automatic Processing of Anatomic Pathology Reports in the Italian Language to Enhance the Reuse of Clinical Data

Natalia Viani; Lorenzo Chiudinelli; Cristina Tasca; Alberto Zambelli; Mauro Bucalo; Arianna Ghirardi; Nicola Barbarini; Eleonora Sfreddo; Lucia Sacchi; Carlo Tondini; Riccardo Bellazzi

Automatic Processing of Anatomic Pathology Reports in the Italian Language to Enhance the Reuse of Clinical Data

Stud Health Technol Inform. 2018:247:715-719.

Affiliations

¹ Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy.
² ASST Papa Giovanni XXIII Hospital, Bergamo, Italy.
³ BIOMERIS, Pavia, Italy.

PMID: 29678054

Abstract

Medical reports often contain a lot of relevant information in the form of free text. To reuse these unstructured texts for biomedical research, it is important to extract structured data from them. In this work, we adapted a previously developed information extraction system to the oncology domain, to process a set of anatomic pathology reports in the Italian language. The information extraction system relies on a domain ontology, which was adapted and refined in an iterative way. The final output was evaluated by a domain expert, with promising results.

Keywords: information extraction; text mining.

MeSH terms

Biomedical Research
Data Mining
Humans
Information Storage and Retrieval*
Italy
Language*
Natural Language Processing*