caTIES: a grid based system for coding and retrieval of surgical pathology reports and tissue specimens in support of translational research

J Am Med Inform Assoc. 2010 May-Jun;17(3):253-64. doi: 10.1136/jamia.2009.002295.

Abstract

The authors report on the development of the Cancer Tissue Information Extraction System (caTIES)--an application that supports collaborative tissue banking and text mining by leveraging existing natural language processing methods and algorithms, grid communication and security frameworks, and query visualization methods. The system fills an important need for text-derived clinical data in translational research such as tissue-banking and clinical trials. The design of caTIES addresses three critical issues for informatics support of translational research: (1) federation of research data sources derived from clinical systems; (2) expressive graphical interfaces for concept-based text mining; and (3) regulatory and security model for supporting multi-center collaborative research. Implementation of the system at several Cancer Centers across the country is creating a potential network of caTIES repositories that could provide millions of de-identified clinical reports to users. The system provides an end-to-end application of medical natural language processing to support multi-institutional translational research programs.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biological Specimen Banks*
  • Computer Graphics
  • Computer Security
  • Data Mining*
  • Humans
  • Information Dissemination*
  • Interinstitutional Relations
  • Multicenter Studies as Topic
  • Natural Language Processing*
  • Neoplasms / pathology*
  • Neoplasms / surgery
  • Translational Research, Biomedical*
  • United States
  • User-Computer Interface