A Natural Language Interface for Dissemination of Reproducible Biomedical Data Science

Med Image Comput Comput Assist Interv. 2018 Sep:11073:197-205. doi: 10.1007/978-3-030-00937-3_23. Epub 2018 Sep 13.

Abstract

Computational tools in the form of software packages are burgeoning in the field of medical imaging and biomedical research. These tools enable biomedical researchers to analyze a variety of data using modern machine learning and statistical analysis techniques. While these publicly available software packages are a great step towards a multiplicative increase in the biomedical research productivity, there are still many open issues related to validation and reproducibility of the results. A key gap is that while scientists can validate domain insights that are implicit in the analysis, the analysis itself is coded in a programming language and that domain scientist may not be a programmer. Thus, there is no/limited direct validation of the program that carries out the desired analysis. We propose a novel solution, building upon recent successes in natural language understanding, to address this problem. Our platform allows researchers to perform, share, reproduce and interpret the analysis pipelines and results via natural language. While this approach still requires users to have a conceptual understanding of the techniques, it removes the burden of programming syntax and thus lowers the barriers to advanced and reproducible neuroimaging and biomedical research.

Keywords: Natural language user interface; Neuro/medical·image analysis; Provenance tracking; Reproducibility; Surgical data science; Systems.