A de-identified database of 11,979 verbal autopsy open-ended responses

Gates Open Res. 2018 Apr 17:2:18. doi: 10.12688/gatesopenres.12812.1.

Abstract

As part of the Gates Grand Challenge 13, the Population Health Metrics Research Consortium (PHMRC) collected data to enable the development and validation of methods that measure cause-specific mortality in populations with incomplete or inadequate cause of death coding. This work yielded 11,979 verbal autopsy interviews (VAIs). In each, a field interviewer spoke with an individual familiar with the deceased and their final illness, and used a semi-structured questionnaire to collect information about the symptoms of the deceased in their final illness. The VAI collected demographic characteristics, possible risk factors (such as tobacco use), and other potentially contributing characteristics. It also included the open-ended question, "Could you please summarize, or tell us in your own words, any additional information about the illness and/or death of your loved one?" (open narrative). The VAI data were released in a de-identified format in September 2013 through the Global Health Data Exchange, in files that contain verbal autopsies that were collected at six sites in four countries (India, Mexico, Tanzania, and the Philippines). Due to research interest, we have now created redacted versions of the open narratives from the open-ended question of the questionnaire. We hope that this database will be the source of innovations that increase our knowledge about the causes of ill health and, through this knowledge, produce improvements in health for individuals and populations.

Keywords: Verbal autopsy; cause of death; natural language processing; open data.

Grants and funding

This work was funded by a grant from the Bill and Melinda Gates Foundation through the Grand Challenges in Global Health initiative [OPP37883]. The funders had no role in study design, data collection and analysis, interpretation of data, decision to publish, or preparation of the manuscript. The corresponding author had full access to all data analyzed and had final responsibility for the decision to submit this original research paper for publication.