Predicting the risk of asthma attacks in children, adolescents and adults: protocol for a machine learning algorithm derived from a primary care-based retrospective cohort

Zain Hussain; Syed Ahmar Shah; Mome Mukherjee; Aziz Sheikh

doi:10.1136/bmjopen-2019-036099

BMJ Open. 2020 Jul 23;10(7):e036099. doi: 10.1136/bmjopen-2019-036099.

Authors

Zain Hussain¹, Syed Ahmar Shah²,³, Mome Mukherjee¹,³, Aziz Sheikh¹,³,⁴

Affiliations

¹ Usher Institute, Edinburgh Medical School, The University of Edinburgh, Edinburgh, UK.
² Usher Institute, Edinburgh Medical School, The University of Edinburgh, Edinburgh, UK.
³ Asthma UK Centre for Applied Research (AUKCAR), The University of Edinburgh, Edinburgh, UK.
⁴ Division of Community Health Sciences, The University of Edinburgh, Edinburgh, UK.

Abstract

Introduction: Most asthma attacks and subsequent deaths are potentially preventable. We aim to develop a prognostic tool for identifying patients at high risk of asthma attacks in primary care by leveraging advances in machine learning.

Methods and analysis: Current prognostic tools use logistic regression to develop a risk scoring model for asthma attacks. We propose to build on this by systematically applying various well-known machine learning techniques to a large, longitudinal, deidentified primary care database, the Optimum Patient Care Research Database (OPCRD), and comparing their performance against the existing logistic regression model and against each other. The predictive ability of machine learning algorithms varies with the dataset and the analytical approach employed. We will undertake feature selection, classification (using both one-class and two-class classifiers) and performance evaluation. Patients aged 8-80 years with actively treated, clinician-diagnosed asthma and 3 years of continuous data, from 2016 to 2018, will be selected. Risk factors will be obtained from the first year, while the following 2 years will form the outcome period, in which the primary endpoint will be the occurrence of an asthma attack.
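The cohort design described above (a 1-year baseline window for risk factors followed by a 2-year outcome window) can be illustrated with a minimal Python sketch. Everything beyond the dates and the age range is an assumption made for illustration: the `Patient` record layout is hypothetical and is not the OPCRD schema, age is assumed to be measured at the start of the baseline year, the count of baseline attacks is only a stand-in for the risk factors the protocol would actually extract, and the "actively treated, clinician-diagnosed" criterion is not modelled.

```python
from dataclasses import dataclass, field
from datetime import date

# Study windows from the abstract: 3 years of continuous data, 2016 to 2018.
BASELINE_START = date(2016, 1, 1)   # first year: risk factors
OUTCOME_START = date(2017, 1, 1)    # next 2 years: outcome period
STUDY_END = date(2018, 12, 31)


@dataclass
class Patient:
    # Hypothetical record layout for illustration only.
    patient_id: str
    birth_date: date
    registered_from: date
    registered_to: date
    attack_dates: list = field(default_factory=list)


def age_on(birth_date, when):
    """Age in whole years on a given date."""
    had_birthday = (when.month, when.day) >= (birth_date.month, birth_date.day)
    return when.year - birth_date.year - (0 if had_birthday else 1)


def is_eligible(p):
    """Aged 8-80 (assumed: at baseline start) with continuous data across 2016-2018."""
    continuous = p.registered_from <= BASELINE_START and p.registered_to >= STUDY_END
    return continuous and 8 <= age_on(p.birth_date, BASELINE_START) <= 80


def build_row(p):
    """Derive one illustrative baseline feature and the binary outcome label."""
    baseline_attacks = sum(BASELINE_START <= d < OUTCOME_START for d in p.attack_dates)
    attack_in_outcome = any(OUTCOME_START <= d <= STUDY_END for d in p.attack_dates)
    return {
        "patient_id": p.patient_id,
        "baseline_attacks": baseline_attacks,
        "attack_in_outcome": int(attack_in_outcome),
    }


def build_dataset(patients):
    """Keep eligible patients only and return one row per patient."""
    return [build_row(p) for p in patients if is_eligible(p)]
```

Keeping the baseline and outcome windows strictly disjoint, as here, means that no information from the outcome period can leak into the features used for prediction.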

Ethics and dissemination: We have obtained approval from OPCRD's Anonymous Data Ethics Protocols and Transparency (ADEPT) Committee. We will seek ethics approval from The University of Edinburgh's Research Ethics Group (UREG). We aim to present our findings at scientific conferences and in peer-reviewed journals.

Keywords: asthma; epidemiology; health informatics; public health.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Adolescent
Adult
Aged
Aged, 80 and over
Algorithms
Asthma* / diagnosis
Child
Humans
Machine Learning*
Middle Aged
Primary Health Care
Retrospective Studies
Risk Factors
Young Adult

Grants and funding

MC_PC_19004/MRC_/Medical Research Council/United Kingdom