A threshold analysis assessed the credibility of conclusions from network meta-analysis

Deborah M Caldwell; A E Ades; Sofia Dias; Sarah Watkins; Tianjing Li; Nichole Taske; Bhash Naidoo; Nicky J Welton

doi:10.1016/j.jclinepi.2016.07.003

A threshold analysis assessed the credibility of conclusions from network meta-analysis

J Clin Epidemiol. 2016 Dec:80:68-76. doi: 10.1016/j.jclinepi.2016.07.003. Epub 2016 Jul 16.

Authors

Deborah M Caldwell¹, A E Ades², Sofia Dias², Sarah Watkins², Tianjing Li³, Nichole Taske⁴, Bhash Naidoo⁴, Nicky J Welton²

Affiliations

¹ School of Social and Community Medicine, University of Bristol, Canynge Hall, 39 Whatley Road, Bristol BS8 2PS, UK. Electronic address: [email protected].
² School of Social and Community Medicine, University of Bristol, Canynge Hall, 39 Whatley Road, Bristol BS8 2PS, UK.
³ Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, 615 N Wolfe Street, Baltimore, MD 21205, USA.
⁴ Centre for Clinical Practice, National Institute for Health and Care Excellence, 10 Spring Gardens, London SW1A 2BU, UK.

Abstract

Objective: To assess the reliability of treatment recommendations based on network meta-analysis (NMA).

Study design and setting: We consider evidence in an NMA to be potentially biased. Taking each pairwise contrast in turn, we use a structured series of threshold analyses to ask: (1) "How large would the bias in this evidence base have to be before it changed our decision?" and (2) "If the decision changed, what is the new recommendation?" We illustrate the method via two NMAs in which a Grading of Recommendations Assessment, Development and Evaluation (GRADE) assessment for NMAs has been implemented: weight loss and osteoporosis.

Results: Four of the weight-loss NMA estimates were assessed as "low" and six as "moderate" quality by GRADE; for osteoporosis, six were "low," nine were "moderate," and 1 was "high." The threshold analysis suggests plausible bias in 3 of 10 estimates in the weight-loss network could have changed the treatment recommendation. For osteoporosis, plausible bias in 6 of 16 estimates could change the recommendation. There was no relation between plausible bias changing a treatment recommendation and the original GRADE assessments.

Conclusions: Reliability judgments on individual NMA contrasts do not help decision makers understand whether a treatment recommendation is reliable. Threshold analysis reveals whether the final recommendation is robust against plausible degrees of bias in the data.

Keywords: Bias; Comparative effectiveness; GRADE; Health technology assessment; Mixed treatment comparison; Quality assessment; Reliability.

MeSH terms

Bias
Comparative Effectiveness Research / statistics & numerical data
Humans
Network Meta-Analysis*
Osteoporosis / therapy*
Outcome Assessment, Health Care / statistics & numerical data*
Reproducibility of Results
Weight Reduction Programs / statistics & numerical data*

Abstract

MeSH terms

Grants and funding