A new approach to a legacy concern: Evaluating machine-learned Bayesian networks to predict childhood lead exposure risk from community water systems

Environ Res. 2022 Mar;204(Pt B):112146. doi: 10.1016/j.envres.2021.112146. Epub 2021 Sep 29.

Abstract

Lead in drinking water continues to put children at risk of irreversible neurological impairment. Understanding drinking water system characteristics that influence blood lead levels is needed to prevent ongoing exposures. This study sought to assess the relationship between children's blood lead levels and drinking water system characteristics using machine-learned Bayesian networks. Blood lead records from 2003 to 2017 for 40,742 children in Wake County, North Carolina were matched with the characteristics of 178 community water systems and sociodemographic characteristics of each child's neighborhood. Bayesian networks were machine-learned to evaluate the drinking water variables associated with blood lead levels ≥2 μg/dL and ≥5 μg/dL. The model was used to predict geographic areas and water utilities with increased lead exposure risk. Drinking water characteristics were not significantly associated with children's blood lead levels ≥5 μg/dL but were important predictors of blood lead levels ≥2 μg/dL. Whether 10% of water samples exceeded 2 ppb of lead in the most recent year prior to the blood test was the most important water system predictor and increased the risk of blood lead levels ≥2 μg/dL by 42%. The model achieved an area under the receiver operating characteristic curve of 0.792 (±0.8%) during ten-fold cross validation, indicating good predictive performance. Water system characteristics may thus be used to predict areas that are at risk of higher blood lead levels. Current drinking water regulatory thresholds for lead may be insufficient to detect the levels in drinking water associated with children's blood lead levels.

Keywords: Bayesian networks; Blood lead levels; Drinking water; Health disparity; Machine learning; Risk assessment.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Bayes Theorem
  • Child
  • Drinking Water*
  • Humans
  • Lead / analysis
  • Lead Poisoning* / epidemiology
  • Water Supply

Substances

  • Drinking Water
  • Lead